Skip to content

GitLab

  • Projects
  • Groups
  • Snippets
  • Help
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in
P parabix-devel
  • Project overview
    • Project overview
    • Details
    • Activity
    • Releases
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 8
    • Issues 8
    • List
    • Boards
    • Labels
    • Service Desk
    • Milestones
  • Merge requests 2
    • Merge requests 2
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Operations
    • Operations
    • Incidents
    • Environments
  • Packages & Registries
    • Packages & Registries
    • Container Registry
  • Analytics
    • Analytics
    • CI/CD
    • Repository
    • Value Stream
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Members
    • Members
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • cameron
  • parabix-devel
  • Wiki
  • UCD: Unicode Property Database and Compilers

UCD: Unicode Property Database and Compilers · Changes

Page history
Update UCD: Unicode Property Database and Compilers authored May 22, 2024 by cameron's avatar cameron
Show whitespace changes
Inline Side-by-side
Showing with 5 additions and 4 deletions
+5 -4
  • UCD:-Unicode-Property-Database-and-Compilers.md UCD:-Unicode-Property-Database-and-Compilers.md +5 -4
  • No files found.
UCD:-Unicode-Property-Database-and-Compilers.md
View page @ 0467dea6
...@@ -79,15 +79,16 @@ llvm::cast<UCD::EnumeratedPropertyObject>(UCD::property_object_table[UCD:sc])->G ...@@ -79,15 +79,16 @@ llvm::cast<UCD::EnumeratedPropertyObject>(UCD::property_object_table[UCD:sc])->G
``` ```
## Unicode Property Resolution ## Unicode Property Resolution and Compilation
When a regular expression `Name` object refers to a Unicode character property, resolving that name When a regular expression `Name` object refers to a Unicode character property, resolving that name
into its corresponding UnicodeSet may be performed using operations defined in `re/unicode/resolve_properties.h`. into its corresponding UnicodeSet may be performed using operations defined in `re/unicode/resolve_properties.h`.
The `UnicodePropertyKernelBuilder` in `include/kernel/unicode/UCD_property_kernel.h` is useful for constructing property streams Direct compilation of Unicode properties can be achieved by the
`UnicodePropertyKernelBuilder` in `include/kernel/unicode/UCD_property_kernel.h`.
It is useful for constructing property streams
for any of the defined Unicode properties. An example of its use for counting for any of the defined Unicode properties. An example of its use for counting
occurrences of a property within a file may be found in the `ucount` utility, occurrences of a property within a file may be found in the [ucount](https://cs-git-research.cs.sfu.ca/cameron/parabix-devel/-/tree/master//tools/wc/ucount.cpp) utility.
see `tools/wc/ucount.cpp`.
## Grapheme Cluster Boundaries ## Grapheme Cluster Boundaries
......
Clone repository
  • Bracket Matching
  • CSV Validation
  • CSVediting
  • CSVparsing
  • Character Code Compilers
  • KernelLibrary
  • Pablo
  • ParabixTransform
  • Parallel Deletion
  • Parallel Hashing
  • Performance Testing Script
  • Shuffle Pattern Library
  • StaticCCC
  • String Insertion
  • UCD: Unicode Property Database and Compilers
View All Pages