UCD: Unicode Property Database and Compilers · Changes

Update UCD: Unicode Property Database and Compilers authored May 22, 2024 by cameron's avatar cameron
Showing with 5 additions and 4 deletions
UCD:-Unicode-Property-Database-and-Compilers.md
View page @ 0467dea6
@@ -79,15 +79,16 @@ llvm::cast<UCD::EnumeratedPropertyObject>(UCD::property_object_table[UCD:sc])->G
 ```
-## Unicode Property Resolution
+## Unicode Property Resolution and Compilation
 When a regular expression `Name` object refers to a Unicode character property, resolving that name
 into its corresponding UnicodeSet may be performed using operations defined in `re/unicode/resolve_properties.h`.
-The `UnicodePropertyKernelBuilder` in `include/kernel/unicode/UCD_property_kernel.h` is useful for constructing property streams
+Direct compilation of Unicode properties can be achieved by the
+`UnicodePropertyKernelBuilder` in `include/kernel/unicode/UCD_property_kernel.h`.
+It is useful for constructing property streams
 for any of the defined Unicode properties. An example of its use for counting
-occurrences of a property within a file may be found in the `ucount` utility,
-see `tools/wc/ucount.cpp`.
+occurrences of a property within a file may be found in the [ucount](https://cs-git-research.cs.sfu.ca/cameron/parabix-devel/-/tree/master//tools/wc/ucount.cpp) utility.
 ## Grapheme Cluster Boundaries
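The counting task mentioned in the changed text (how many code points in a file carry a given property) can be illustrated with a scalar sketch that does not use Parabix at all. Everything here is an assumption made for illustration: `ToySet`, `contains`, `countMatches` and `kToyGreek` are hypothetical names, not the `UnicodeSet` or `UnicodePropertyKernelBuilder` API, and the two ranges are the Greek and Coptic and Greek Extended blocks used as a rough stand-in rather than real UCD data for any property. The input is assumed to be well-formed UTF-8.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for a resolved property: a sorted list of
// non-overlapping, inclusive code point ranges.
struct CodePointRange { uint32_t lo, hi; };
using ToySet = std::vector<CodePointRange>;

// Toy data only: two Unicode blocks, not the real UCD ranges of a property.
const ToySet kToyGreek = {{0x0370, 0x03FF}, {0x1F00, 0x1FFF}};

// Membership test: binary search for the first range ending at or after cp.
bool contains(const ToySet& set, uint32_t cp) {
    auto it = std::lower_bound(set.begin(), set.end(), cp,
        [](const CodePointRange& r, uint32_t value) { return r.hi < value; });
    return it != set.end() && it->lo <= cp;
}

// Decode well-formed UTF-8 one code point at a time and count members of set.
std::size_t countMatches(const std::string& utf8, const ToySet& set) {
    std::size_t count = 0;
    for (std::size_t i = 0; i < utf8.size();) {
        unsigned char b = static_cast<unsigned char>(utf8[i]);
        uint32_t cp;
        std::size_t len;
        if (b < 0x80)             { cp = b;        len = 1; }
        else if ((b >> 5) == 0x6) { cp = b & 0x1F; len = 2; }
        else if ((b >> 4) == 0xE) { cp = b & 0x0F; len = 3; }
        else                      { cp = b & 0x07; len = 4; }
        // Fold in the low six bits of each continuation byte.
        for (std::size_t k = 1; k < len && i + k < utf8.size(); ++k) {
            cp = (cp << 6) | (static_cast<unsigned char>(utf8[i + k]) & 0x3F);
        }
        i += len;
        if (contains(set, cp)) ++count;
    }
    return count;
}
```

For example, `countMatches("\xCE\xB1\xCE\xB2\xCE\xB3 abc", kToyGreek)` counts the three Greek letters and ignores the ASCII text. This byte-at-a-time loop only shows what is being counted; the `ucount` utility referenced in the diff is the place to look for how the property kernel is actually used.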