... | ... | @@ -79,15 +79,16 @@ llvm::cast<UCD::EnumeratedPropertyObject>(UCD::property_object_table[UCD:sc])->G |
|
|
```
## Unicode Property Resolution and Compilation
When a regular expression `Name` object refers to a Unicode character property, that name can be resolved into its corresponding `UnicodeSet` using the operations defined in `re/unicode/resolve_properties.h`.
|
|
|
|
|
|
Direct compilation of Unicode properties can be achieved with the `UnicodePropertyKernelBuilder` in `include/kernel/unicode/UCD_property_kernel.h`, which constructs property streams for any of the defined Unicode properties. An example of its use for counting
|
|
|
occurrences of a property within a file may be found in the [ucount](https://cs-git-research.cs.sfu.ca/cameron/parabix-devel/-/tree/master//tools/wc/ucount.cpp) utility.
## Grapheme Cluster Boundaries
|
|
|
|
... | ... | |