Skip to content

GitLab

  • Projects
  • Groups
  • Snippets
  • Help
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in
P parabix-devel
  • Project overview
    • Project overview
    • Details
    • Activity
    • Releases
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 9
    • Issues 9
    • List
    • Boards
    • Labels
    • Service Desk
    • Milestones
  • Merge requests 2
    • Merge requests 2
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Operations
    • Operations
    • Incidents
    • Environments
  • Packages & Registries
    • Packages & Registries
    • Container Registry
  • Analytics
    • Analytics
    • CI/CD
    • Repository
    • Value Stream
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Members
    • Members
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • cameron
  • parabix-devel
  • Wiki
  • CSVediting

CSVediting · Changes

Page history
Update CSVediting authored Nov 29, 2022 by cameron's avatar cameron
Hide whitespace changes
Inline Side-by-side
Showing with 16 additions and 2 deletions
+16 -2
  • CSVediting.md CSVediting.md +16 -2
  • No files found.
CSVediting.md
View page @ 41f7af4e
......@@ -143,10 +143,24 @@ P->CreateKernelCall<LineSpansKernel>(MatchedLineStarts, MatchedLineEnds, Matched
If FilterByMask was used at this point, you would get the matched rows. To delete
the matched rows, the MatchedLineSpans must be negated (use a Pablo createNot operation).
## Constraining Matches to a Column
Matches can be constrained to a particular column. The general method for
matches within a row can be modified by using a column mask having 1 bits only
within the column and supplying that as input to the regular expression
matching process. This will require a modification to the
options of the ICGrep kernel, to supply this as an option when
calling the RE compiler. When the ICGrep kernel is called,
the mask must then be passed as the second
parameter to the compileRE method of the RE compiler.
```
Marker compileRE(RE * re, Marker initialMarkers);
```
If the marker stream returned by the RE_compiler has a one bit anywhere
within the column, then a match is found.
## Combining Masks
If masks are computed to edit out both a column and a row, these can be combined with a Pablo
createAnd operation and then one FilterByMask can be applied.
Clone repository
  • Bracket Matching
  • CSV Validation
  • CSVediting
  • CSVparsing
  • Character Code Compilers
  • KernelLibrary
  • Pablo
  • ParabixTransform
  • Parallel Deletion
  • Parallel Hashing
  • Performance Testing Script
  • Shuffle Pattern Library
  • StaticCCC
  • String Insertion
  • UCD: Unicode Property Database and Compilers
View All Pages