@@ -154,7 +154,7 @@ In this step, CSV syntax marks are removed from the input, to leave the raw CSV
 4. CR before LF should be deleted to uniformly standardize on LF as line separator.

 Compression is achieved by creating a mask of 1 bits for all character positions that are to be kept.
-In this mask, all character positions to be deleted are marked with 0 bits. Call the mask 'CSV_data_mask`.
+In this mask, all character positions to be deleted are marked with 0 bits. Call the mask `CSV_data_mask`.

 Given this mask, we need to apply FilterByMask to produce two streamsets:
 1. `FilteredBasisBits` is produced by filtering the eight basis bit streams.
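To make the mask-based compression in this hunk concrete, here is a minimal Python sketch. It models each bit stream as a Python integer (bit 0 is the first character position) and covers only rule 4, deleting CR before LF. The helpers `to_basis_bits`, `from_basis_bits`, `cr_before_lf_mask`, and `filter_by_mask` are hypothetical names chosen for illustration; they are not the Parabix `FilterByMask` kernel or its API, and a real implementation would work on SIMD blocks rather than bit-by-bit loops.

```python
from typing import List


def to_basis_bits(data: bytes) -> List[int]:
    """Transpose bytes into eight bit streams; stream k holds bit k of every byte."""
    streams = [0] * 8
    for pos, byte in enumerate(data):
        for k in range(8):
            streams[k] |= ((byte >> k) & 1) << pos
    return streams


def from_basis_bits(streams: List[int], length: int) -> bytes:
    """Inverse of to_basis_bits for the first `length` positions."""
    out = bytearray(length)
    for pos in range(length):
        for k in range(8):
            out[pos] |= ((streams[k] >> pos) & 1) << k
    return bytes(out)


def cr_before_lf_mask(data: bytes) -> int:
    """Build a keep-mask: 1 for positions to keep, 0 for each CR directly before an LF."""
    mask = 0
    for pos, byte in enumerate(data):
        is_cr_before_lf = byte == 0x0D and data[pos + 1 : pos + 2] == b"\n"
        if not is_cr_before_lf:
            mask |= 1 << pos
    return mask


def filter_by_mask(mask: int, stream: int, length: int) -> int:
    """Keep the bits of `stream` where `mask` is 1, packed together in original order."""
    out = 0
    out_pos = 0
    for pos in range(length):
        if (mask >> pos) & 1:
            out |= ((stream >> pos) & 1) << out_pos
            out_pos += 1
    return out


def compress(data: bytes) -> bytes:
    """Apply the keep-mask to all eight basis bit streams and reassemble the bytes."""
    mask = cr_before_lf_mask(data)
    kept = bin(mask).count("1")  # number of surviving character positions
    filtered = [filter_by_mask(mask, s, len(data)) for s in to_basis_bits(data)]
    return from_basis_bits(filtered, kept)
```

Under these assumptions, `compress(b"a,b\r\nc,d\r\n")` drops the two CR bytes and leaves LF as the only line separator. The same `filter_by_mask` helper could be applied to any other stream that has to stay aligned with the filtered data.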