tsv_sample

Command line tool implementing weighted reservoir sampling on delimited data files. Weights are read from a field in the file.

Copyright (c) 2017-2018, eBay Software Foundation
Initially written by Jon Degenhardt
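
The module description names weighted reservoir sampling but the members below are undocumented, so the following is only a minimal sketch of one well-known approach (the Efraimidis-Spirakis "A-Res" scheme), not necessarily what tsv_sample itself does. Each line gets a random key `u^(1/w)`, where `u` is uniform in [0, 1) and `w` is the weight read from a field, and the lines with the largest keys are kept. The names `Entry` and `weightedReservoirSample`, the in-memory `string[]` input, and the choice to skip non-positive weights are all assumptions made for illustration.

```d
import std.algorithm.sorting : sort;
import std.array : split;
import std.conv : to;
import std.math : pow;
import std.random : Random, uniform01;

/// A kept line together with its random key (hypothetical helper type).
struct Entry
{
    double key;
    string line;
}

/// Returns up to `sampleSize` lines, favoring lines with larger weights.
/// `weightField` is the zero-based index of the field holding the weight.
/// Hypothetical helper, not part of tsv_sample.
string[] weightedReservoirSample(string[] lines, size_t sampleSize,
                                 size_t weightField, char delim, ref Random rng)
{
    string[] result;
    if (sampleSize == 0) return result;

    Entry[] reservoir;
    foreach (line; lines)
    {
        auto fields = line.split(delim);
        double weight = fields[weightField].to!double;

        // Assumption: lines with non-positive weight are never selected.
        if (weight <= 0) continue;

        // A-Res key: larger weights push the key toward 1.
        double key = pow(uniform01(rng), 1.0 / weight);

        if (reservoir.length < sampleSize)
        {
            reservoir ~= Entry(key, line);
            continue;
        }

        // Linear scan for the smallest key; a heap would scale better.
        size_t minIdx = 0;
        foreach (i, e; reservoir)
            if (e.key < reservoir[minIdx].key) minIdx = i;

        // Replace the weakest entry only when the new key beats it.
        if (key > reservoir[minIdx].key)
            reservoir[minIdx] = Entry(key, line);
    }

    // Emit the kept lines in descending key order.
    sort!((a, b) => a.key > b.key)(reservoir);
    foreach (e; reservoir) result ~= e.line;
    return result;
}
```

Calling it with a seeded generator, for example `auto rng = Random(42);` followed by `weightedReservoirSample(lines, 2, 1, '\t', rng)`, keeps two lines using the second tab-separated field as the weight. The linear scan keeps the sketch short; for large sample sizes a min-heap keyed on `key` is the usual choice.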

Members

Functions

distinctSampling
void distinctSampling(TsvSampleOptions cmdopt, OutputRange outputStream)
Undocumented in source. Be warned that the author may not have intended to support it.
getFieldValue
T getFieldValue(C[] line, size_t fieldIndex, C delim, string filename, size_t lineNum)
Undocumented in source. Be warned that the author may not have intended to support it.
main
int main(string[] cmdArgs)
Undocumented in source. Be warned that the author may not have intended to support it.
reservoirSampling
void reservoirSampling(TsvSampleOptions cmdopt, OutputRange outputStream)
Undocumented in source. Be warned that the author may not have intended to support it.
streamSampling
void streamSampling(TsvSampleOptions cmdopt, OutputRange outputStream)
Undocumented in source. Be warned that the author may not have intended to support it.
testTsvSample
void testTsvSample(string[] cmdArgs, string[][] expected)
Undocumented in source. Be warned that the author may not have intended to support it.

Structs

TsvSampleOptions
struct TsvSampleOptions
Undocumented in source.

Variables

helpText
auto helpText;
Undocumented in source.
helpTextVerbose
auto helpTextVerbose;
Undocumented in source.

Meta