tsv_summarize

Command line tool that reads TSV files and summarizes field values associated with equivalent keys.

Copyright (c) 2016-2018, eBay Software Foundation Initially written by Jon Degenhardt

Members

Classes

CountOperator
class CountOperator

CountOperator counts the number of occurrences of each unique key, or the number of input lines if there is no unique key.

FirstOperator
class FirstOperator

FirstOperator outputs the first value found for the field.

KeySummarizerBase
class KeySummarizerBase(OutputRange)
Undocumented in source.
LastOperator
class LastOperator

LastOperator outputs the last value found for the field.

MadOperator
class MadOperator
Undocumented in source.
MaxOperator
class MaxOperator
Undocumented in source.
MeanOperator
class MeanOperator
Undocumented in source.
MedianOperator
class MedianOperator
Undocumented in source.
MinOperator
class MinOperator
Undocumented in source.
MissingCountOperator
class MissingCountOperator
Undocumented in source.
MissingFieldPolicy
class MissingFieldPolicy
Undocumented in source.
ModeCountOperator
class ModeCountOperator
Undocumented in source.
ModeOperator
class ModeOperator
Undocumented in source.
MultiKeySummarizer
class MultiKeySummarizer(OutputRange)
Undocumented in source.
NoKeySummarizer
class NoKeySummarizer(OutputRange)
Undocumented in source.
NotMissingCountOperator
class NotMissingCountOperator
Undocumented in source.
OneKeySummarizer
class OneKeySummarizer(OutputRange)
Undocumented in source.
QuantileOperator
class QuantileOperator
Undocumented in source.
RangeOperator
class RangeOperator
Undocumented in source.
RetainOperator
class RetainOperator

RetainOperator retains the first occurrence of a field, without changing the header.

SharedFieldValues
class SharedFieldValues
Undocumented in source.
SingleFieldCalculator
class SingleFieldCalculator
Undocumented in source.
SingleFieldOperator
class SingleFieldOperator
Undocumented in source.
StDevOperator
class StDevOperator
Undocumented in source.
SumOperator
class SumOperator
Undocumented in source.
SummarizerBase
class SummarizerBase(OutputRange)
Undocumented in source.
UniqueCountOperator
class UniqueCountOperator
Undocumented in source.
UniqueKeyValuesLists
class UniqueKeyValuesLists
Undocumented in source.
UniqueValuesOperator
class UniqueValuesOperator
Undocumented in source.
ValuesOperator
class ValuesOperator
Undocumented in source.
VarianceOperator
class VarianceOperator
Undocumented in source.
ZeroFieldCalculator
class ZeroFieldCalculator
Undocumented in source.
ZeroFieldOperator
class ZeroFieldOperator
Undocumented in source.

Functions

fieldHeaderFromIndex
string fieldHeaderFromIndex(size_t fieldIndex)
Undocumented in source. Be warned that the author may not have intended to support it.
main
int main(string[] cmdArgs)
Undocumented in source. Be warned that the author may not have intended to support it.
summaryHeaderFromFieldHeader
string summaryHeaderFromFieldHeader(string fieldHeader, string operationName)
Undocumented in source. Be warned that the author may not have intended to support it.
testSingleFieldOperator
void testSingleFieldOperator(char[][][] splitFile, size_t fieldIndex, string headerSuffix, char[][] expectedValues, MissingFieldPolicy missingPolicy)

A helper for SingleFieldOperator unit tests.

testSingleFieldOperatorBase
void testSingleFieldOperatorBase(char[][][] splitFile, size_t fieldIndex, string headerSuffix, char[][] expectedValues, MissingFieldPolicy missingPolicy, T extraOpInitArgs)
Undocumented in source. Be warned that the author may not have intended to support it.
testSummarizer
void testSummarizer(string[] cmdArgs, string[][] file, string[][] expected)
Undocumented in source. Be warned that the author may not have intended to support it.
testZeroFieldOperator
void testZeroFieldOperator(char[][][] splitFile, string defaultHeader, char[][] expectedValues)

A helper for ZeroFieldOperator unit tests.

tsvSummarize
void tsvSummarize(TsvSummarizeOptions cmdopt, string[] inputFiles)

tsvSummarize does the primary work of the tsv-summarize program.

Interfaces

Calculator
interface Calculator
Undocumented in source.
Operator
interface Operator
Undocumented in source.
Summarizer
interface Summarizer(OutputRange)
Undocumented in source.

Structs

SummarizerPrintOptions
struct SummarizerPrintOptions
Undocumented in source.
TsvSummarizeOptions
struct TsvSummarizeOptions

Command line options - Container and processing. The processArgs method is used to process the command line.

Variables

helpText
auto helpText;
Undocumented in source.
helpTextVerbose
auto helpTextVerbose;
Undocumented in source.

Meta