Profiler.Column, Profiler.Distribution, Profiler.FunctionalDependency, Profiler.Profile, Profiler.RowCount, Profiler.Statistic, Profiler.Unique

| Constructor and Description |
|---|
| SimpleProfiler() |

| Modifier and Type | Method and Description |
|---|---|
| Profiler.Profile | profile(Iterable<List<Comparable>> rows, List<Profiler.Column> columns, Collection<ImmutableBitSet> initialGroups) Creates a profile of a data set. |
| static double | surprise(double expected, double actual) Returns a measure of how much an actual value differs from expected. |

public Profiler.Profile profile(Iterable<List<Comparable>> rows, List<Profiler.Column> columns, Collection<ImmutableBitSet> initialGroups)

Specified by: profile in interface Profiler

Parameters:
rows - List of rows. Can be iterated over more than once (though perhaps not cheaply)
columns - Column definitions
initialGroups - List of combinations of columns that should be profiled early, because they may be interesting

public static double surprise(double expected, double actual)

Returns a measure of how much an actual value differs from expected, computed as abs(expected - actual) / (expected + actual).
Examples:
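
To illustrate, the formula can be re-implemented standalone and evaluated for a few inputs. This is a minimal sketch of the stated formula, not Calcite's source: the class name SurpriseSketch is hypothetical, the inputs are assumed to be non-negative, and the early return for equal inputs is an assumption of the sketch (it avoids dividing 0 by 0 when both values are 0).

```java
// Standalone sketch; SurpriseSketch is a hypothetical class, not part of Calcite.
final class SurpriseSketch {
  private SurpriseSketch() {}

  /** Computes abs(expected - actual) / (expected + actual) for non-negative inputs. */
  static double surprise(double expected, double actual) {
    // Assumption of this sketch: equal inputs give 0, which also avoids 0/0.
    if (expected == actual) {
      return 0d;
    }
    return Math.abs(expected - actual) / (expected + actual);
  }

  public static void main(String[] args) {
    System.out.println(surprise(5, 5)); // 0.0  (no difference)
    System.out.println(surprise(5, 3)); // 0.25 (2 / 8)
    System.out.println(surprise(5, 0)); // 1.0  (5 / 5)
    System.out.println(surprise(0, 7)); // 1.0  (7 / 7)
  }
}
```

For non-negative inputs the result lies between 0 and 1: it is 0 when the two values agree and reaches 1 when exactly one of them is 0.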

Parameters:
expected - Expected value
actual - Actual value

Copyright © 2012-2022 Apache Software Foundation. All Rights Reserved.