TY - GEN
T1 - Improving and extending the testing of distributions for shape-restricted properties
AU - Fischer, Eldar
AU - Lachish, Oded
AU - Vasudev, Yadu
N1 - Publisher Copyright: © Eldar Fischer, Oded Lachish, and Yadu Vasudev.
PY - 2017/3/1
Y1 - 2017/3/1
N2 - Distribution testing deals with what information can be deduced about an unknown distribution over {1,⋯, n}, where the algorithm is only allowed to obtain a relatively small number of independent samples from the distribution. In the extended conditional sampling model, the algorithm is also allowed to obtain samples from the restriction of the original distribution on subsets of {1,⋯, n}. In 2015, Canonne, Diakonikolas, Gouleakis and Rubinfeld unified several previous results, and showed that for any property of distributions satisfying a "decomposability" criterion, there exists an algorithm (in the basic model) that can distinguish with high probability distributions satisfying the property from distributions that are far from it in variation distance. We present here a more efficient yet simpler algorithm for the basic model, as well as very efficient algorithms for the conditional model, which until now was not investigated under the umbrella of decomposable properties. Additionally, we provide an algorithm for the conditional model that handles a much larger class of properties. Our core mechanism is a way of efficiently producing an interval-partition of {1,⋯, n} that satisfies a "fine-grain" quality. We show that with such a partition at hand we can directly move forward with testing individual intervals, instead of first searching for the "correct" partition of {1,⋯, n}.
AB - Distribution testing deals with what information can be deduced about an unknown distribution over {1,⋯, n}, where the algorithm is only allowed to obtain a relatively small number of independent samples from the distribution. In the extended conditional sampling model, the algorithm is also allowed to obtain samples from the restriction of the original distribution on subsets of {1,⋯, n}. In 2015, Canonne, Diakonikolas, Gouleakis and Rubinfeld unified several previous results, and showed that for any property of distributions satisfying a "decomposability" criterion, there exists an algorithm (in the basic model) that can distinguish with high probability distributions satisfying the property from distributions that are far from it in variation distance. We present here a more efficient yet simpler algorithm for the basic model, as well as very efficient algorithms for the conditional model, which until now was not investigated under the umbrella of decomposable properties. Additionally, we provide an algorithm for the conditional model that handles a much larger class of properties. Our core mechanism is a way of efficiently producing an interval-partition of {1,⋯, n} that satisfies a "fine-grain" quality. We show that with such a partition at hand we can directly move forward with testing individual intervals, instead of first searching for the "correct" partition of {1,⋯, n}.
KW - Conditional sampling
KW - Distribution testing
KW - Property testing
KW - Statistics
UR - http://www.scopus.com/inward/record.url?scp=85016206871&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.STACS.2017.31
DO - 10.4230/LIPIcs.STACS.2017.31
M3 - منشور من مؤتمر
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 34th Symposium on Theoretical Aspects of Computer Science, STACS 2017
A2 - Vallee, Brigitte
A2 - Vollmer, Heribert
T2 - 34th Symposium on Theoretical Aspects of Computer Science, STACS 2017
Y2 - 8 March 2017 through 11 March 2017
ER -