ACM Home Page
Please provide us with feedback. Feedback
Testing symmetric properties of distributions
Full text PdfPdf (318 KB)
Source
Annual ACM Symposium on Theory of Computing archive
Proceedings of the 40th annual ACM symposium on Theory of computing table of contents
Victoria, British Columbia, Canada
SESSION: 9A table of contents
Pages 383-392  
Year of Publication: 2008
ISBN:978-1-60558-047-0
Author
Paul Valiant  MIT, Cambridge, MA, USA
Sponsors
ACM: Association for Computing Machinery
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 70,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1374376.1374432
What is a DOI?

ABSTRACT

We introduce the notion of a Canonical Tester for a class of properties on distributions, that is, a tester strong and general enough that "a distribution property in the class is testable if and only if the Canonical Tester tests it". We construct a Canonical Tester for the class of symmetric properties of one or two distributions, satisfying a certain weak continuity condition. Analyzing the performance of the Canonical Tester on specific properties resolves several open problems, establishing lower bounds that match known upper bounds: we show that distinguishing between entropy <α or >β on distributions over [n] requires nα/β- o(1) samples, and distinguishing whether a pair of distributions has statistical distance <α or >β requires n1-o(1) samples. Our techniques also resolve a conjecture about a property that our Canonical Tester does not apply to: distinguishing identical distributions from those with statistical distance >β requires Ω(n2/3) samples.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
. Alon, A. Andoni, T. Kaufman, K. Matulef, R. Rubinfeld, and N. Xie. Testing k--wise and Almost k--wise Independence. STOC 2007.
 
2
. Alon, E. Fischer, I. Newman, and A. Shapira. A combinatorial characterization of the testable graph properties: it’s all about regularity. STOC 2006.
 
3
. Batu. Testing Properties of Distributions. Ph.D. thesis, Cornell University, 2001.
 
4
. Batu, S. Dasgupta, R. Kumar, and R. Rubinfeld. The complexity of approximating the entropy. STOC, 2002.
 
5
. Batu, E. Fischer, L. Fortnow, R. Kumar, R. Rubinfeld, and P. White. Testing random variables for independence and identity. FOCS, 2001.
 
6
. Batu, L. Fortnow, R. Rubinfeld, W.D. Smith, and P. White. "Testing that distributions are close". FOCS, 2000.
 
7
. Batu, R. Kumar, and R. Rubinfeld. Sublinear algorithms for testing monotone and unimodal distributions. STOC, 2004.
 
8
. Blum and S. Kannan. Designing Programs that Check Their Work. STOC 1989.
 
9
. Blum, M. Luby, and R. Rubinfeld. Self--testing/correcting with applications to numerical problems. Journal of Computer and System Sciences 47(3):549---595, 1993.
 
10
. Chakrabarti, G. Cormode, and A. McGregor. A Near--Optimal Algorithm for Computing the Entropy of a Stream. SODA 2007.
 
11
. Charikar, S. Chaudhuri, R. Motwani, and V.R. Narasayya. Towards Estimation Error Guarantees for Distinct Values. PODS, 2000.
 
12
. Goldreich, S. Goldwasser, and D. Ron. Property Testing and Its Connection to Learning and Approximation. FOCS 1996.
 
13
. Goldreich and L. Trevisan. Three theorems regarding testing graph properties. Random Struct. Algorithms, 23(1):23--57, 2003.
 
14
. Guha, A. McGregor, and S. Venkatasubramanian. Streaming and Sublinear Approximation of Entropy and Information Distances. SODA 2006.
 
15
. Klinger. The Vandermonde Matrix. The American Mathematical Monthly, 74(5):571--574, 1967.
 
16
. Indyk and A. McGregor. Declaring Independence via the Sketching of Sketches. SODA 2008.
 
17
. Indyk and D. Woodruff. Tight Lower Bounds for the Distinct Elements Problem. FOCS, 2003.
 
18
. Raskhodnikova, D. Ron, A. Shpilka, and A. Smith. Strong Lower Bounds for Approximating Distribution Support Size and the Distinct Elements Problem. FOCS 2007.
 
19
. Roos. On the Rate of Multivariate Poisson Convergence. Journal of Multivariate Analysis 69:120--134, 1999.
 
20
. Rubinfeld and M. Sudan. Robust Characterizations of Polynomials with Applications to Program Testing. SIAM Journal on Computing 25(2):252--271, 1996.
 
21
. Valiant. Testing Symmetric Properties of Distributions. ECCC report TR07--135, 2007.