## Abstract

In the distributed uniformity testing problem, k servers draw samples from some unknown distribution, and the goal is to determine whether the unknown distribution is uniform or whether it is ϵ-far from uniform, where ϵ is a proximity parameter. Each server decides whether to accept or reject, and these decisions are sent to a referee, who makes a final decision based on the servers' local decisions. Uniformity testing is a particularly useful building-block, because it is complete for the problem of testing identity to any fixed distribution. It was recently shown that distributing the task of uniformity testing allows each server to draw fewer samples than are needed in the centralized case, but so far the number of samples required for distributed uniformity testing has not been well understood. In this paper we settle this question, and also investigate the cost of using local decision rules, such as rejecting iff at least one server wants to reject (the usual decision rule used in local distributed decision). To answer these questions, we develop a new Fourier-based technique for proving lower bounds on the sample complexity of distribution testing, which lends itself particularly well to the distributed case. Using our technique, we tightly characterize the number of samples required for uniformity testing when the referee can apply any decision function to the servers' local decisions. We also show that if the network rejects whenever one server wants to reject, then the cost of uniformity testing is much higher, and in fact we do not gain compared to the centralized case unless the number of servers is exponential in Ω(1/ϵ). Finally, we apply our lower bound technique to the case where the referee applies a threshold decision rule, and also generalize a lower bound from [1] for learning an unknown input distribution.

Original language | English |
---|---|

Title of host publication | PODC 2019 - Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing |

Pages | 228-237 |

Number of pages | 10 |

ISBN (Electronic) | 9781450362177 |

DOIs | |

State | Published - 16 Jul 2019 |

Event | 38th ACM Symposium on Principles of Distributed Computing, PODC 2019 - Toronto, Canada Duration: 29 Jul 2019 → 2 Aug 2019 |

### Publication series

Name | Proceedings of the Annual ACM Symposium on Principles of Distributed Computing |
---|

### Conference

Conference | 38th ACM Symposium on Principles of Distributed Computing, PODC 2019 |
---|---|

Country/Territory | Canada |

City | Toronto |

Period | 29/07/19 → 2/08/19 |

## Keywords

- Boolean analysis
- Distributed computing
- Uniformity testing

## All Science Journal Classification (ASJC) codes

- Software
- Hardware and Architecture
- Computer Networks and Communications