TY - GEN
T1 - On approximating the number of k-Cliques in sublinear time
AU - Eden, Talya
AU - Ron, Dana
AU - Seshadhri, C.
N1 - Publisher Copyright: © 2018 Association for Computing Machinery.
PY - 2018/6/20
Y1 - 2018/6/20
N2 - We study the problem of approximating the number of k-cliques in a graph when given query access to the graph. We consider the standard query model for general graphs via (1) degree queries, (2) neighbor queries and (3) pair queries. Let n denote the number of vertices in the graph, m the number of edges, and Ck the number of k-cliques. We design an algorithm that outputs a (1 +)-approximation (with high probability) for Ck, whose expected query complexity and running time are O Ck n 1/k + m C k k /2 poly(log n, 1/, k). Hence, the complexity of the algorithm is sublinear in the size of the graph for Ck = (mk/2−1). Furthermore, the query complexity of our algorithm is essentially optimal (up to the dependence on log n, 1/and k). The previous results in this vein are by Feige (SICOMP 06) and by Goldreich and Ron (RSA 08) for edge counting (k = 2) and by Eden et al. (FOCS 2015) for triangle counting (k = 3). Our result matches the complexities of these results. The previous result by Eden et al. hinges on a certain amortization technique that works for triangle counting, and does not generalize to all k. We obtain a general algorithm that works for any k ≥ 3 by designing a procedure that samples each k-clique incident to a given set S of vertices with approximately equal probability. The primary difficulty is in finding cliques incident to purely high-degree vertices, since random sampling within neighbors has a low success probability. This is achieved by an algorithm that samples uniform random high degree vertices and a careful tradeoff between estimating cliques incident purely to high-degree vertices and those that include a low-degree vertex.
AB - We study the problem of approximating the number of k-cliques in a graph when given query access to the graph. We consider the standard query model for general graphs via (1) degree queries, (2) neighbor queries and (3) pair queries. Let n denote the number of vertices in the graph, m the number of edges, and Ck the number of k-cliques. We design an algorithm that outputs a (1 +)-approximation (with high probability) for Ck, whose expected query complexity and running time are O Ck n 1/k + m C k k /2 poly(log n, 1/, k). Hence, the complexity of the algorithm is sublinear in the size of the graph for Ck = (mk/2−1). Furthermore, the query complexity of our algorithm is essentially optimal (up to the dependence on log n, 1/and k). The previous results in this vein are by Feige (SICOMP 06) and by Goldreich and Ron (RSA 08) for edge counting (k = 2) and by Eden et al. (FOCS 2015) for triangle counting (k = 3). Our result matches the complexities of these results. The previous result by Eden et al. hinges on a certain amortization technique that works for triangle counting, and does not generalize to all k. We obtain a general algorithm that works for any k ≥ 3 by designing a procedure that samples each k-clique incident to a given set S of vertices with approximately equal probability. The primary difficulty is in finding cliques incident to purely high-degree vertices, since random sampling within neighbors has a low success probability. This is achieved by an algorithm that samples uniform random high degree vertices and a careful tradeoff between estimating cliques incident purely to high-degree vertices and those that include a low-degree vertex.
KW - Approximation algorithms
KW - Counting cliques
KW - Sublinear algorithms
UR - http://www.scopus.com/inward/record.url?scp=85049871103&partnerID=8YFLogxK
U2 - https://doi.org/10.1145/3188745.3188810
DO - https://doi.org/10.1145/3188745.3188810
M3 - منشور من مؤتمر
T3 - Proceedings of the Annual ACM Symposium on Theory of Computing
SP - 100
EP - 113
BT - STOC 2018 - Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing
A2 - Henzinger, Monika
A2 - Kempe, David
A2 - Diakonikolas, Ilias
T2 - 50th Annual ACM Symposium on Theory of Computing, STOC 2018
Y2 - 25 June 2018 through 29 June 2018
ER -