Distributed data clustering in sensor networks

Ittay Eyal, Idit Keidar, RAPHAEL ROM

Research output: Contribution to journalArticlepeer-review

Abstract

Low overhead analysis of large distributed data sets is necessary for current data centers and for future sensor networks. In such systems, each node holds some data value, e.g., a local sensor read, and a concise picture of the global system state needs to be obtained. In resource-constrained environments like sensor networks, this needs to be done without collecting all the data at any location, i.e., in a distributed manner. To this end, we address the distributed clustering problem, inwhich numerous interconnected nodes compute a clustering of their data, i.e., partition these values into multiple clusters, and describe each cluster concisely. We present a generic algorithm that solves the distributed clustering problem and may be implemented in various topologies, using different clustering types. For example, the generic algorithm can be instantiated to cluster values according to distance, targeting the same problem as the famous k-means clustering algorithm. However, the distance criterion is often not sufficient to provide good clustering results. We present an instantiation of the generic algorithm that describes the values as a Gaussian Mixture (a set of weighted normal distributions), and uses machine learning tools for clustering decisions. Simulations show the robustness, speed and scalability of this algorithm. We prove that

Original languageEnglish
Pages (from-to)207-222
Number of pages16
JournalDistributed Computing
Volume24
Issue number5
DOIs
StatePublished - Dec 2011

Keywords

  • Distributed clustering
  • Robust aggregation
  • Sensor networks

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Hardware and Architecture
  • Computer Networks and Communications
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Distributed data clustering in sensor networks'. Together they form a unique fingerprint.

Cite this