TY - GEN
T1 - Monitoring distributed, heterogeneous data streams
T2 - 1st International Conference on Applied Algorithms, ICAA 2014
AU - Keren, Daniel
AU - Sagy, Guy
AU - Abboud, Amir
AU - Ben-David, David
AU - Schuster, Assaf
AU - Sharfman, Izchak
AU - Deligiannakis, Antonios
N1 - Publisher Copyright: © Springer International Publishing Switzerland 2014.
PY - 2014
Y1 - 2014
N2 - In many emerging applications, the data to be monitored is of very high volume, dynamic, and distributed, making it infeasible to collect the distinct data streams to a central node and process them there. Often, the monitoring problem consists of determining whether the value of a global function, which depends on the union of all streams, crossed a certain threshold. A great deal of effort is directed at reducing communication overhead by transforming the monitoring of the global function to the testing of local constraints, checked independently at the nodes. Recently, geometric monitoring (GM) proved to be very useful for constructing such local constraints for general (non-linear, non-monotonic) functions. Alas, in all current variants of geometric monitoring, the constraints at all nodes share an identical structure and are, thus, unsuitable for handling heterogeneous streams, which obey different distributions at the distinct nodes. To remedy this, we propose a general approach for geometric monitoring of heterogeneous streams (HGM), which defines constraints tailored to fit the distinct data distributions at the nodes. While optimally selecting the constraints is an NP-hard problem, we provide a practical solution, which seeks to reduce running time by hierarchically clustering nodes with similar data distributions and then solving more, but simpler, optimization problems.
AB - In many emerging applications, the data to be monitored is of very high volume, dynamic, and distributed, making it infeasible to collect the distinct data streams to a central node and process them there. Often, the monitoring problem consists of determining whether the value of a global function, which depends on the union of all streams, crossed a certain threshold. A great deal of effort is directed at reducing communication overhead by transforming the monitoring of the global function to the testing of local constraints, checked independently at the nodes. Recently, geometric monitoring (GM) proved to be very useful for constructing such local constraints for general (non-linear, non-monotonic) functions. Alas, in all current variants of geometric monitoring, the constraints at all nodes share an identical structure and are, thus, unsuitable for handling heterogeneous streams, which obey different distributions at the distinct nodes. To remedy this, we propose a general approach for geometric monitoring of heterogeneous streams (HGM), which defines constraints tailored to fit the distinct data distributions at the nodes. While optimally selecting the constraints is an NP-hard problem, we provide a practical solution, which seeks to reduce running time by hierarchically clustering nodes with similar data distributions and then solving more, but simpler, optimization problems.
UR - http://www.scopus.com/inward/record.url?scp=84927673363&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-04126-1_2
DO - 10.1007/978-3-319-04126-1_2
M3 - Conference contribution
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 17
EP - 28
BT - Applied Algorithms - 1st International Conference, ICAA 2014, Proceedings
A2 - Gupta, Prosenjit
A2 - Zaroliagis, Christos
PB - Springer Verlag
Y2 - 13 January 2014 through 15 January 2014
ER -