Heterogeneous datasets representation and learning using diffusion maps and Laplacian pyramids

Neta Rabin, Ronald R. Coifman

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The diffusion maps together with the geometric harmonics provide a method for describing the geometry of high dimensional data and for extending these descriptions to new data points and to functions, which are defined on the data. This method suffers from two limitations. First, even though real-life data is often heterogeneous , the assumption in diffusion maps is that the attributes of the processed dataset are comparable. Second, application of the geometric harmonics requires careful setting for the correct extension scale and condition number. In this paper, we propose a method for representing and learning heterogeneous datasets by using diffusion maps for unifying and embedding heterogeneous dataset and by replacing the geometric harmonics with the Laplacian pyramid extension. Experimental results on three benchmark datasets demonstrate how the learning process becomes straightforward when the constructed representation smoothly parameterizes the task-related function.

Original languageEnglish
Title of host publicationProceedings of the 12th SIAM International Conference on Data Mining, SDM 2012
PublisherSociety for Industrial and Applied Mathematics (SIAM)
Pages189-199
Number of pages11
ISBN (Print)9781611972320
DOIs
StatePublished - 2012
Externally publishedYes
Event12th SIAM International Conference on Data Mining, SDM 2012 - Anaheim, CA, United States
Duration: 26 Apr 201228 Apr 2012

Publication series

NameProceedings of the 12th SIAM International Conference on Data Mining, SDM 2012

Conference

Conference12th SIAM International Conference on Data Mining, SDM 2012
Country/TerritoryUnited States
CityAnaheim, CA
Period26/04/1228/04/12

All Science Journal Classification (ASJC) codes

  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Heterogeneous datasets representation and learning using diffusion maps and Laplacian pyramids'. Together they form a unique fingerprint.

Cite this