Missing data completion using diffusion maps and laplacian pyramids

Neta Rabin, Dalia Fishelov

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A challenging problem in machine learning is handling missing data, also known as imputation. Simple imputation techniques complete the missing data by the mean or the median values. A more sophisticated approach is to use regression to predict the missing data from the complete input columns. In case the dimension of the input data is high, dimensionality reduction methods may be applied to compactly describe the complete input. Then, a regression from the low-dimensional space to the incomplete data column can be constructed from imputation. In this work, we propose a two-step algorithm for data completion. The first step utilizes a non-linear manifold learning technique, named diffusion maps, for reducing the dimension of the data. This method faithfully embeds complex data while preserving its geometric structure. The second step is the Laplacian pyramids multi-scale method, which is applied for regression. Laplacian pyramids construct kernels of decreasing scales to capture finer modes of the data. Experimental results demonstrate the efficiency of our approach on a publicly available dataset.

Original languageEnglish
Title of host publicationComputational Science and Its Applications - ICCSA 2017 - 17th International Conference, 2017
EditorsBeniamino Murgante, Bernady O. Apduhan, Giuseppe Borruso, Elena Stankova, Osvaldo Gervasi, Sanjay Misra, David Taniar, Ana Maria A.C. Rocha, Alfredo Cuzzocrea, Carmelo M. Torre
PublisherSpringer Verlag
Pages284-297
Number of pages14
ISBN (Print)9783319623917
DOIs
StatePublished - 2017
Externally publishedYes
Event17th International Conference on Computational Science and Its Applications, ICCSA 2017 - Trieste, Italy
Duration: 3 Jul 20176 Jul 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10404

Conference

Conference17th International Conference on Computational Science and Its Applications, ICCSA 2017
Country/TerritoryItaly
CityTrieste
Period3/07/176/07/17

Keywords

  • Diffusion maps
  • Dimensionality reduction
  • Laplacian pyramids
  • Missing data

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Missing data completion using diffusion maps and laplacian pyramids'. Together they form a unique fingerprint.

Cite this