Mean Estimation in High-Dimensional Binary Markov Gaussian Mixture Models

Yihan Zhang, Nir Weinberger

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We consider a high-dimensional mean estimation problem over a binary hidden Markov model, which illuminates the interplay between memory in data, sample size, dimension, and signal strength in statistical inference. In this model, an estimator observes n samples of a d-dimensional parameter vector θ ∈ Rd, multiplied by a random sign Si (1 ≤ i ≤ n), and corrupted by isotropic standard Gaussian noise. The sequence of signs {Si}i∈[n] ∈ {−1, 1}n is drawn from a stationary homogeneous Markov chain with flip probability δ ∈ [0, 1/2]. As δ varies, this model smoothly interpolates two well-studied models: the Gaussian Location Model for which δ = 0 and the Gaussian Mixture Model for which δ = 1/2. Assuming that the estimator knows δ, we establish a nearly minimax optimal (up to logarithmic factors) estimation error rate, as a function of ∥θ∥, δ, d, n. We then provide an upper bound to the case of estimating δ, assuming a (possibly inaccurate) knowledge of θ. The bound is proved to be tight when θ is an accurately known constant. These results are then combined to an algorithm which estimates θ with δ unknown a priori, and theoretical guarantees on its error are stated.

Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems 35 - 36th Conference on Neural Information Processing Systems, NeurIPS 2022
EditorsS. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, A. Oh
ISBN (Electronic)9781713871088
StatePublished - 2022
Event36th Conference on Neural Information Processing Systems, NeurIPS 2022 - New Orleans, United States
Duration: 28 Nov 20229 Dec 2022

Publication series

NameAdvances in Neural Information Processing Systems
Volume35

Conference

Conference36th Conference on Neural Information Processing Systems, NeurIPS 2022
Country/TerritoryUnited States
CityNew Orleans
Period28/11/229/12/22

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Information Systems
  • Signal Processing

Fingerprint

Dive into the research topics of 'Mean Estimation in High-Dimensional Binary Markov Gaussian Mixture Models'. Together they form a unique fingerprint.

Cite this