Abstract
We propose a new method to estimate Wasserstein distances and optimal transport plans between two probability distributions from samples in high dimension. Unlike plug-in rules that simply replace the true distributions by their empirical counterparts, our method promotes couplings with low transport rank, a new structural assumption that is similar to the nonnegative rank of a matrix. Regularizing based on this assumption leads to drastic improvements on high-dimensional data for various tasks, including domain adaptation in single-cell RNA sequencing data. These findings are supported by a theoretical analysis that indicates that the transport rank is key in overcoming the curse of dimensionality inherent to data-driven optimal transport.
| Original language | English |
|---|---|
| State | Published - 2020 |
| Externally published | Yes |
| Event | 22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 - Naha, Japan Duration: 16 Apr 2019 → 18 Apr 2019 |
Conference
| Conference | 22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 |
|---|---|
| Country/Territory | Japan |
| City | Naha |
| Period | 16/04/19 → 18/04/19 |
All Science Journal Classification (ASJC) codes
- Artificial Intelligence
- Statistics and Probability