Pseudo-Nonlinear Data Augmentation: A Constrained Energy Minimization Viewpoint

Posted: Jan 26th 2026
Abstract

We propose a new non-linear data augmentation framework powered by information geometry.

Pseudo-Nonlinear Data Augmentation: A Constrained Energy Minimization Viewpoint

arXiv | GitHub

Brief Summary

We propose a simple yet novel data augmentation method for general data modalities based on energy-based modeling and principles from information geometry. Unlike most existing learning-based data augmentation methods, which rely on learning latent representations with generative models, our proposed framework enables an intuitive construction of a geometrically aware latent space that represents the structure of the data itself, supporting efficient and explicit encoding and decoding procedures. We then present and discuss how to design latent spaces that will subsequently control the augmentation with the proposed algorithm. Empirical results demonstrate that our data augmentation method achieves competitive performance in downstream tasks compared to other baselines, while offering fine-grained controllability that is lacking in the existing literature.

Citation

@inproceedings{hu2026pseudononlinear,
	title={Pseudo-Nonlinear Data Augmentation: A Constrained Energy Minimization Viewpoint},
	author={Pingbang Hu and Mahito Sugiyama},
	booktitle={The Fourteenth International Conference on Learning Representations},
	year={2026},
	url={https://openreview.net/forum?id=p9A1oyktVB}
}
Last Updated on Feb 9th 2026