Tuning-Free Maximum Likelihood Training of Latent Variable Models via Coin Betting

Sharrock, Louis and Dodd, Daniel and Nemeth, Christopher (2024) Tuning-Free Maximum Likelihood Training of Latent Variable Models via Coin Betting. Proceedings of Machine Learning Research. ISSN 1938-7228 (In Press)

[thumbnail of coin_em-3]
Text (coin_em-3)
Available under License Creative Commons Attribution.

Download (0B)
[thumbnail of coin_em-3]
Text (coin_em-3)
Available under License Creative Commons Attribution.

Download (0B)
[thumbnail of coin_em-3]
Text (coin_em-3) - Accepted Version
Restricted to Repository staff only until 1 January 2040.
Available under License Creative Commons Attribution.

Download (0B)
[thumbnail of coin_em-3]
Text (coin_em-3)
coin_em-3.pdf - Accepted Version
Restricted to Repository staff only until 1 January 2040.
Available under License Creative Commons Attribution.

Download (4MB)

Abstract

We introduce two new particle-based algo- rithms for learning latent variable models via marginal maximum likelihood estimation, including one which is entirely tuning-free. Our methods are based on the perspective of marginal maximum likelihood estimation as an optimization problem: namely, as the minimization of a free energy functional. One way to solve this problem is via the discretiza- tion of a gradient flow associated with the free energy. We study one such approach, which resembles an extension of Stein varia- tional gradient descent, establishing a descent lemma which guarantees that the free energy decreases at each iteration. This method, and any other obtained as the discretization of the gradient flow, necessarily depends on a learn- ing rate which must be carefully tuned by the practitioner in order to ensure convergence at a suitable rate. With this in mind, we also propose another algorithm for optimizing the free energy which is entirely learning rate free, based on coin betting techniques from convex optimization. We validate the performance of our algorithms across several numerical ex- periments, including several high-dimensional settings. Our results are competitive with existing particle-based methods, without the need for any hyperparameter tuning.

Item Type:
Journal Article
Journal or Publication Title:
Proceedings of Machine Learning Research
Uncontrolled Keywords:
Research Output Funding/yes_internally_funded
Subjects:
?? yes - internally fundedyes ??
ID Code:
216011
Deposited By:
Deposited On:
08 Mar 2024 10:05
Refereed?:
Yes
Published?:
In Press
Last Modified:
09 Mar 2024 00:49