Repository logo

On the Learning of Energy-Based Models Using Noise Contrastive Estimation

dc.contributor.authorShi, Boming
dc.contributor.supervisorMao, Yongyi
dc.date.accessioned2022-08-11T17:06:52Z
dc.date.available2022-08-11T17:06:52Z
dc.date.issued2022-08-11en_US
dc.description.abstractEnergy-Based Models (EBMs) are a family of unsupervised machine learning models that associate each point in the input space with an energy value in which low energy indicates a high likelihood. Specifically, an EBM can be viewed as an unnormalized probabilistic model, which upon normalization, gives rise to the probability density function of data. The main difficulty in learning EBMs lies in the computation of the normalization constant, or the partition function, a task known to be intractable in general. Several approaches have been proposed to avoid or overcome this difficulty, including Maximum Likelihood Estimation (MLE) with Markov chain Monte Carlo (MCMC), Score Matching (SM), Noise Contrastive Estimation (NCE), and so on. This thesis studies the learning of EBMs using NCE. Briefly, in NCE, the EBM learning problem is converted to learning a binary classifier, which aims to distinguish the real data from fake data drawn from a noise distribution. This process allows the learning of the energy function in the EBM to bypass a direct estimation of the partition function and a certain theoretical guarantee is available under some assumptions in some asymptotic limit. Despite the nice theoretical properties of NCE, in this work, we show that learning EBMs using NCE entails significant practical limitations. Specifically, there appears a tension between the quality of the learned model and the computational efficiency, due to which we must sacrifice one to achieve the other. We establish these limitations via empirical studies as well as a theoretical analysis based on a simple “Gaussian data learning problem”. Our analysis inspires a revised NCE scheme, Adaptive Noise Contrastive Estimation (ANCE), to overcome these limitations. Empirically, we show that ANCE achieves a better quality-efficiency trade-off than the standard NCE in some regimes.en_US
dc.identifier.urihttp://hdl.handle.net/10393/43904
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-28117
dc.language.isoenen_US
dc.publisherUniversité d'Ottawa / University of Ottawaen_US
dc.rightsAttribution-ShareAlike 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-sa/4.0/*
dc.subjectEBMen_US
dc.subjectNCEen_US
dc.titleOn the Learning of Energy-Based Models Using Noise Contrastive Estimationen_US
dc.typeThesisen_US
thesis.degree.disciplineGénie / Engineeringen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMCSen_US
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Scienceen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Shi_Boming_2022_thesis.pdf
Size:
13.65 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: