Accuracy of latent-variable estimation in Bayesian semi-supervised learning

Published 2013 in Neural Networks

ABSTRACT

Hierarchical probabilistic models, such as Gaussian mixture models, are widely used for unsupervised learning tasks. These models consist of observable and latent variables, which represent the observable data and the underlying data-generation process, respectively. Unsupervised learning tasks, such as cluster analysis, are regarded as estimations of latent variables based on the observable ones. The estimation of latent variables in semi-supervised learning, where some labels are observed, will be more precise than that in unsupervised, and one of the concerns is to clarify the effect of the labeled data. However, there has not been sufficient theoretical analysis of the accuracy of the estimation of latent variables. In a previous study, a distribution-based error function was formulated, and its asymptotic form was calculated for unsupervised learning with generative models. It has been shown that, for the estimation of latent variables, the Bayes method is more accurate than the maximum-likelihood method. The present paper reveals the asymptotic forms of the error function in Bayesian semi-supervised learning for both discriminative and generative models. The results show that the generative model, which uses all of the given data, performs better when the model is well specified.

PUBLICATION RECORD

Publication year
2013
Venue
Neural Networks
Publication date
2013-08-09
Fields of study
Mathematics, Computer Science, Medicine
Identifiers
DOI 10.1016/j.neunet.2015.04.012 arXiv 1308.2029 PMID 26005790
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Accuracy analysis of semi-supervised classification when the class balance changes
2015cited by this paper
Asymptotic accuracy of Bayes estimation for latent variables with redundancy
2012cited by this paper
Asymptotic accuracy of distribution-based estimation of latent variables
2012influential reference
An Asymptotic Behaviour of the Marginal Likelihood for General Markov Models
2011cited by this paper
Asymptotic analysis of Bayesian generalization error with Newton diagram
2010cited by this paper
Stochastic Complexity and Generalization Error of a Restricted Boltzmann Machine in Bayesian Estimation
2010cited by this paper
Equations of States in Singular Statistical Estimation
2007cited by this paper
A Model Selection Method Based on Bound of Learning Coefficient
2006cited by this paper
Stochastic complexities of reduced rank regression in Bayesian estimation
2005cited by this paper
Variational algorithms for approximate Bayesian inference
2003cited by this paper
Toward a perception-based theory of probabilistic reasoning with imprecise probabilities
2002influential reference
Asymptotic Model Selection for Naive Bayesian Networks
2002cited by this paper
Agile Software Development
2002cited by this paper
Graphical Models and Variational Methods
2001cited by this paper
Algebraic Analysis for Non-identifiable Learning Machines
2000influential reference
Inferring Parameters and Structure of Latent Variable Models by Variational Bayes
1999cited by this paper
Information-theoretic asymptotics of Bayes methods
1990influential reference
A statistical approach to learning and generalization in layered neural networks
1989cited by this paper
Stochastic Complexity and Modeling
1986cited by this paper
Maximum Likelihood Estimation of Misspecified Models
1982cited by this paper
Estimating the Dimension of a Model
1978influential reference
Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper
1977cited by this paper
A new look at the statistical model identification
1974influential reference

CITED BY

Application of Artificial Intelligence in Diagnosis of Craniopharyngioma
2022cites this paper
A Novel Semi-Supervised Sparse Bayesian Regression Based on Variational Inference for Industrial Datasets With Incomplete Outputs
2020cites this paper
Inference of time-delayed gene regulatory networks based on dynamic Bayesian network hybrid learning method
2017cites this paper
Effects of additional data on Bayesian clustering
2016influential citation
Application of Artiﬁcial Intelligence in Diagnosis of Craniopharyngioma
year unknowncites this paper