Disentanglement Analysis in Deep Latent Variable Models Matching Aggregate Posterior Distributions
Surojit Saha, Sarang C. Joshi, Ross T. Whitaker
Published 2025 in IEEE International Conference on Acoustics, Speech, and Signal Processing
ABSTRACT
Deep latent variable models (DLVMs) are designed to learn meaningful representations in an unsupervised manner, such that the hidden explanatory factors are captured by independent, interpretable latent variables (i.e., disentanglement). The variational autoencoder (VAE) [1], [2] is a popular DLVM widely studied in disentanglement analysis because it models the posterior distribution with a factorized Gaussian [3], which encourages alignment of the latent factors with the latent axes. Several recently proposed metrics assume that the latent variables explaining the variation in the data are aligned with the latent axes (cardinal directions). However, in other DLVMs, such as the adversarial autoencoder (AAE) and the Wasserstein autoencoder (WAE-MMD), which match the aggregate posterior to the prior, the latent variables need not be aligned with the latent axes. In this work, we propose a statistical method to evaluate disentanglement for any DLVM. The proposed technique discovers the latent vectors representing the generative factors of a dataset, which can differ from the cardinal latent axes. We empirically demonstrate the advantages of the method on two datasets.
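As a minimal illustration of the problem the abstract describes (not the authors' method), the sketch below embeds two synthetic generative factors along a rotated, non-axis-aligned basis, as can happen in models that match the aggregate posterior to the prior, and recovers candidate factor directions via PCA on the latent codes. The synthetic data, the rotation, and the use of PCA are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two hypothetical generative factors with different variances.
factors = rng.normal(size=(5000, 2)) * np.array([3.0, 1.0])

# Latent codes: the factors embedded along a 45-degree rotated basis,
# so they are NOT aligned with the cardinal latent axes.
theta = np.pi / 4
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])
codes = factors @ R.T + 0.05 * rng.normal(size=(5000, 2))

# Discover candidate factor directions via PCA (SVD of centered codes).
centered = codes - codes.mean(axis=0)
_, _, Vt = np.linalg.svd(centered, full_matrices=False)
directions = Vt  # rows are the discovered latent vectors

# Projections of the codes onto the discovered directions.
proj = centered @ directions.T

def max_abs_corr(z, f):
    # For each factor (column of f), the largest absolute correlation
    # with any component of the representation z.
    c = np.corrcoef(np.column_stack([z, f]), rowvar=False)
    return np.abs(c[:z.shape[1], z.shape[1]:]).max(axis=0)

axis_corr = max_abs_corr(codes, factors)   # scoring along the raw axes
dir_corr = max_abs_corr(proj, factors)     # scoring along discovered directions

# Discovered-direction correlations approach 1 for both factors,
# while the raw axes fail to isolate the weaker factor.
print(axis_corr, dir_corr)
```

An axis-aligned metric scored on `codes` would understate how well the model has separated the two factors; scoring along the discovered directions recovers them cleanly.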
PUBLICATION RECORD
- Publication date
2025-01-26
- Fields of study
Mathematics, Computer Science
- Source metadata
Semantic Scholar
REFERENCES
26 references
CITED BY
1 citing paper