CaltechTHESIS
  A Caltech Library Service

Latent-Variable Modeling: Algorithms, Inference, and Applications

Citation

Taeb, Armeen (2020) Latent-Variable Modeling: Algorithms, Inference, and Applications. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/YRF1-7W29. https://resolver.caltech.edu/CaltechTHESIS:09222019-132051506

Abstract

Many driving factors of physical systems are latent or unobserved. Thus, understanding such systems relies crucially on accounting for the influence of the latent structure. This thesis makes advances in three aspects of latent-variable modeling: inference, algorithms, and applications. Specifically, we develop and explore latent-variable techniques that a) ensure interpretable and statistically significant models, b) can be efficiently optimized to identify the best fit to data, and c) provide useful insights in real-world applications. The specific contributions of this thesis are:

1. We employ a latent-variable graphical modeling technique to develop the first state-wide statistical model of the California reservoir network. With this model, we precisely characterize the system-wide response of the network to hypothetical drought conditions and propose guidelines for more sustainable reservoir management.

2. Motivated by the previous application, we provide a geometric framework to assess the extent to which our latent-variable model has learned true or false discoveries about the relevant physical phenomena. Our approach generalizes the classical notions of true and false discoveries in mathematical statistics, which rely on the discrete structure of the decision space, to settings where the decision space is continuous and more complicated. We highlight the utility of this viewpoint in problems involving subspace selection and low-rank estimation.

3. We propose a convex optimization procedure to fit a latent-variable graphical model for generalized linear models. This framework provides a flexible approach to model non-Gaussian variables including Poisson, Bernoulli, and exponential variables. A particularly novel aspect of our formulation is that it incorporates regularizers that are tailored to the type of latent variables.

4. We describe a computationally efficient framework to learn a latent-variable model with high-dimensional and non-iid data. This framework is based on factorizable precision operators that decouple the component associated with the observational dependencies from the component associated with interdependencies among the variables.

5. We propose a convex optimization technique to provide semantics to latent variables of a factor model. This approach is based on linking auxiliary variables -- chosen based on domain expertise -- to these latent variables.
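
For orientation, the Gaussian form of the latent-variable graphical modeling technique mentioned in items 1 and 3 is commonly posed in the literature as the convex program below. This is a sketch of that standard formulation, not necessarily the exact variant used in the thesis; the symbols are assumptions introduced here. $\hat\Sigma$ denotes the sample covariance of the observed variables, $S$ a sparse matrix encoding conditional dependencies among observed variables given the latent ones, $L$ a low-rank positive semidefinite matrix capturing the effect of marginalizing over the latent variables, and $\lambda, \gamma > 0$ regularization parameters.

```latex
\[
\begin{aligned}
(\hat S, \hat L) \in \arg\min_{S,\,L} \quad
  & -\log\det(S - L)
    + \operatorname{tr}\!\big((S - L)\,\hat\Sigma\big)
    + \lambda\big(\gamma \,\|S\|_{1} + \operatorname{tr}(L)\big) \\
\text{subject to} \quad
  & S - L \succ 0, \qquad L \succeq 0 .
\end{aligned}
\]
```

Here $\|S\|_1$ is the entrywise $\ell_1$ norm, which promotes sparsity in $S$, and $\operatorname{tr}(L)$ coincides with the nuclear norm on positive semidefinite matrices, which promotes low rank in $L$.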

Item Type:Thesis (Dissertation (Ph.D.))
Subject Keywords:Latent variables; model selection; false discoveries; low-rank estimation; convex optimization
Degree Grantor:California Institute of Technology
Division:Engineering and Applied Science
Major Option:Electrical Engineering
Awards:The W.P. Carey and Co., Inc., Prize in Applied Mathematics, 2020.
Thesis Availability:Public (worldwide access)
Research Advisor(s):
  • Chandrasekaran, Venkat
Group:Resnick Sustainability Institute
Thesis Committee:
  • Hassibi, Babak (chair)
  • Stuart, Andrew M.
  • Pachter, Lior S.
  • Doyle, John Comstock
  • Chandrasekaran, Venkat
Defense Date:16 August 2019
Non-Caltech Author Email:armeen.taeb (AT) gmail.com
Funders:
  • Resnick Sustainability Institute, grant number: UNSPECIFIED
  • NSF, grant number: CCF-1350590
  • Air Force Office of Scientific Research (AFOSR), grant number: FA9550-16-1-0210
Record Number:CaltechTHESIS:09222019-132051506
Persistent URL:https://resolver.caltech.edu/CaltechTHESIS:09222019-132051506
DOI:10.7907/YRF1-7W29
Related URLs:
  • https://doi.org/10.1002/2017WR020412 (DOI): Article adapted for Chapter 2.
  • https://arxiv.org/pdf/1810.08595.pdf (arXiv): Article adapted for Chapter 3.
  • https://doi.org/10.1007/s10107-017-1187-7 (DOI): Article adapted for Chapter 6.
ORCID:
  • Taeb, Armeen: 0000-0002-5647-3160
Default Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:11799
Collection:CaltechTHESIS
Deposited By: Armeen Taeb
Deposited On:30 Sep 2019 19:21
Last Modified:18 Dec 2020 18:37

Thesis Files

PDF - Final Version (10MB). See Usage Policy.
