Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published October 2021 | Submitted
Journal Article Open

Posterior consistency of semi-supervised regression on graphs

Abstract

Graph-based semi-supervised regression (SSR) involves estimating the value of a function on a weighted graph from its values (labels) on a small subset of the vertices; it can be formulated as a Bayesian inverse problem. This paper is concerned with the consistency of SSR in the context of classification, in the setting where the labels have small noise and the underlying graph weighting is consistent with well-clustered vertices. We present a Bayesian formulation of SSR in which the weighted graph defines a Gaussian prior, using a graph Laplacian, and the labeled data defines a likelihood. We analyze the rate of contraction of the posterior measure around the ground truth in terms of parameters that quantify the small label error and inherent clustering in the graph. We obtain bounds on the rates of contraction and illustrate their sharpness through numerical experiments. The analysis also gives insight into the choice of hyperparameters that enter the definition of the prior.

Additional Information

© 2021 IOP Publishing Ltd. Received 24 March 2021; Revised 6 July 2021; Accepted 17 August 2021; Published 30 September 2021. This work is supported by NSF grant DMS 1818977, AFOSR grant FA9550-17-1-0185, NSERC PDF fellowship, a Caltech Von Kármán instructorship, DOD NDSEG Fellowship, and DARPA grant FA8750-18-2-0066.

Attached Files

Submitted - 2007.12809.pdf

Files

2007.12809.pdf
Files (720.8 kB)
Name Size Download all
md5:701f9dea4d014f7de454863c5aeb6981
720.8 kB Preview Download

Additional details

Created:
August 20, 2023
Modified:
March 5, 2024