Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published April 2021 | Supplemental Material + Submitted
Journal Article Open

Scaling Data from Multiple Sources

Abstract

We introduce a method for scaling two datasets from different sources. The proposed method estimates a latent factor common to both datasets as well as an idiosyncratic factor unique to each. In addition, it offers a flexible modeling strategy that permits the scaled locations to be a function of covariates, and efficient implementation allows for inference through resampling. A simulation study shows that our proposed method improves over existing alternatives in capturing the variation common to both datasets, as well as the latent factors specific to each. We apply our proposed method to vote and speech data from the 112th U.S. Senate. We recover a shared subspace that aligns with a standard ideological dimension running from liberals to conservatives, while recovering the words most associated with each senator's location. In addition, we estimate a word-specific subspace that ranges from national security to budget concerns, and a vote-specific subspace with Tea Party senators on one extreme and senior committee leaders on the other.

Additional Information

© The Author(s) 2020. Published by Cambridge University Press on behalf of the Society for Political Methodology. Published 23 November 2020. We are grateful to Arthur Spirling, Jacob Neihesel, John Londregan, and Dustin Tingley and audiences at Princeton University, New York University, and the University of Buffalo for for helpful comments on an suggestions. Data Availability Statement: Replication code for this article has been published in Code Ocean, a computational reproducibility platform that enables users to run the code, and can be viewed interactively at Enamorado et al. (2020a) or https://doi.org/10.24433/CO.3824807.v1. A preservation copy of the same code and data can also be accessed via Harvard Dataverse at Enamorado et al. (2020b) or https://doi.org/10.7910/DVN/FOUVELL.

Attached Files

Submitted - md2s_rr.pdf

Supplemental Material - S1047198720000248sup001.pdf

Files

S1047198720000248sup001.pdf
Files (1.5 MB)
Name Size Download all
md5:06c48972a503c414ababbcee21eb1f53
502.1 kB Preview Download
md5:1b929e6984f8c903973fb26071a20cc9
987.2 kB Preview Download

Additional details

Created:
August 20, 2023
Modified:
October 20, 2023