FIND DATA: By Author | Journal | Sites   ANALYZE DATA: Help with R | SPSS | Stata | Excel   WHAT'S NEW? US Politics | Int'l Relations | Law & Courts
   FIND DATA: By Author | Journal | Sites   WHAT'S NEW? US Politics | IR | Law & Courts
If this link is broken, please report as broken. You can also submit updates (will be reviewed).
How Senate Speeches and Votes Reveal Hidden Ideology and Leadership
Insights from the Field
spatial preferences
Gaussian copula
factor model
text
US Senate
Methodology
Pol. An.
8 R files
8 other files
16 datasets
1 text files
Dataverse
Estimating Spatial Preferences from Votes and Text was authored by In Song Kim, John Londregan and Marc Ratkovic. It was published by Cambridge in Pol. An. in 2018.

🧭 What the model does

A new model extends the standard vote-choice framework to jointly generate roll-call votes and legislative speech from a common set of underlying preference parameters. Both votes and text are modeled as manifestations of the same latent preferences, allowing speech to inform estimates that traditionally rely only on votes.

🧰 How preferences are estimated

  • Estimation uses a sparse Gaussian copula factor model that:
  • Infers the number of latent dimensions automatically
  • Is robust to outliers in the data
  • Accounts for zero inflation common in text counts
  • This approach links spatial preference estimation (from votes) with latent factor structure recovery (from text) in a single statistical framework.

📦 What data was analyzed

  • Roll-call votes and floor speeches from recent sessions of the U.S. Senate were used to illustrate the estimator's performance.

🔎 Key findings

  • Two stable latent dimensions consistently emerge:
  • A conventional ideological dimension
  • A second dimension that reflects Senators' leadership roles
  • The method can leverage shared speech patterns to impute missing vote information, recovering reliable preference estimates for rank-and-file Senators even when only leadership votes are available.

💡 Why it matters

  • Jointly modeling votes and text provides a fuller, more robust picture of legislative preferences.
  • The estimator is practical for research settings with noisy text data or incomplete voting records, enabling better inference about legislators' positions and role-based behavior.
data
Find on Google Scholar
Find on JSTOR
Find on CUP
Political Analysis
Podcast host Ryan