How Senate Speeches and Votes Reveal Hidden Ideology and Leadership

Insights from the Field

spatial preferences

Gaussian copula

factor model

text

US Senate

Estimating Spatial Preferences from Votes and Text was authored by In Song Kim, John Londregan and Marc Ratkovic. It was published by Cambridge in Pol. An. in 2018.

🧭 What the model does

A new model extends the standard vote-choice framework to jointly generate roll-call votes and legislative speech from a common set of underlying preference parameters. Both votes and text are modeled as manifestations of the same latent preferences, allowing speech to inform estimates that traditionally rely only on votes.

🧰 How preferences are estimated

Estimation uses a sparse Gaussian copula factor model that:
Infers the number of latent dimensions automatically
Is robust to outliers in the data
Accounts for zero inflation common in text counts
This approach links spatial preference estimation (from votes) with latent factor structure recovery (from text) in a single statistical framework.

📦 What data was analyzed

Roll-call votes and floor speeches from recent sessions of the U.S. Senate were used to illustrate the estimator's performance.

🔎 Key findings

Two stable latent dimensions consistently emerge:
A conventional ideological dimension
A second dimension that reflects Senators' leadership roles
The method can leverage shared speech patterns to impute missing vote information, recovering reliable preference estimates for rank-and-file Senators even when only leadership votes are available.

💡 Why it matters

Jointly modeling votes and text provides a fuller, more robust picture of legislative preferences.
The estimator is practical for research settings with noisy text data or incomplete voting records, enabling better inference about legislators' positions and role-based behavior.