FIND DATA: By Author | Journal | Sites   ANALYZE DATA: Help with R | SPSS | Stata | Excel   WHAT'S NEW? US Politics | Int'l Relations | Law & Courts
   FIND DATA: By Author | Journal | Sites   WHAT'S NEW? US Politics | IR | Law & Courts
If this link is broken, please report as broken. You can also submit updates (will be reviewed).
Measure Election Fraud From Digits Without P-Values
Insights from the Field
digit forensics
election fraud
latent class
mixture index
Beber-Scacco
Methodology
Pol. An.
2 R files
1 Datasets
12 Text
8 Other
Dataverse
Election Fraud: a Latent Class Framework for Digit-Based Tests was authored by Juraj Medzihorsky. It was published by Cambridge in Pol. An. in 2015.

🔎 What Was the Problem?

Digit-based election forensics (DBEF) typically depends on null-hypothesis significance testing, which can distort substantive conclusions and leave practitioners with hard-to-interpret results.

🧩 How the New Framework Works

The approach decomposes the observed numeral distribution into two latent classes—"no fraud" and "fraud"—by identifying the smallest fraction of numerals that must be removed or reallocated to attain a perfect fit to the "no fraud" model. That fraction is directly interpretable as a measure of fraudulence.

  • Two specific procedures are described:
  • Removing numerals until the remainder perfectly fits the "no fraud" model (a removal-based measure).
  • Reallocating numerals to achieve a perfect fit (a reallocation-based measure).
  • These two procedures map onto established fit measures: the π* (pi-star) mixture index of fit and the Δ (Delta) dissimilarity index, respectively.

⚙️ Relaxing Distributional Assumptions

Independently of the latent-class decomposition, the distributional assumptions that standard DBEF methods require can be relaxed in some contexts. Either alone or together, the latent-class framework and these relaxed assumptions permit decomposition and model-fitting that are more flexible than existing DBEF approaches.

📊 Reanalysis of Existing Data

Application of the method to Beber and Scacco (2012) data demonstrates that the latent-class approach can produce different substantive conclusions than prior analyses, illustrating its practical implications for forensic inference.

⚖️ Why It Matters

The framework avoids overreliance on hypothesis-testing heuristics, yields an interpretable fraud measure (the minimal fraction of problematic numerals), and expands the modeling toolkit for digit-based election forensics, enabling clearer, more nuanced assessments of suspicious numeral patterns.

data
Find on Google Scholar
Find on JSTOR
Find on CUP
Political Analysis
Podcast host Ryan