25 Years of the Manning Hypothesis — What 2D:4D Research Has Confirmed and What It Has Lost
In 1998, a British anthropologist named John Manning published a paper in Human Reproduction arguing that finger length ratios could serve as a biomarker of prenatal hormone exposure. Twenty-five years later, his hypothesis has spawned thousands of follow-up studies — some of which hold up remarkably well, others of which have quietly collapsed. This article traces that trajectory in a single piece.
1. The Beginning: Manning's 1998 Paper
The original paper was titled "The ratio of 2nd to 4th digit length: A predictor of sperm numbers and concentrations of testosterone, luteinizing hormone and oestrogen." The research team measured finger ratios, sperm counts, and sex hormone concentrations in 60 men. The main findings:
- Lower 2D:4D correlated with higher sperm counts and testosterone levels
- The ratio might serve as a non-invasive indicator of prenatal sex hormone exposure
- Men had lower average 2D:4D than women (sexual dimorphism)
The paper was cautious, but its implications were huge: measure a finger, estimate the fetal hormonal environment. That simplicity drove the explosive expansion of 2D:4D research that followed.
2. The 2000s: A Period of Unlimited Expansion
Through the early-to-mid 2000s, 2D:4D spread into nearly every domain connected to sex hormones. Soccer players, sexual orientation, autism spectrum disorders, schizophrenia, alcoholism, math ability, depression — researchers found the temptation of a "just measure a finger" biomarker hard to resist.
Representative early claims:
- People with low 2D:4D have more "masculine" brain architecture (Baron-Cohen's Extreme Male Brain theory)
- Non-heterosexual women have lower 2D:4D than heterosexual women
- Low 2D:4D correlates strongly with numerical ability
- Links to infidelity, aggression, and risk-taking
Some of these claims have weakened or failed to replicate. Popular media, however, still cite the strong claims from this era.
3. 2011: Zheng & Cohn Establish the Mechanism
The most decisive piece of evidence for the Manning hypothesis came in 2011, when Zheng & Cohn published mouse experiments in PNAS. They showed that in fetal mouse finger primordia, androgen receptor (AR) and estrogen receptor (ER-α) expression was greater in the 4th digit than the 2nd.
The crucial experiment: genetically blocking AR during fetal development suppressed 4th digit growth, raising 2D:4D (feminization); blocking ER had the opposite effect. This was direct evidence that Manning's original hypothesis was biologically plausible.
This paper is widely regarded as the turning point. Before it, 2D:4D research was correlational. Afterward, it was a hypothesis with an established mechanism.
4. Mid-to-Late 2010s: Skepticism Arrives
But having a mechanism doesn't mean every downstream claim is right. In the late 2010s, methodological skepticism gained traction.
The Allometry Debate
Kratochvíl and Flegr argued that men's lower 2D:4D might simply be a mathematical byproduct of men having longer fingers overall. Counter-arguments followed, but the debate unsettled a basic premise of the field.
The GWAS Shock
In 2018, Warrington et al. conducted a large genome-wide association study (GWAS) and identified 9 novel loci associated with 2D:4D. But none of them connected directly to androgen signaling pathways. Nor did the predicted X-linked inheritance pattern appear in the data. This was a serious blow to the claim that 2D:4D is a strong marker of prenatal androgens.
The Replication Crisis
The broader replication crisis in psychology hit 2D:4D research hard. Early "strong correlations" repeatedly weakened or disappeared in meta-analyses. Aggression, Big5 personality, and cognitive function saw the sharpest declines.
5. The 2020s: "Through a Glass, Darkly"
Swift-Gallant et al.'s 2020 review in Hormones and Behavior summarized the field's status with its title: "Through a glass, darkly: Human digit ratios reflect prenatal androgens, imperfectly." 2D:4D reflects fetal hormones — imperfectly.
A 2024 meta-analysis refined this further: 2D:4D relates to amniotic fluid testosterone but not umbilical cord blood testosterone. This suggests that 2D:4D reflects the hormonal environment of mid-pregnancy, not the hormonal state at birth.
The 2025 meta-analyses show mixed results across domains:
- Sports performance: weak but consistent negative correlation maintained (Gower 2025)
- Academic achievement: no statistically significant correlation (2025 meta-analysis)
- Sexual orientation: consistent effect in women, weak or mixed in men (2025 review of 60 papers)
- ADHD: weak but significant correlation (meta-analysis 2024)
- Cognitive function: insufficient evidence
6. What Still Stands
After 25 years, the findings that remain robust:
- Sex difference: Sexual dimorphism in 2D:4D replicates across every population tested. The greater effect size on the right hand also survives.
- Biological mechanism: Prenatal androgen/estrogen signaling controlling 4th digit growth is experimentally established.
- Population variation: Ethnic differences in mean 2D:4D are repeatedly confirmed.
- Weak correlations with endurance and competitive performance: Effect sizes are small but consistent across meta-analyses.
- Independence from adult hormones: Meta-analyses reconfirm that 2D:4D is unrelated to adult testosterone.
7. What Has Collapsed
And the claims that failed to replicate or weakened substantially:
- Strong personality correlations: Big5 links have nearly vanished.
- Aggression prediction: Meta-analyses find r ≈ 0.036 — effectively zero.
- AR CAG repeat polymorphism: The predicted connection to AR gene variation was not confirmed by GWAS.
- X-linked inheritance: Manning's predicted inheritance pattern was not supported in large-scale GWAS.
- Academic achievement and intelligence: No significant effect in 2025 meta-analyses.
8. The Lesson of 25 Years
What Manning's hypothesis has shown over 25 years is a textbook example of how scientific claims get filtered over time. Most of the flashy early claims have weakened, while the core findings — sex differences, biological mechanism, some group-level correlations — have survived.
This isn't a story of failure; it's a story of science working. 2D:4D has moved from the early promise of "a perfect window into prenatal hormones" to a more modest position: "an imperfect signal seen through a glass, darkly." Researchers still study it, but with much more caution in interpretation.
For general readers, the takeaway is simple: 2D:4D tests can be an interesting self-exploration tool, but they're not a crystal ball for your future. They reflect a weak signal of the prenatal hormonal environment — and even that signal is seen through a cloudy glass.
References
- Manning JT et al. (1998). The ratio of 2nd to 4th digit length. Human Reproduction, 13(11), 3000-3004.
- Zheng Z, Cohn MJ (2011). Developmental basis of sexually dimorphic digit ratios. PNAS, 108(39), 16289-16294.
- Swift-Gallant A et al. (2020). Through a glass, darkly. Hormones and Behavior, 120, 104686.
- Warrington NM et al. (2018). GWAS identifies nine novel loci for 2D:4D. Human Molecular Genetics, 27(11), 2025-2038.
- Gower B et al. (2025). Digit Ratio and Cardiorespiratory Fitness. American Journal of Human Biology, 37(4).