What Predicts Founder Success?

A Research-Backed Scoring Framework — 2026-03-09 · 36 sources surveyed (21 academic, 6 VC, 5 industry, 1 book, 3 tools) · 28 deep-analyzed · 4 frontier models consulted

TL;DR

We surveyed 36 sources (21 academic papers, 6 VC practitioner analyses, 5 industry reports, 1 book, 3 tools), fetched and extracted structured findings from 28, and cross-validated with 4 frontier AI models (Opus 4.6, GPT-5.4-Pro, Grok 4.1 Reasoning, Gemini)
The result is a 6-dimension scoring framework grounded in what actually predicts founder/operator outcomes — not VC folklore
Every dimension is scorable from public data (LinkedIn, Crunchbase, GitHub, patents, SEC filings) — no interviews required
The framework aligns with how Sequoia, a16z, Benchmark, and YC actually evaluate founders — but makes it automated and consistent
Each dimension is backed by specific studies with measured effect sizesA quantitative measure of the magnitude of a phenomenon. Unlike p-values (which just say "is it real?"), effect sizes say "how big is it?" Common measures: Cohen's d, odds ratio, correlation coefficient r. — 176 total factors extracted across sources

1. The Framework

Six dimensions, weighted by research-validated predictive power and constrained to what's measurable from public data:

#	Dimension	Weight	Measurability	Research Basis
1	Founder-Market Fit	25%	High	Kauffman KFS, Azoulay et al. 2020, Sequoia/a16z/YC consensus
2	Prior Operational Evidence	20%	High	Gompers et al. 2010 (14K ventures), First Round 10-Year (300 cos)
3	Team Quality	20%	Medium-High	First Round (+163% for pairs), Torssell 2022 (25K ventures)
4	Network Position	15%	Medium	Bonaventura et al. 2020 (41K cos, Nature), Burt 2004
5	Technical Breadth	10%	High	Lazear 2005 ("jack of all trades"), sector-dependent
6	Leadership Magnetism	10%	Medium	a16z CEO framework, early hire quality as proxy

Why these six and not others?

Common VC intuitions like grit, academic pedigree, and raw intelligence were deliberately excluded. Grit reduces to conscientiousness when properly measured (Credé 2017, 88-study meta-analysis). Academic pedigree has rPearson correlation coefficient. Ranges from -1 to +1. In social science, r=0.10 is small, r=0.30 is medium, r=0.50 is large. An r of 0.05–0.15 means the variable explains less than 2% of outcome variance.=0.05–0.15 with outcomes and introduces systematic bias (Ewens & Townsend 2020). We kept only dimensions with robust evidence and public-data measurability.

2. The Six Dimensions in Detail

Each card presents: what the research says matters (the ideal signal), not how we currently measure it. Data sources and measurement strategy are a separate concern — see the implementation plan.

1. Founder-Market Fit 25%

How deeply does this person know the specific market they're attacking? The Kauffman Foundation's longitudinal study of 5,000 firms found prior industry experience to be the strongest predictor of firm survival. Azoulay et al. 2020 analyzed 2.7 million founders using US Census + IRS data and found the average age of a top-growth founder is 45 — because domain expertise compounds with time. A 50-year-old is 1.8x more likely to build a top-growth firm than a 30-year-old.

Every major VC framework now weights this as a primary signal: Sequoia calls it "clarity of thought about the market," a16z calls it "proprietary insight," YC asks "do you understand your users deeply?"

Ideal signal: Years in target industry, domain-specific roles, patents/publications in the target domain, prior companies in the same vertical, depth of mental model of their customer (Opus identified this as the single biggest gap in the literature).

Key design choice: This dimension is relational (person × market), not absolute. The same person scores differently for a fintech startup vs a biotech startup. When the target market is unknown, we fall back to scoring general domain depth.

2. Prior Operational Evidence 20%

Gompers et al. 2010 tracked 14,000 VC-backed entrepreneurs and found founders with a prior successful exit have a 30% chance of success in their next venture, vs 22% for first-timers. Importantly, prior failure does not predict future failure (23% vs 22%) — so this should be scored asymmetrically: reward success, don't penalize failure.

The First Round 10-Year Project (300 companies, ~600 founders) found repeat founders achieve +50% higher valuations, and Big Tech alumni outperform by +160%.

Ideal signal: Prior founding outcomes (granular: acquisition price, IPO, revenue milestones — not binary success/fail), largest operational scope managed (team size, revenue), zero-to-one building experience. Serial entrepreneurship status (18–30% of EU entrepreneurs are serial; they outperform).

Built-in temporal decay: A 2010 exit in ad-tech carries less signal than a 2023 exit in AI infrastructure. Recency and market adjacency should be weighted.

3. Team Quality 20%

First Round Capital found 2-person founding teams outperform solo founders by +163%. A GoingVC analysis found that 65% of high-growth startups that fail do so from management team dysfunction, not market or product failure. A study of 25,430 European ventures confirmed team attributes as a top-tier predictor. Torssell found the optimum is a 50/50 mix of founders and hired executives, not all-founder teams.

Ideal signal: Number of co-founders, complementary skill coverage (technical + business + domain), prior co-founding or co-working history (Roure & Maidique 1986: joint experience was a primary differentiator), early hire caliber, gender diversity (+63% per First Round).

4. Network Position 15%

Bonaventura et al. 2020 (Nature Scientific Reports) analyzed 41,830 companies across 117 countries over 26 years. Their key finding: network centralityA measure of how well-connected a node is within a network. High centrality = connected to many important people, bridges between clusters. Computed via PageRank, betweenness, or eigenvector centrality. — a startup's position in the global talent-flow network — doubles the baseline VC success rate. Top-20 firms by centrality had ~30% success rates vs ~15% baseline.

This is not about name-dropping or LinkedIn connection counts. It's about structural position: does talent flow through this person's network? Are they connected to knowledge hubs?

Ideal signal: Closeness/betweenness centrality in co-employment networks, board seats, co-investor graphs, advisor network quality, employee flow patterns. The Nature study used proprietary co-employment data — public LinkedIn data is a proxy, not the real signal.

5. Technical Breadth 10%

Lazear 2005 proposed the "jack of all trades" hypothesis: entrepreneurs with broader skill profiles outperform specialists. First Round's data confirmed that technical co-founders yield +230% in enterprise — but -31% in consumer. Pure technical depth is sector-dependent; breadth is more universally predictive.

Ideal signal: Range of functional roles held, variety of technical domains (GitHub breadth, patent diversity, publications across fields), ability to evaluate talent across functions (a16z's "functional literacy"). Sector-weighted: higher for deep-tech/infra, lower for consumer/marketplace.

6. Leadership Magnetism 10%

Central to the a16z CEO evaluation framework: "Can the CEO get the company to do what needs to be done?" Ben Horowitz weights this "white box" evaluation (process and team-building ability) above "black box" results. Great leaders attract disproportionately strong early teams — the quality of first hires is a measurable proxy for leadership that's hard to fake.

Ideal signal: Caliber of early employees attracted (prior exits, elite backgrounds), team growth rate, public thought leadership (talks, writing, community building), speed of decision-making, "constructive confrontation" culture (Horowitz). This is the hardest dimension to quantify from public data, hence the 10% weight. GPT-Pro flagged integrity/epistemic honesty as a related missing signal.

3. How Top VCs Actually Evaluate

Our framework was validated against the stated evaluation criteria of the four most successful VC firms. Every firm weights founder-market fit as a primary signal; none weight academic credentials:

Firm	Primary Signal	Secondary Signal	Distinctive Take
Sequoia	Founder-market fit, clarity of thought	Evidence of velocity	"Fanatic dedication to product and customers" (Moritz)
a16z	Technical insight + courage of conviction	Storytelling / recruiting ability	Evaluate how you think (process), not just outcomes
Benchmark	Product intuition, intellectual honesty	Capital efficiency mindset	"Bet on the jockey" — conviction over process
YC	Determination (≠ stubbornness), execution speed	Deep user understanding	"What have you done since applying?" — the delta, not the snapshot

Framework alignment

Our 6 dimensions cover the observable signals underlying each firm's framework. Founder-market fit (25%) captures Sequoia's "clarity of thought" and a16z's "proprietary insight." Prior operational evidence (20%) captures Benchmark's "jockey" bet. Team quality (20%) captures the team dysfunction signal all four flag. Leadership magnetism (10%) captures a16z's "can they recruit?" The dimensions these firms can't articulate or measure — resilience, coachability, vision — are deliberately excluded because they require interaction, not data.

4. What We Deliberately Exclude (and Why)

Scoring on unobservable traits would produce confident-sounding noise. The system is designed to score only what public data can reliably measure, then surface the top 20–40% for human evaluation of what it can't:

We Score (public data)

Prior exits and operational track record
Network position in talent flow
Domain tenure and industry depth
Team composition and complementarity
Technical contributions and breadth
Talent attraction track record

We Surface for Human Eval

Strategic vision and clarity of thought
Decision-making under uncertainty
Resilience and response to failure
Team dynamics and communication
Coachability and self-awareness
Storytelling and persuasion ability

Academic pedigree is excluded as a standalone dimension. Tamaseb's analysis of 30,000 startups shows Ivy League attendance doesn't correlate with billion-dollar outcomes. Ewens & Townsend 2020 documents systematic investor bias toward elite credentials. Academic background contributes signal only when it's in the target domain — and that's already captured by founder-market fit.

5. Cross-Validation: Four Frontier Models

We gave all 28 source extractions (176 factors) to 4 frontier AI models independently via vario maxthink — Opus 4.6, GPT-5.4-Pro, Grok 4.1 Reasoning, and Gemini — and asked each: "What 6 dimensions would you recommend for an automated scoring system using only publicly observable data?"

Dimension	Opus	GPT-Pro	Grok	Gemini	Mean
Founder Human Capital / Market Fit	25%	25%	25%	20%	24%
Team Composition & Completeness	20%	20%	—	25%	22%
Network / Ecosystem Position	18%	10%	15%	25%	17%
Funding / Financial Trajectory	15%	10%	—	15%	13%
Market / Product Positioning	12%	15%	—	—	14%
Behavioral / Adaptability Signals	10%	20%	—	10%	13%
Education Pedigree (standalone)	—	—	15%	5%	5%

Dashes indicate the model folded that signal into another dimension rather than listing it separately. Grok kept education pedigree as standalone; all others demoted or excluded it.

Convergence (high confidence — all 4 agree)

Founder human capital / market fit is the top dimension — all 4 models weight it 20–25%
Team composition belongs in top 3 — unanimous; cited First Round (+163%), GoingVC (65% fail from dysfunction), Torssell (25K ventures)
Network centrality is a distinct, measurable signal — all 4 cited the Nature study (2x baseline success rate)
Education alone is NOT a standalone dimension — 3 of 4 models excluded or gave ≤5%; Opus notes r=0.085, GPT-Pro calls it “endogenous”

Divergence (interesting disagreements)

Execution / demonstrated progress: GPT-Pro uniquely gave 20% to “demonstrated execution” (release cadence, GitHub commits, product maturity), citing Mueller 2023 (R&D stage as top predictor). Others folded this into human capital or behavioral signals.
Funding as dimension vs endogeneity trap: Opus (15%) and Gemini (15%) include funding trajectory. GPT-Pro caps it at 10% warning it’s “highly leaky/circular.” Grok excludes it entirely.
Missing dimension (each model identified a different gap):
- Opus: “Founder-market cognitive fit” — depth of the founder’s mental model of their customer
- GPT-Pro: “Founder integrity / epistemic honesty” — consistency between bios, filings, and actual history
- Grok: Early traction / customer validation metrics (PMF signals, usage stats)
- Gemini: The conflict between “what is most predictive” and “what is actually measurable”

Research caveats we've built into the framework

Survivorship biasThe logical error of focusing on entities that passed a selection process while overlooking those that did not. In VC research: analyzing funded companies tells you what VCs select for, not what actually predicts success in the broader population.: Most VC research only studies funded companies. We're modeling what predicts success among the already-selected, not the general population. Effect sizes may be inflated.
Temporal decay: A 2010 exit in ad-tech carries much less signal for a 2025 AI startup than a 2023 exit in ML infrastructure. The scoring applies recency weighting.
Revealed vs stated VC preferences: a16z says they weight founder-market fit above credentials, but their portfolio skews toward Stanford/MIT/FAANGFacebook (Meta), Apple, Amazon, Netflix, Google (Alphabet) — the five dominant US tech companies. Often used as shorthand for "elite tech company background." founders. Our scoring follows the research, not VC portfolio composition.
Breadth over depth: Lazear 2005 found entrepreneurs with broader skill profiles outperform specialists. We score breadth, not depth — the "jack of all trades" is the better founder.

6. Methodology

Three channels, run concurrently using lib/ingest/literature_review.py:

Channel 1: Automated Discovery + Fetch + Extract

12 search queries → 36 results → 26 fetched → 28 with findings → 176 factors

Related-work survey via Serper API generated 36 candidate sources. Each was fetched with global semaphore (10 concurrent) + per-domain semaphore (2 per host). 26 fetched successfully, 9 paywalled (abstract-only), 1 failed. Each fetched source was analyzed via gemini-flash to extract structured records: title, authors, year, type, methodology, factors (with measures and strength), key findings, and limitations. The 9 paywalled sources still contributed metadata and abstract-level findings where available.

Channel 2: Structured Factor Extraction

28 sources with findings · 176 factors extracted · mapped to 6 dimensions

Each factor was tagged with strength level (strong/moderate/weak/anecdotal) and quantitative measure where available. Factor-dimension mapping: Founder-Market Fit (38 factors), Leadership Magnetism (15), Team Quality (11), Technical Breadth (10), Network Position (7), Prior Operational Evidence (4), Other (91 — including personality traits, financial metrics, and hiring process factors).

Channel 3: Multi-Model Cross-Validation

Opus 4.6 · GPT-5.4-Pro · Grok 4.1 Reasoning · Gemini

All 28 source extractions + 176 factors were given to 4 frontier models via vario maxthink. Each model independently proposed a 6-dimension scoring framework with weights. Convergences (all 4 agree) taken as high-confidence; divergences flagged. The synthesis above reflects the cross-model consensus.

Sources surveyed

With findings

176

Factors extracted

Models consulted

Source breakdown by type

Type	Count	Fetched	With Findings	Example sources
Academic papers	21	14	14	Gompers 2010, Bonaventura 2020, Torssell 2022, Pasayat 2023
VC practitioner	6	6	6	First Round 10-Year, a16z CEO Framework, Sequoia podcast
Industry reports	5	5	5	GoingVC, VCII Scorecard, QuicklyHire, TalentHub
Books	1	1	1	The Resilient Founder (Ramsinghani 2021)
Tools	3	3	2	PitchLense, Startup Analyzer MVP, Launch Checklist

7. References

All 36 sources surveyed. ✅ = full text fetched and analyzed. 🔒 = paywalled (abstract only). ❌ = failed to fetch.

Academic Papers (21)

✅ Bonaventura, M. et al. (2020). "Predicting success in the worldwide start-up network." Nature Scientific Reports, 10, 345. 41,830 companies, 117 countries. nature.com
✅ Torssell, J. (2022). "The Most Influential Team Attributes When Predicting Start-up Success." KTH thesis. 25,430 European ventures. diva-portal.org
✅ Pasayat, A.K. et al. (2023). "Factors Responsible for the Success of a Start-up: A Meta-Analytic Approach." IEEE Trans. Eng. Mgmt. Meta-analysis of 19 studies. ieeexplore.ieee.org
✅ Roure, J.B. & Maidique, M.A. (1986). "Linking prefunding factors and high-technology venture success." J. Business Venturing, 1(3), 295–306. sciencedirect.com
✅ Kaczmarek, M. & Kaczmarek-Kurczak, P. (2016). "Personality traits and self-efficacy as predictors of business performance." N=81 longitudinal. ceeol.com
✅ Plehn-Dujowich, J.M. (2010). "A theory of serial entrepreneurship." Small Business Economics, 35(4), 377–398. springer.com
✅ Di Giannantonio, R. et al. (2022). "The Impact of Machine Learning on VC Investment Process." Springer LNCS 13454. springer.com
✅ Mueller, C.E. (2023). "Explaining the stage of product in pre-seed academic startup ventures." J. Business Venturing Insights. N=223. sciencedirect.com
✅ Amaral, A.M. et al. (2011). "Serial entrepreneurship, experience and self-employment duration." Small Business Economics, 37, 1–21. springer.com
✅ (Literature review). "Predicting Startup Success, a Literature Review." Academia.edu. 132 success predictors surveyed. academia.edu
🔒 Unger, J.M. et al. (2011). "Human Capital and Entrepreneurial Success: A Meta-Analytical Review." J. Business Venturing, 26(3), 341–358. aom.org
🔒 Gompers, P. et al. (2010). "Performance Persistence in Entrepreneurship and Venture Capital." J. Financial Economics, 96(1), 18–32. sciencedirect.com
🔒 Ewens, M. & Townsend, R.R. (2020). "Selection and serial entrepreneurs." J. Econ. & Mgmt. Strategy. wiley.com
🔒 "A systematic literature review of startup survival." J. Research in Marketing & Entrepreneurship, 26(1). emerald.com
🔒 "A longitudinal study to assess the most influential entrepreneurial features." J. Small Business & Entrepreneurship. tandfonline.com
🔒 "Human versus computer: benchmarking VCs and ML algorithms." SSRN #3706119. ssrn.com
🔒 "Choosing the right friends – predicting success through online social network." IJODE. inderscienceonline.com
🔒 "Early stage technology investments of pre-seed VCs." IJEV. inderscienceonline.com
🔒 "Building and leading engineering teams." ResearchGate. researchgate.net
🔒 "Predicting the Success of Startups through Machine Learning." MDPI JRFM, 12(4), 189. mdpi.com
❌ "Startup Score Card: Free Download." startupbiz.com. startupbiz.com (404)

VC Practitioner (6)

✅ First Round Capital. "10 Year Project." 300 companies, ~600 founders. 10years.firstround.com
✅ Horowitz, B. "How Andreessen Horowitz Evaluates CEOs." a16z.com
✅ Horowitz, B. / Sequoia podcast (2026). "What Makes a Great Founder." sequoiacap.com
✅ Nikolaeva, A. "Early-Stage Startup Valuation: VC Method, Scorecard Method." anastasianikolaeva.com
✅ First Round Review. "Founder-Led Growth Playbook." firstround.com
✅ First Round Review. "Building a V1 of Customer Success." firstround.com

Industry Reports (5)

✅ Sable, M. / GoingVC. "How Venture Capitalists Evaluate Founders." goingvc.com
✅ VCII Institute. "The VCII Founder Evaluation Scorecard." vciinstitute.com
✅ FidForward. "Hiring strategy for startups." fidforward.com
✅ QuicklyHire. "Startup Talent Scaling Strategy." quicklyhire.com
✅ TalentHub. "17 Best Talent Acquisition Strategies." talenthub.eu

Books (1)

✅ Ramsinghani, M. (2021). The Resilient Founder. Wiley. google.com/books

Tools (3)

✅ mugdhav. "Startup Analyzer MVP." github.com
✅ henu-wang. "Startup Launch Decision Checklist." github.com/gist
✅ connectaman. "PitchLense." github.com

Additional References (cited in report, not from survey)

Azoulay, P. et al. (2020). "Age and High-Growth Entrepreneurship." AER: Insights, 2(1), 65–82. aeaweb.org
Lazear, E.P. (2005). "Entrepreneurship." J. Labor Economics, 23(4). uchicago.edu
Credé, M. et al. (2017). "Much Ado About Grit." JPSP, 113(3). 88-study meta-analysis. apa.org
Ewens, M. & Townsend, R.R. (2020). "Are Early Stage Investors Biased?" J. Financial Economics, 135(3). sciencedirect.com
Tamaseb, A. (2021). Super Founders. PublicAffairs. superfoundersbook.com
Kauffman Foundation. "The Anatomy of an Entrepreneur." kauffman.org

Glossary

Effect size: A quantitative measure of the magnitude of a phenomenon. Unlike p-values (which say "is it statistically significant?"), effect sizes say "how big is it practically?" Common measures include Cohen's d, odds ratios, and correlation coefficients.
r-value (correlation coefficient): Pearson correlation coefficient, ranging from -1 to +1. In social science research: r=0.10 is small, r=0.30 is medium, r=0.50 is large. An r of 0.05–0.15 (as reported for academic credentials) means the variable explains less than 2% of outcome variance.
Survivorship bias: The logical error of focusing on entities that passed a selection process while overlooking those that did not. In VC research: analyzing funded companies tells you what VCs select for, not what actually predicts success in the broader population.
Founder-market fit: The degree to which a founder's background, expertise, and lived experience align with the specific market and problem they're attacking. Distinct from product-market fit (which is about the product, not the founder). Example: a former hospital administrator founding a health-tech company has high founder-market fit.
FAANG: Facebook (Meta), Apple, Amazon, Netflix, Google (Alphabet) — the five dominant US tech companies. Used as shorthand for "elite tech company background" in founder evaluation.
Network centrality: A measure of how well-connected a node is within a network. High centrality = connected to many important people, or bridges between otherwise disconnected clusters. The Bonaventura 2020 study used this to predict startup success across 41,830 companies.
Crunchbase: The largest public database of startup companies, funding rounds, founders, and investors. Primary source for funding timeline data, company relationships, and founder histories.