PHBench - Predict the next Series A from a ProductHunt launch

PHBench: the first public benchmark predicting Series A funding from Product Hunt launch signals. We analyzed 67,292 featured launches over 7 years, linked to 528 verified Series A rounds via Crunchbase. Champion model: 4.7x lift over random. Team size Γ— community engagement is the strongest signal; B2B (API, Payments, Fintech) converts at 3x baseline; Rank #1 raises at 2.2x unranked. Dataset, code, and baselines open. Submit at phbench.com and subscribe for weekly high-probability launches.

Add a comment

Replies

Best

@rajiv_ayyangar, thank you so much for hunting us!

Hey PH Community πŸ‘‹

We're , a Senior Technical Product Manager at Amazon and an independent researcher and , co-founder and GP of Vela Partners. Today, we're launching in collaboration with the University of Oxford ( and ) and , the leading quant VC.

And yes, the irony of launching a Product Hunt benchmark on Product Hunt is completely intentional πŸ™‚

Here's the origin story. We kept asking a question nobody had answered: Can you predict which Product Hunt launches will raise Series A funding, based solely on what you see on launch day (votes, rank, team size, category, timing)?

So we built PHBench. We collected 67,292 featured PH launches going back to 2019, matched them to Crunchbase funding records, and identified 528 verified Series A raises within 18 months. Seven years of data. Every featured launch.

Three findings I think this community will find interesting:

β†’ The signals work. Our model is 4.7x better than random. Statistically significant.

β†’ The strongest predictor isn't votes alone. It's team size Γ— community engagement together. A large coordinated team achieving high traction is more predictive than either signal alone.

β†’ B2B categories convert at 3x the baseline rate. API, Payments, Fintech. If you launch a developer tool on a Tuesday with a big team and high engagement, that's a strong signal.

We also tested three frontier Gemini models on the same task. The most capable model performed the worst. Better reasoning doesn't help with pure numbers.

The dataset is available on . The leaderboard is . The is public. Can you beat our baseline?

The paper is on and has been submitted to the NeurIPS 2026 Evaluations & Datasets Track.

Would love your feedback β€” especially from anyone who's launched on PH and gone on to raise Series A. You're in our dataset :)

Been quietly working on this with Yagiz, Yigit and Rick for a while.


While I mostly focus on using founder profiles to predict raises, PHBench tries the same prediction but from the product side. A similar question but from the other side.


Have a go at the leaderboard if you fancy; the data's on HuggingFace.

Β thank you for your valuable contributions. Excited to incorporate ProductHunt into founder prediction pipeline.

Really cool idea, Good luck!

Β  Β thank you Cem!

Β  Β appreciated!

Are you using only launch day signals, or do you include post launch traction like follows and comments over the first week?

Β The core signals are captured on launch day (votes, comments, daily rank, maker profiles, topic tags).

One caveat: maker follower counts were scraped in 2026, not at launch time, so for older launches, they reflect post-funding growth. It's a limitation we document in the paper.

Adding richer post-launch features like 7-day comment growth or follow-on engagement would be a great extension. We think there's a lot of untapped signal there.

Full details on the feature set are in Section 5 of the paper:

Interesting, most people assume raw upvotes are the proxy for quality. So the finding about team size Γ— community engagement being a stronger signal than votes alone is genuinely counterintuitive but very curious. Have you looked at whether solo founders who hit high engagement are penalized by this model? Do they show up as a distinct cluster? Would love to see how the signal degrades for truly first-time founders vs. repeat ones. Incredible dataset, congrats on getting years of data cleaned!

Β Thanks! And yes, the votes finding surprises everyone! Raw upvote count is actually one of our four "noise" features. It has high model importance but near-zero conditional lift. The reason: viral launches with 500+ votes are often consumer products riding a wave of hype that doesn't translate to institutional funding. The strongest signal is votes combined with daily rank. A #1 launch with high engagement raises Series A at 3.5x the baseline but votes without a strong rank is noise. Maker team size is #2 in importance, and maker follower count is #6 but carries a higher lift (2.4x vs 1.2x for team size alone). Suggesting that who's on the team matters more than how many.

On solo founders: we haven't done the cluster analysis you're describing, but the data is suggestive. Solo founders (maker_count = 1) underperform teams of 2-3, with a modest 1.2x lift for teams vs. baseline. But the bigger signal is follower count: a solo founder with a large following performs fine; a solo founder with no following is where the model gets skeptical.

We don't currently distinguish first-time vs repeat founders. That's a great feature idea, but maker IDs are redacted in the PH API for privacy, so it's not something participants can compute today. It would require a partnership with Product Hunt to access that signal. what do you think :)?

If you're curious about digging in, would love to see you submit a model:) You can get the full dataset from here:

Β  Β  Β 

I'll chime in with insights from other models and datasets :)

Repeat entrepreneurs are more likely to succeed than first-time entrepreneurs. However, repeat entrepreneurs that raised some money and failed is worse than first-time entrepreneurs.

So you're better of investing raw first-time entrepreneur in most cases than unsuccessful repeat entrepreneurs.

The models in other datasets is superbly clear. Solo founder is a negative signal but of course it's not a signal in which would cause a model to reject. It's just a weight in decision making.The key suggestion is that you need to be building a team to increase the chance of success.

X Corner Random Encounter: A Quick Take on PHBench 🧠

Huge congrats to Yagiz, Yiğit, and the Vela Partners team on this launch!

I just bumped into Yagiz and Yiğit at the X corner an hour ago. I threw a quick challenge at their core assumption, and they gave me some incredible, patient replies. Here is our quick chat. (With help from Gemini to organize my thoughts into this post!)

My Question on Twitter: Many top launches already raised seed money. They use PH as an amplifier, not a starting point. It feels like the high engagement vs. Series A is more of a correlation rather than causalityβ€”the "strong signal" might just be a lagging indicator of their pre-existing capital/resources.

Yagiz's Backdoor Insights: Yagiz dropped a total golden nugget. He told me: "88.3% of the 528 launches that raised Series A in our dataset had prior seed funding."

(And oh my god, that means I’m officially part of the 11.7% "naked" bootstrapped builders trying to survive with zero funding, haha! πŸ˜…)

Yagiz honestly framed that PHBench is a predictive benchmark, not a causal study. They don't claim causality. But he noted that while a funded team can buy upvotes, consistently landing Rank #1 is harder to manufacture.

Yiğit's Side Note: Yiğit also shared another fascinating data point with me: "Consistent social media posters increase their likelihood of success. However, not posting doesn't decrease your chance. It’s just neutral."

I guess those successful but quiet builders must have massive "invisible" networks that online data simply cannot capture.

My Takeaway: This conversation completely shifted my view. PHBench is essentially a "Funnel Accelerator" for VCs.

It does not discover hidden gems from absolute scratch. Instead, it predicts which already-backed teams have the top-tier GTM execution to dominate the market on launch day. If you already have seed money, PH is the ultimate stress test for your team.

My Personal Note:

To wrap up, I want to say how much I truly appreciate their hard work. Data cleaning and dataset building are brutal, sweating jobs. Really appreciate to see them doing all this heavy lifting and unselfishly open-sourcing the whole thing to the community. Thanks a lot.

Β Great writeup! Your "funnel accelerator" framing is great! Might steal that :)

Welcome to the 11.7% club. The bootstrapped builders who make it without seed funding are the real outliers in the dataset. πŸ’ͺ

Thanks for the kind words about the open-source work. That's exactly why we built it so the community can poke holes, challenge assumptions, and make it better.

 Haha, use it. It is my honor! 🀝

Β thanks for taking the time to exchange thoughts - love how you think about this.

We're rooting for bootstrapped founders.

In fact, Vela Partners is a bootstrapped business like yours, no equity funding! ;)

Really excited to bring PHBench to you guys! By extending the short-term productivity signals on Product Hunt to predict long-term funding materialization, we help to identify outlier products that are truly valuable in the VC environment. We think it will be greatly beneficial to the Product Hunt community.

Come to beat our baseline and get to the top of the leaderboard!

Β excited to see more predictors in ProductHunt community to join us! :)

So excited to see this live! This has been a labor of love, collecting data, running +100 experiments, and testing LLMs against good old gradient boosting.

The leaderboard is open. If you can beat us, you're the new champion. Who's in?

Β looking forward to seeing some competition soon!!!

Would be interesting to see a breakdown of false positives: high PH engagement but no Series A. That’s often where the real insight is.

Β Totally agree. The false positives are where it gets interesting.

Our model's top 50 test predictions have 10% precision, meaning 5 out of 50 are genuine Series A raises. That's 13x better than random chance (0.76% base rate), but it still means 45 out of 50 are false positives.

A few patterns we've noticed in the false positives: consumer/social products that go viral on launch day but lack the B2B unit economics VCs want, projects by well-followed makers that are side projects rather than fundable companies, and launches from 2022-2023 that hit strong signals during the funding winter when conversion rates dropped to 0.5%.

The data is fully open. A "false positive autopsy" on the top 100 predicted-but-didn't-raise would be a great community contribution. If you're up for it:)

Β  Β that's a great question.

We have an LLM-native ML library at , we could use that to analyze and report back some qualitative analysis.

After today's launch, we all expect to see PHBench's chances of hitting Series A based on its own model. Good luck!

Β Haha! I did run our launch through the model just for fun: daily rank looking strong, maker team of 4, developer tools category. The model would like our odds.

Unfortunately, PHBench itself isn't raising a Series A:) We're an open research benchmark, not a startup.

Β  Β haha nice one!

We're researchers trying to help the founder community!

12
Next
Last