Skip to main content
Jonathan Becker
Jonathan BeckerMar 8, 2025 · 21 min read

Inside the Mind of the Prediction Market Trader

preview

The efficient market hypothesis suggests that asset prices perfectly aggregate all available information (Fama, 1970). Prediction markets—where contracts pay $1 if an event occurs and $0 otherwise—should be the purest test of this theory. A contract trading at 50 cents should reflect exactly a 50% probability.

We analyzed 72.1 million trades on Kalshi, a CFTC-regulated prediction market, covering $18.26 billion in notional volume across 7.68 million markets. What we found challenges the efficient market hypothesis in systematic, predictable ways.

Kalshi Quarterly Volume

Kalshi's growth provides a natural laboratory for studying market efficiency. Quarterly volume grew from $3.9 million in Q3 2021 to $8.8 billion in Q4 2025—a 2,200x increase. This explosive growth, particularly the acceleration in 2024-2025, means our dataset captures both a nascent market finding its footing and a maturing market with substantial liquidity.

Part I: The Longshot Bias

Quantifying Miscalibration

The foundational anomaly in prediction markets is the longshot bias: low-probability events are systematically overpriced (Thaler & Ziemba, 1988). Our data confirms this with exceptional granularity.

Actual Win Rate vs Contract Price

The calibration curve reveals the characteristic S-shape predicted by Prospect Theory's probability weighting function (Kahneman & Tversky, 1979). But the raw calibration chart understates the economic magnitude. Converting to percentage mispricing reveals the true scale:

Mispricing by Contract Price (%)

The mispricing is severe and asymmetric; at 1 cent, contracts win less than half as often as their price implies. An investor systematically buying 1-cent contracts would lose 57.5% of their expected value. For comparison, slot machines—the casino game most associated with terrible odds—return about 93% to players on the Las Vegas strip. Buying 1-cent prediction market contracts offers roughly half the expected return of pulling a lever at the Bellagio. This isn't noise—with 1.34 million trades at that price level, the statistical significance is overwhelming.

The mispricing follows a clear pattern: it's largest at the extremes (particularly low prices), converges toward fair value in the 20-40 cent range, and shows slight underpricing of favorites above 70 cents. This is precisely the shape predicted by Kahneman and Tversky's probability weighting function, where humans overweight small probabilities and underweight large ones. Prelec (1998) provides a formal mathematical treatment of this weighting function.

Why Does This Persist?

In traditional financial markets, arbitrageurs would eliminate such predictable mispricing. Several factors explain its persistence in prediction markets:

1. Position Limits and Capital Constraints. Kalshi imposes position limits, preventing sophisticated traders from fully arbitraging away the bias. Even a $25,000 position limit on a 5-cent contract yields only $1,250 of exposure—not enough for institutional capital to care.

2. Carrying Costs. Selling overpriced longshots requires locking up capital until market resolution. A trader selling 1-cent contracts must post 99 cents of margin per contract for potentially months. The annualized return, even on a massively mispriced contract, may not compensate for the capital lockup.

3. Counterparty Demand. The longshot bias requires willing buyers. As we'll see, retail traders provide this demand consistently, treating prediction markets like lottery tickets.

Part II: The Lottery Ticket Effect

The aggregate mispricing data might suggest a market-wide inefficiency. But examining trading behavior reveals a more specific story: certain traders systematically buy overpriced contracts.

Number of Trades by Contract Price

Trade frequency spikes dramatically at 1-2 cents—1.41 million trades at 1 cent alone, nearly double the count at any mid-range price. This concentration at extreme low prices cannot be explained by market mechanics; it reflects deliberate preference.

Average Contracts per Trade by Price

The behavioral signature becomes unmistakable when examining contracts per trade. At 1 cent, traders purchase an average of 548 contracts per trade—2.5x more than at 50 cents. The pattern is monotonic: as prices decrease, position sizes increase.

This is lottery ticket behavior. The psychology is straightforward: at 1 cent per contract, a $50 bet buys 5,000 contracts. If the event occurs, that $50 becomes $5,000—a 100x return. The asymmetric payoff structure activates the same mental accounting that drives Powerball purchases. Traders focus on the potential upside ("I could make $5,000!") while underweighting the probability ("It's only 1%... but maybe").

The aggregate numbers tell the story: traders placed 1.41 million trades at 1 cent, purchasing 773 million contracts, but won only 5,711 times—a win rate of 0.43%. The expected value destruction is massive: these traders collectively bet on outcomes with 1% implied probability that actually occurred 0.43% of the time.

Part III: Smart Money vs. Dumb Money

If retail traders are systematically overpaying for longshots, who is taking the other side?

Excess Win Rate by Trade Size

Trade size is the clearest predictor of performance. Examining excess win rate (actual win rate minus price-implied probability) across trade size buckets:

Trade SizeExcess Win RateN Trades
$0.10-1.45%513,639
$1.00-1.24%3,205,279
$10-1.08%6,232,777
$100-0.87%3,543,920
$1,000-0.35%544,586
$5,623+0.39%94,903
$10,000+0.51%26,495
$56,234+2.57%988
$100,000+9.59%253

The crossover point occurs around $5,600. Below this threshold, traders systematically lose money relative to fair value. Above it, traders generate positive excess returns.

This pattern is consistent with Kyle's (1985) model of informed trading. In that framework, informed traders choose larger position sizes because they have conviction in their information advantage. Uninformed traders, trading for entertainment or liquidity reasons, take smaller positions. The market maker (or in this case, the aggregate of limit order providers) extracts value from uninformed flow while paying for informed flow.

The magnitude matters: small traders (<$100) lose 1.2-1.5 percentage points per trade on average. With 10+ million trades in this category, that represents tens of millions of dollars transferred from retail traders to sophisticated counterparties annually.

Part IV: Category-Level Efficiency

Not all markets are created equal. Breaking down mispricing by category reveals dramatic differences:

Mispricing by Category and Price Level

CategoryVolume (billions)Market CountMispricing Pattern
Sports$12.653,597,627Severe at extremes
Politics$2.4010,033Moderate, centered
Crypto$0.841,868,821Volatile, noisy
Finance$0.701,672,851Best calibration

Sports markets exhibit the most severe mispricing, with longshot contracts (1-10 cents) showing 50-100%+ mispricing. This aligns with the market's purpose: sports betting is entertainment, and entertainment value comes from the possibility of a big win, not expected value optimization. Snowberg & Wolfers (2010) explore whether this bias stems from risk-love or probability misperceptions.

Politics and finance markets show tighter calibration across the price spectrum. These categories attract traders attempting to forecast outcomes rather than seeking entertainment. The lower volume per market (politics averages $239K/market vs. sports at $3.50K/market) suggests more concentrated, deliberate trading.

Volume Share by Category

The category composition explains why aggregate mispricing statistics may understate efficiency in "serious" forecasting domains. Sports—the least calibrated category—represents 69% of total platform volume. Weighted by volume, the platform is primarily a sports betting venue with a forecasting platform attached.

Part V: Temporal Dynamics

Intraday Patterns

Trading activity follows predictable patterns that reveal market composition:

Trading Volume by Hour

Volume peaks between 7-10 PM ET (4-5.3 million trades/hour), when retail traders are home from work. It troughs between 3-5 AM ET (197-258K trades/hour).

But the critical finding is that efficiency varies inversely with retail participation:

Market Efficiency by Hour (Excess Win Rate %)

Time (ET)Trades (M)Avg Trade SizeExcess Win Rate
3 AM0.26$75-2.30%
8 AM1.80$105-0.70%
9 AM2.45$103-0.92%
10 AM3.00$97-0.79%
10 PM5.33$144-1.39%

The 3 AM to 10 AM comparison is striking: despite 10x lower volume, overnight markets are 3x more mispriced. The mechanism is clear from trade size: overnight average trades are $75 vs. $103-105 during morning business hours.

Average Trade Size by Hour ($)

The evening paradox is notable: 10 PM has both the highest volume AND the largest average trade size ($144), yet excess returns are still -1.39%. High volume doesn't guarantee efficiency—the composition of that volume matters. Evening trading mixes sophisticated traders closing positions with retail traders making entertainment bets.

Day of Week Effects

Market Efficiency by Day (Excess Win Rate %)

Weekend volume increases (likely driven by sports markets), but efficiency decreases. Every day shows statistically significant negative excess returns (all p-values < 0.05), but weekdays cluster around -0.9% while weekends approach -1.1%.

Part VI: Price Action and Overreaction

Momentum and Mean Reversion

Do traders respond rationally to price movements, or do they systematically over- or under-react?

Excess Returns by Recent Price Movement

We measured excess returns conditional on recent price movement (the change over the previous 5 trades):

Recent Price ChangeExcess Win RateN Trades
< -10 cents-2.71%1.80M
-5 to -2 cents-0.67%4.02M
No change (0)-0.97%27.12M
+2 to +5 cents-1.12%4.09M
+5 to +10 cents-2.04%2.20M
> +10 cents-5.37%1.83M

Traders who buy after large price increases (>10 cents) suffer 5.37% excess losses—the worst performance of any category we measured. This is textbook momentum chasing: prices spike on news, retail traders pile in at the top, and mean reversion follows.

The asymmetry is notable: buying after crashes (-10 cents) yields -2.71% excess returns—bad, but half as bad as buying after spikes. Contrarian strategies don't generate alpha, but they avoid the worst behavioral traps.

Market Timing

When during a market's lifecycle do traders participate?

Volume by Time to Market Close

The concentration of activity near resolution is extreme. Examining volume by percentage of market lifetime elapsed:

% Through LifetimeVolume ($ billions)% of Total
0-10%$0.141.7%
10-20%$0.172.1%
80-90%$0.577.1%
90-100%$5.7470.6%

Over 70% of all trading volume occurs in the final 10% of a market's lifetime. This has important implications:

  1. Limited forecasting value. If markets only achieve substantial liquidity near resolution, they cannot serve their proposed function of aggregating information early, when forecasts would be most valuable.

  2. Information vs. reaction. Late-stage trading likely represents reaction to public information rather than private forecasting. By the time 70% of volume arrives, the outcome is often nearly determined.

  3. No early-mover advantage.

Excess Returns by Entry Timing

Traders entering early (0-10% through lifetime) show excess returns of -1.0%, nearly identical to traders entering late (90-100%) at -1.0%. Entry timing does not predict performance—consistent with the hypothesis that price discovery happens primarily through the market mechanism, not through timing skill.

Part VII: Market Duration and Efficiency

How long a market stays open significantly impacts its behavior:

Calibration Bias by Market Duration (Excess Win Rate %)

DurationMean Abs ErrorExcess Win RateAvg Trade SizeTrades (M)
1-4 hours34pp-1.10%$595.8
4-12 hours32pp-1.27%$751.7
1-3 days34pp-1.04%$9319.1
1-4 weeks35pp-1.51%$15719.0
>4 weeks30pp-0.51%$17310.5

Long-duration markets (>4 weeks) show the best calibration (-0.51% excess returns) and the largest average trade sizes ($173). This pattern suggests:

  1. Selection effects. Traders willing to lock up capital for 4+ weeks may be more sophisticated and better calibrated than those seeking quick entertainment.

  2. Convergence time. Longer markets allow more time for information to be incorporated and mispricing to be arbitraged away.

  3. Category composition. Long-duration markets are more likely to be politics/finance (better calibrated categories) than sports (typically short-duration).

Part VIII: Behavioral Signatures

Round Number Clustering

Trade Ratio at Each Price (observed/expected)

Trades cluster at psychologically salient prices. The trade ratio (observed trades / expected trades if uniformly distributed) reveals:

Price TypeTrade RatioExample Prices
Extreme (1-3¢, 97-99¢)1.29x99¢: 1.43x, 1¢: 1.94x
Key Round (25, 50, 75, 90)1.04x50¢: 1.28x
Other Round (multiples of 5)1.01x5¢, 15¢, 35¢
Non-round0.97x47¢, 63¢, 82¢

The 50-cent price—maximum uncertainty—attracts 28% more trades than expected. The 99-cent price attracts 43% more. This clustering is statistically significant (χ²=401,343, p<0.001) and suggests traders anchor on round numbers rather than calculating precise fair values.

Liquidity and Efficiency

Average Bid-Ask Spread by Price

Bid-ask spreads show a striking pattern: widest at 50 cents (maximum uncertainty), narrowest at the extremes. The correlation between spread and volume is ρ = -0.993 (p < 10⁻¹⁰)—nearly perfect negative correlation.

The spread-accuracy relationship (ρ = 1.000) confirms the intuition: wider spreads correlate with worse price accuracy. This creates a feedback loop where illiquid markets remain inefficient because the cost of correcting mispricing exceeds the expected profit.

Synthesis: Who Wins and Who Loses

The evidence points to a consistent picture:

Winners:

  • Large traders (>$5,600 per trade) generating 0.4-9.6% excess returns
  • Traders active during business hours (8-11 AM)
  • Participants in politics/finance markets
  • Those avoiding momentum chasing

Losers:

  • Small traders (<$100) losing 1.2-1.5% per trade
  • Overnight traders (2-5 AM) losing 2.3% excess
  • Sports/crypto traders at extreme prices
  • Momentum chasers buying after >10% moves, losing 5.4%

The aggregate wealth transfer is substantial. With $18.26 billion in volume and an average mispricing of roughly 1%, the behavioral tax extracted from unsophisticated traders approaches $150-200 million in our dataset.

Conclusion

Prediction markets are not efficient in the academic sense. They exhibit systematic, predictable biases that persist despite substantial liquidity. The longshot bias alone—a 30-60% mispricing at low prices—represents a deviation from rational expectations that would not survive in traditional financial markets.

Yet the inefficiency is not random. It follows precise patterns:

  1. It's concentrated. Mispricing is severe at price extremes and minimal at 20-40 cents.
  2. It's categorical. Sports and crypto markets are entertainment venues; politics and finance are closer to forecasting tools.
  3. It's temporal. Overnight and weekend trading is worse than business hours.
  4. It's size-dependent. Small traders subsidize large traders.

These patterns are consistent with a market that successfully aggregates information from sophisticated participants while extracting entertainment value from retail traders. The prediction market, in this view, is both a forecasting mechanism AND a gambling venue—and the two functions coexist because they serve different user populations.

For those seeking to use prediction markets as forecasting tools, the implications are clear: focus on politics and finance, trade during business hours, ignore price momentum, and weight large trades more heavily than small ones. For those seeking to profit, the path is equally clear: provide liquidity to retail traders buying overpriced longshots, particularly in sports markets during evening hours.

The efficient market hypothesis may be beautiful, but prediction markets reveal something more interesting: a system where efficiency and inefficiency coexist, segmented by participant sophistication, market category, and time of day. The market knows the truth—it just charges different prices to different people for access to it.


Resources & Citations


More Reading

Databroker.global Smart Contract AuditOct 17, 2021

The goal of this audit is to review Databroker's smart contract and find and demonstrate potential security issues within the solidity contract. This report will also focus on current Solidity & security best practices.

Adidas Originals NFT Drop PostmortemDec 18, 2021

Adidas recently partnered with popular NFT brands gmoney, Bored Ape Yacht Club, and PUNKS Comics to release a collection of NFTs called "Into The Metaverse". For their mint, Adidas set a limit of two NFTs per address, yet someone managed to mint 330 tokens in one singular transaction.

On Decoding Raw EVM CalldataDec 20, 2023

With the 0.7.0 release of Heimdall, the toolkit gained the ability to decode raw EVM calldata. In this article, we'll dive into the inner workings of calldata decoding and explore some of the use cases for this new feature.