StableBet
The Lab Β· Reference strategies

Are the AI's most-confident picks profitable?

The AI's most-confident picks lose -0.8% to SP over 361 real races. The closest thing to break-even we have measured, and still not a profit. The honest data.

Doesn't workTested on 361 racesROI: -0.8% (within noise)
18+ onlyResearch output, not adviceMethodology open Β· losses visible

Our in-house model lost 16.8% ROI on the pre-registered Oct-Nov 2024 backtest window.

This page publishes what it predicts and tracks every result. We do this because nobody else does β€” the methodology is open, the losses are visible, the analysis is honest. The model output is presented as a comparison to the market, not as a recommendation to back, lay, or stake on any runner.

Read the full methodology in our in-house AI horse-racing model write-up. Track the running ledger on the Stablebet track record page.

Gambling can be addictive. Please bet responsibly. Free, confidential support from GamCare, GamStop and BeGambleAware. See our responsible-gambling page for more.

The verdict

No, the AI's most-confident picks are not profitable. They are the least-bad bet on the board and still a loss, just one slow enough to hide inside the noise.

What this experiment settles

  • Do the AI's most-confident picks make money over a real backtest, or just look like they might?
  • Why does a model being sure about a horse not turn into profit at the betting window?
  • Is -0.8% to Starting Price a small edge, a small loss, or just statistical noise?

Methodology

Tested against the Stablebet betting-systems backtest, 27,909 GB races to industry SP, fallers settled as losses. Returns measured to industry SP, flat Β£10 win on the model's top-rated pick per race unless stated. The underlying ledger and per-race results are public at /our-track-record/; the model itself is described in the methodology write-up.

The claim

This is the most seductive entry on the whole board, and you can see why in one sentence: back the machine only when the machine is sure.

The pitch sells itself. We built an AI model that prices up every British race, runner by runner, off form, conditions and weather. Most of the time it is hedging, spreading its opinion across a field where it has no strong read. But every so often it plants a flag. It rates one horse 35 percent or better to win and effectively says, this one I am confident about. The claim is that those flagged runners, the strong fancies, are where the money lives. Ignore the model's noisy guesses, wait for the big confident picks, pile in, and let the clever computer do the hard work.

It is the cleanest logic on the page. A confident filter feels like it should strip out the rubbish and leave only the gold. The sample is large, the model is well calibrated, and the headline number, when you finally see it, lands a whisker from break-even. To a hopeful eye that reads as basically free money waiting for one small tweak.

So we tested it honestly, the same way we test every system in the Lab. Flat stakes, real prices, every confident pick settled as it actually finished. The question is simple and it is the one every punter who has ever trusted a tipster wants answered. When the smart model is genuinely sure, can you make money following it? The result is the closest any system came to passing. It still did not pass.

Why everyone swears by it

The appeal runs deeper than the other systems we test, because this one is not obviously daft. Doubling your stake after a loss is a maths trap you can explain in a minute. Backing only an AI's strongest fancies sounds like exactly what a sensible person should do.

The selectivity flatters it. When you throw away the model's worst longshot guesses and keep only the confident shortlist, you are left with well-fancied, short-priced runners that win a lot. Watch them go in race after race and it feels like proof. The strike rate is high, the losers are rare, and the running total drifts along close to flat instead of bleeding out the way a random or longshot strategy does. Nothing about the experience screams losing system, which is precisely the problem.

Then there is the number itself. About -0.8% reads, to a hopeful eye, as nearly winning. People look at a figure that close to zero and assume the gap is a rounding error or a small fee, the sort of thing one clever adjustment would erase. They are already spending the profit that is not there.

Underneath it all sits a single confusion, and it is the most important idea on the page. People run together two completely different things. One is a calibrated model knowing which races it has a genuine read on. The other is a model that can turn that read into profit. The first is real and the AI genuinely has it. The second is not. Pascal looks at -0.8% and hears it as nearly winning, a machine that is almost there. The Professor looks at the same number and sees a system that is simply losing slowly.

How it loses

Here is the quiet reason the confident picks land near break-even, and it is not the reason people hope.

When the model is genuinely sure about a horse, it is almost always agreeing with a short price the market has already got roughly right. That matters, because of where the bookmaker's margin sits. Short prices carry the thinnest slice of the over-round. Odds-on favourites lose only about 7 percent, the least-bad bet we have measured anywhere, against -37.3% for outright outsiders. A basket of confident, shorter-priced runners inherits the least-padded corner of the market. That is the whole secret. It is not skill. It is renting the least expensive seat in the house.

The catch is fatal to the dream. The model has no market price in its features, no speed figures and no Betfair starting price. It cannot out-read the crowd on these races because it is reading the same form the crowd already read. By the time the machine and the market both agree a horse should win, the bookmaker has already shortened it to match. There is no spare value left to take. The model is echoing a price, not beating it.

So the arithmetic closes in. The over-round is baked into every bet, about 12 percent per race and up to 30 percent in big fields. Even the sharpest, shortest runners bleed a little on average. A steady drip of -0.8% across hundreds of bets is still a loss, just a polite one. The confident filter buys you the cheapest corner of a rigged price, and the cheapest corner of a losing bet is still a losing bet.

How we tested it

We ran this the same brutal way we run everything in the Lab, with no thumb on the scale for the AI.

The pool is 27,909 real British races. For the confident system we kept only the runners the model rated 35 percent or better to win, the picks where the machine plants its flag. That left 361 bets. Every one was staked flat, the same notional amount, so a lucky big-priced winner cannot quietly carry the whole record. Then every bet was settled to industry Starting Price, the odds you would actually have got walking up at the off, not some flattering forecast price.

The settling rules are where most published backtests cheat, so we are blunt about ours. Fallers count as losing bets. Pulled-up horses count as losing bets. A horse that was sent off your confident pick and unseated at the second is a loss, full stop, because that is what it costs you in real life. Joint-favourites are split rather than double-counted. No Betfair commission is charged, which if anything makes the system look better than reality, not worse.

That last point matters here more than on any other page. This is the one system that brushes break-even, so the way you settle it decides everything. An earlier version of this very test wrongly dropped the fallers, and that single mistake made the whole thing look like a small profit. Put the fallers back where they belong, as the losing bets they are, and the number slides under zero. We would rather publish the honest -0.8% than the flattering fiction, because the flattering fiction is exactly what every tipster on the internet is selling you.

The numbers

Here is the result, plainly. Across the backtest the AI's most-confident picks returned about -0.8% to Starting Price over 361 bets. Read that honestly. It is a loss. It is a very small one, and it sits inside the error bars of break-even rather than proving any edge, but it is still a minus sign, not a profit.

The strike rate is high, and that is the part that fools people. These are short-priced, well-fancied runners, so a healthy share of them win. But a high strike rate on short prices is exactly what break-even looks like. You win often, you collect small returns each time, and the over-round skims its cut off every price, so the total still drifts gently into the red. Winning most of your bets and still losing money is not a paradox. On short prices it is the default.

Now the honest health warning on the -0.8% itself. The figure is fragile and easy to flatter. The confident sample is small at 361 bets and heavily skewed towards National Hunt, so any brush with zero is happening on a thin, jumps-heavy slice of history. The 95 percent range around the number comfortably includes both a small profit and a small loss, which is another way of saying the result is statistically indistinguishable from zero. Knock the rounding either way and you are still hovering at break-even, with the honest reading a slight loss.

And this is to Starting Price with no commission charged. Add real Betfair commission, or simply run the same confident filter forward on a normal mix of fresh races, and the wafer-thin cushion is gone. Out of sample it reverts to a loss. There is no number here you can build a stake on.

The verdict

So, are the AI's most-confident picks profitable? No. They are the least-bad system we have measured and still not a way to make money.

About -0.8% to Starting Price over 361 real races is a small, fragile loss that lives inside the noise of break-even, and it only ever brushed zero on a thin, jumps-heavy past sample to SP with no commission charged. It suits nobody as a staking plan. Charge real Betfair commission, or just run it forward on a normal mix of races, and the cushion vanishes. Treat the -0.8% as a backward-looking, no-commission curiosity, never as a wage and never as a reason to bet.

But do not throw the model out with the bathwater, because the real lesson here is the most useful thing on the whole board. The confident filter proves two true things at once. It proves the AI knows which races it can read, because by refusing to bet its worst longshot guesses it dodges the roughly -11% loss of backing its own top pick blind, much like the -12.5% you take blindly backing the favourite. That is genuine insight. And it proves the same AI cannot beat an efficient market, because it has no price, no speed figures and no Betfair SP to beat it with. A model knowing which races it has a read on is real and worth having. It is simply not the same thing as an edge.

That is the honest shape of this experiment, and it is why we publish it as confidently as any winner. The machine is clever enough to know what it does not know. It is not clever enough to make the over-round disappear, because nothing is. If you want to see whether we hold any edge worth staking on, we publish that either way, losses included, in the track record.

Frequently asked questions

Are the AI's most-confident picks profitable?
No. Backing only the runners the model rates 35 percent or better returned about -0.8% to Starting Price over 361 bets. That is a loss, not a profit, even though it is the closest to break-even of the 20 systems we tested. The result sits inside the error bars of zero, so the honest reading is a small, fragile loss rather than any edge.
If it is nearly break-even, why isn't that good enough to bet on?
Because -0.8% is still negative, the sample is small and jumps-heavy, and the figure was measured to Starting Price with no Betfair commission charged. Charge real commission, or just run the same filter forward on a normal mix of races, and the wafer-thin cushion disappears. There is no number here you can build a stake on.
What was the strike rate on the confident picks?
These are short-priced, well-fancied runners, so they win often by racing standards. The strike rate on the small 361-race sample was high enough to flatter the eye, but a high strike rate on short prices is exactly what break-even looks like. Winning a lot of small-priced bets and still losing 0.8% is the whole story, not a contradiction.
Does the model being confident mean the horse is a good bet?
No, and that is the key confusion. The model being sure a horse should win is not the same as that horse being a good price. By the time the model and the market agree, the bookmaker has already shortened the horse to match, so there is no spare value left and the overround quietly does its work.
Does this prove the AI is useless?
The opposite, in a narrow way. The confident filter strips out the model's worst longshot guesses, so it dodges the roughly -11% loss of backing its own top pick blind. That shows the model knows which races it can read. Knowing which races you have a read on is genuine insight. It is just not the same thing as an edge you can profit from.

What this experiment doesn't cover β€” and what we're testing next

Other Lab experiments