Lookahead Bias from Smoothed Regime Estimates

This failure mode arises when an HMM backtest labels each period’s regime using full-sample smoothed probabilities — which condition on the entire observation sequence, including future data — and then “trades” those labels as if they had been known in real time. Because smoothed estimates use information from after the trade date, the strategy appears prescient when it is in fact peeking. Honest evaluation requires filtered or online inference, where the regime for day t uses only data up to day t; Bulla et al. (2011) and Shu/Yu/Mulvey (2024) both explicitly use rolling-window online inference for this reason. It appears in this vault as a key reason many HMM regime “profitability” claims do not survive proper out-of-sample testing.

Connections