Outcomes — Markov Trading Model Profitability

Goal

Determine whether Markov-based trading models can be substantiated as profitable trading approaches, separating genuine tradeable edge from academic backtest artefacts, regime-classification usefulness, overfitting, and marketing claims.

The full definitive answer is in Conclusion. This page is the Goal & Outcomes summary. Back to the index.

The claim under test

“A Markov trading model uses probability to analyse financial markets by predicting the likelihood of future price actions or market regimes based solely on the current state. It assumes that the current market condition depends only on the immediate past state, ignoring older data.”

Verdict on the claim: loosely accurate for a simple Markov chain (observed state, genuine first-order memory) — but misleading for the rest of the family: in an HMM and regime-switching model the state is hidden and inferred, not observed; an MDP is a control framework, not a price predictor; and practitioners routinely reintroduce older data via rolling windows and engineered features.

Verdict

Final verdict (round 10 — high confidence)

Markov-based models are substantiated as a regime-detection and risk-management component, not as standalone profitable trading systems. Regime classification reliably reduces volatility and drawdown; converting it into directional alpha that beats a benchmark after costs is not substantiated. No surveyed study reaches a strong evidence grade, and no credible public live track record exists. A deliberate steelman search for the best counter-evidence did not overturn this.

Evidence by model family

Model familyBest-evidenced useProfitability grade
Markov Chain Trading ModelDescriptive state/risk classificationweak
Hidden Markov Model Regime DetectionRegime / risk filterinconclusive for alpha
Markov Regime-Switching ModelDescriptive econometrics + risk filterinconclusive for alpha
Statistical Jump ModelRegime detection, lower turnover than HMMmoderate (single research network)
Markov Decision Process Trading ModelProblem formulation (Optimal Execution)inconclusive (cost-reduction, not alpha)
Reinforcement Learning Trading PolicyPolicy optimisationweak / negative

Evidence by dimension

Round-by-round log

  • R1 seed — 5 model families + evidence-standard concepts/risks mapped.
  • R2 evidence standard — transaction costs, overfitting, data-snooping, out-of-sample testing established as the grading backbone.
  • R3 regime use-mode — regime classification confirmed as risk control, inconclusive-to-weak for alpha; statistical jump model.
  • R4 foundational papers — Hamilton-lineage regime-switching is econometrics, not trading research.
  • R5 RL papers — RRL foundational work dated/un-replicated; deep-RL surveys document a reproducibility crisis.
  • R6 MDP / execution — optimal execution is genuine practice but cost-reduction, not alpha; order books are non-Markovian.
  • R7 live evidence — regime classification disclosed in production; standalone profitable Markov system absent from public record.
  • R8 asset classes — crypto negative-to-weak; equities strongest-but-bounded; FX/futures favourable venues, still backtest-only.
  • R9 failure modes & stubs — mechanical failure modes promoted; algorithm notes (Baum-Welch, Viterbi) sound, the data is the problem.
  • R10 steelman + recent work — best counter-evidence grades only moderate; 2024-2025 advances repeat the pattern (better backtests, same unsolved problems).

External framing

The project’s own practitioner-style assessment (gpt-markov-trading-model-assessment.md, project root) independently reached the same framing: use Markov/HMM as a regime layer above an existing strategy, with entry rules, exit rules, a risk model, costs and out-of-sample proof supplied separately. The vault’s peer-reviewed evidence corroborates that framing.

See also