Curse of Dimensionality

The curse of dimensionality, a term coined by Richard Bellman in his 1957 book Dynamic Programming, is the failure mode in which the state (and action) space of an MDP grows so large that exact Dynamic ProgrammingValue Iteration or policy iteration — becomes computationally infeasible. It appears in this vault because it is the binding limitation on the classical MDP route: Pedersen 2023 reports that the dynamic-programming solution of the optimal-execution model “is infeasible for large portfolios,” which is the practical reason reinforcement learning and function approximation are used to find near-optimal policies instead of exact ones.

Connections