9Some authors use the term “model-free reinforcement learning” to clearly distinguish this from anything using planning. Planning uses a model of the world, thus it would be called “model-based”. Model-based reinforcement learning then refers to a set of algorithms for solving the planning problem, with the possible modification that instead of reaching a single goal, the plan may still attempt to maximize the sum of rewards.