19(Sutton, 1991; Sutton and Barto, 2018). See also Mattar and Daw (2018) for a theoretical unification of replay and planning.