Is reinforce model-free?
Is reinforce model-free? In reinforcement learning (RL), a model-free algorithm (as opposed to a model-based one) is an algorithm which does not use the transition probability distribution (and the reward function) associated with the Markov decision process (MDP), which, in RL, represents the problem to be solved. Is AlphaZero model-free? Model-Free vs Model-Based RL Agents […]