Edit me

Additional elements of reinforcement learning

Reward

A reward defines the goal of an RL problem, and specifies the immediate benefit (positive and negative events) of being in a specific state. It is the feedback sent by the environment to the RL agent, which the latter tries to maximise over time. In a biological system, we might think of rewards as analogous to the experiences of pleasure or pain.

Value function

The value function specifies what is rewarding over time by accumulating all the rewards obtained by the agent.

Value

The value is the total expected reward from a state and onwards into the future. It assists the agent in predicting future rewards by allowing the exploration of states that may otherwise withhold higher rewards. Based on the value assessment, actions are then taken by the RL agent.

RL model

RL agents systems can make decisions using either a model-based or model-free approaches, even though the boundary between the two may not be as clear-cut in practice. The model is a set of rules that the agent is provided with in order to have some understanding of its environment. In this way, the learner can explore its environment more efficiently. Alternatively, in model-free approaches no such information is provided, and the learner evaluates the environment solely by trial-and-error. .





Tags: