Glossary
Reinforcement learning: An AI training method in which a model learns by trial and error, receiving rewards or penalties for the actions it takes.
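
As a rough illustration of learning from rewards and penalties, the sketch below trains a tiny agent that chooses between two actions. Everything in it is an assumption made for the example rather than part of the definition: the function name train_agent, the reward values (-1.0 as a penalty, +1.0 as a reward), the learning rate, and the exploration probability are all hypothetical.

```python
import random


def train_agent(rewards, episodes=200, learning_rate=0.1, explore_prob=0.2, seed=0):
    """Learn an estimated value for each action from rewards and penalties.

    `rewards[i]` is the (hypothetical) fixed feedback for taking action i:
    a negative number acts as a penalty, a positive number as a reward.
    """
    rng = random.Random(seed)          # seeded so runs are repeatable
    values = [0.0] * len(rewards)      # the agent's current estimate per action

    for _ in range(episodes):
        if rng.random() < explore_prob:
            # Explore: occasionally try a random action.
            action = rng.randrange(len(rewards))
        else:
            # Exploit: take the action currently believed to be best.
            action = max(range(len(values)), key=values.__getitem__)

        reward = rewards[action]
        # Nudge the estimate for this action toward the feedback received.
        values[action] += learning_rate * (reward - values[action])

    return values


# Action 0 is penalized, action 1 is rewarded (illustrative values only).
values = train_agent(rewards=[-1.0, 1.0])
best_action = max(range(len(values)), key=values.__getitem__)
print("estimated values:", values)
print("preferred action:", best_action)
```

After training, the agent's estimate for the rewarded action is higher than its estimate for the penalized one, so it prefers the rewarded action. Practical systems use far richer environments and update rules, but the feedback loop is the same: act, receive a reward or penalty, adjust.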