An agent learns to choose the best course of action

The agent, a person or thing that can act or produce an effect, learns to select the optimal action through reinforcement learning using reward signals. It makes predictions about its environment using sensory observations. Credit: Han et al., 2024
Date:
05 June 2024
Copyright OIST (Okinawa Institute of Science and Technology Graduate University, 沖縄科学技術大学院大学). Creative Commons Attribution 4.0 International License (CC BY 4.0).