Robotics & Perception/Reinforcement Learning

Reinforcement learning 기억 되새기기

https://chatgpt.com/share/67b31115-6508-800d-8ada-662b2fed132e

  1. Basics
    1. Monte-Carlo, Temporal Difference
  2. Extensions
    1. Policy Optimization
      • Soft Actor Critic
    2. Q-learning
      1. Double Q-learning
      2. Overestimation issue 피하기 --IQL
    3. Interpolating Between Policy Optimization and Q-Learning.
      1. Soft Actor-Critic
      2. DDPG