We are working on the course content with rocket speed !! sooner we will be loaded with all the courses

Course 1 : Reinforcement Learning-Tabular Solution Methods

Course Curriculum

Multi armed bandits
Multi-armed bandits are a class of reinforcement learning problems where an agent must decide which action (or "arm") to choose from a set of options, each associated with an unknown reward.

  • A k-armed Bandit Problem
    00:00
  • Action-value Methods
    00:00
  • The 10-armed Testbed
    00:00
  • Incremental Implementation
    00:00
  • Tracking a Nonstationary Problem
    00:00
  • Optimistic Initial Values
    00:00
  • Upper-Confidence-Bound Action Selection
    00:00
  • Thompson sampling
    00:00
  • Softmax Algorithm
    00:00
  • Gradient Bandit Algorithms
    00:00
  • Associative Search (Contextual Bandits)
    00:00

Familiarity with Markov property

Markov Decision Process (MDP)

Dynamic Programming

Monte Carlo Methods

TD learning

Student Ratings & Reviews

No Review Yet
No Review Yet
40,000

Material Includes

  • self-learning videos
  • PDFs
  • Python notebooks
  • Assignments
  • Quiz