Dueling Network Architecture
Advanced & Deep RL DS practice problem on Onlearn.
Difficulty: hard.
Topics: Understanding Dueling Deep Q-Networks (Dueling DQN), Advantage Function, State-Value Function, Aggregation Layer, Mean Centering, Decoupled Representation, Reinforcement Learning, Deep Learning, Neural Network Architectures, Optimization Theory, Function Approximation, Value-Based Methods, Temporal Difference Learning, Policy Evaluation, Deep Q-Learning, Experience Replay.
Implement a Dueling DQN architecture class using PyTorch. The model must take an input dimension and number of actions, outputting Q values calculated as Q(s, a) = V(s) + (A(s, a) mean(A(s, a))).