Equivalence between Policy Gradients and Soft Q-Learning
Inspecting the gradients of entropy-augmented policy updates to show their equivalence
Braden Hoagland • in Miscellany •
Distributional Deep Q-Learning
Expanding DQN to produce estimates of return distributions, and an exploration into why this helps learning
Braden Hoagland • in Miscellany •
Inaugural Post
The purpose statement and introduction to Computable AI.
Daniel Cox • in Miscellany •