Computable AI

Equivalence between Policy Gradients and Soft Q-Learning

Inspecting the gradients of entropy-augmented policy updates to show that policy gradients and soft Q-learning are equivalent.

Braden Hoagland • Mon 12 August 2019 in Miscellany

Three Method Comparison for Traffic Signal Control

Comparing supervised learning, random search, and deep reinforcement learning on traffic signal control.

Daniel Cox • Sun 11 August 2019 in arXiv highlights

Learning Compound and Composable Policies

Straightforward hierarchical RL for concurrent discovery of sub-policies and their controller.

Daniel Cox • Sun 04 August 2019 in arXiv highlights

Efficient exploration with self-imitation learning

I wonder if that happens every time...

Daniel Cox • Sun 28 July 2019 in arXiv highlights

Distributional Deep Q-Learning

Extending DQN to estimate full return distributions, and exploring why this helps learning.

Braden Hoagland • Fri 26 July 2019 in Miscellany

Keeping to the Narrow Path

Better imitation learning with policies that self-correct through negative sampling.

Daniel Cox • Sun 21 July 2019 in arXiv highlights

Look at This: Where We See Shapes, AI Sees Textures

CNNs trained in "the usual way" tend to learn something different from what you might expect: they recognize textures (local structure) rather than shapes (global structure).

Daniel Cox • Tue 16 July 2019 in Look at This

Way Off-Policy Batch DRL

Pre-training using a generative model of pre-recorded trajectories and bias correction.

Daniel Cox • Sun 14 July 2019 in arXiv highlights

A New Series: arXiv Sampler

Beginning a new series highlighting a few interesting RL papers from the arXiv each week. This week: simple curriculum learning, learning to interact with humans, and warm-starting RL with propositional logic.

Daniel Cox • Sun 07 July 2019 in arXiv highlights

Boltzmann Machines: Differentiation Work

My differentiation work while reading Ilya Sutskever on the biological plausibility of Boltzmann machines.

Daniel Cox • Sun 10 March 2019 in Math
