Comments on Eight Abstracts

An unfocused sweep of eight abstracts from a very busy week in AI research: Emergent tool use, why hierarchical learning can work so well, brain-inspired hardware for artificial neural networks, pretraining and transfer learning for RL, chromatic network compression, semi-supervised reward shaping, WGAN model imitation for model-based RL, and navigation in turbulent flows!

Reward tampering

Improving safety and control by preventing all manner of reward tampering by the agent itself.

  • 1
  • 2