Papers Discussions

G. Tesauro. Temporal Difference Learning and TD-gammon.

https://dl.acm.org/doi/10.1145/203330.203343
https://www.bkgm.com/articles/tesauro/tdl.html

April 11, 2025
  • Curiosity dream mechanism
  • Imitation learning

Silver et al. Reward is enough.

https://www.sciencedirect.com/science/article/pii/S0004370221000862

April 11, 2025
  • Multi objective reinforcement learning

Silver et al. Mastering the game of Go with deep neural networks and tree search.

https://www.nature.com/articles/nature16961

May 9, 2025

Silver et al. Mastering the Game of Go without Human Knowledge.

https://www.nature.com/articles/nature24270.epdf?author_access_token=VJXbVjaSHxFoctQQ4p2k4tRgN0jAjWel9jnR3ZoTv0PVW4gB86EEpGqTRDtpIz-2rmo8-KG06gqVobU5NSCFeHILHcVFUeMsbvwS-lxjqQGg98faovwjxeTUgZAUMnRQ

May 9, 2025

Leibo et al. Multi-agent Reinforcement Learning in Sequential Social Dilemma.

https://arxiv.org/abs/1702.03037

May 9, 2025

Sung Park et al. Generative Agents: Interactive Simulacra of Human Behavior.

https://arxiv.org/pdf/2304.03442.pdf

May 30, 2025

Dan Hendricks et al. Aligning AI Systems with Shared Human Values.

https://arxiv.org/abs/2008.02275

May 30, 2025