
AI safety via debate
Exploring AI safety strategies using debate techniques

Repeatable exploratory programming
A blogpost about using Expect tests for repeatable exploratory programming

Evolved Policy Gradients
Exploring advanced techniques for policy gradients in machine learning models

Gotta Learn Fast: A new benchmark for generalization in RL
Exploring advancements in generalization for Reinforcement Learning through a new benchmark approach.

Retro Contest
Explore the resurgence of vintage development challenges in the modern era.

OCaml all the way down
Exploring the evolution of the software stack at Jane Street from statistical research to strategy execution.

Putting the I back in IDE: Towards a Github Explorer
A system for editing and reviewing code that puts the I back in IDE.

Managing Unplanned and Support Tasks
A blogpost about a process for managing unplanned and support tasks to minimize interruptions and allocate more time to planned features.

Variance reduction for policy gradient with action-dependent factorized baselines
Optimizing policy gradient efficiency with action-dependent factorized baselines

Improving GANs using optimal transport
Enhancing GANs performance through optimal transport techniques

Report from the OpenAI hackathon
Insights and Highlights from the OpenAI hackathon

On first-order meta-learning algorithms
Exploring the advancements in first-order meta-learning algorithms