
Inside a SoundCloud Microservice
Exploring the inner workings of a microservice at SoundCloud

Better exploration with parameter noise
Improving exploration in reinforcement learning by adding noise to policy parameters

Proximal Policy Optimization
Understanding Proximal Policy Optimization (PPO) for reinforcement learning

Robust adversarial inputs
Exploring adversarial inputs that stay effective under transformation, and what they mean for defending machine learning models

Hindsight Experience Replay
Exploring the benefits of hindsight experience replay in reinforcement learning algorithms

Teacher–student curriculum learning
Exploring teacher–student curriculum learning methods for training machine learning models

Faster physics in Python
Optimizing Python code performance for physics simulations

Remote device sign-in
Signing in to a device that has no keyboard, using a game controller and an on-screen keyboard

A Better Model of Data Ownership
Defining dataset ownership so that the right teams own the right datasets

Learning from human preferences
Training machine learning systems from human preference feedback

Learning to cooperate, compete, and communicate
Exploring how learning agents cooperate, compete, and communicate

UCB exploration via Q-ensembles
Using ensembles of Q-functions to drive upper-confidence-bound (UCB) exploration