engblogs

Optimizing AI models through human feedback loop integration

Exploring the inner workings of a microservice at SoundCloud

Improving learning efficiency through the use of parameter noise

Understanding Proximal Policy Optimization for Enhanced Reinforcement Learning

Exploring strategies to defend against robust adversarial inputs in machine learning models

Exploring the benefits of hindsight experience replay in machine learning algorithms

Exploring teacher-student curriculum learning methods in the context of education

Optimizing Python code performance for physics simulations

A method for signing in to a device without a keyboard using a game controller and onscreen keyboard.

Defining ownership of datasets and ensuring the right teams own the right datasets for better data management.

Leveraging insights from human preferences to enhance user experiences.

Exploring the dynamics of cooperation, competition, and communication