
Scaling laws for reward model overoptimization
Understanding the scaling laws involved in overoptimization of reward models.

Augmenting Fuzzy Matching with Human Review to Maximize Precision and Recall
Combining human review with a classification model to improve precision and recall in fuzzy matching.

What's New with SoundCloud, October 2022
Updates to the SoundCloud iOS and Android apps to improve user experience based on user feedback.

Stopping malaria in its tracks
Using AI to help develop a better malaria vaccine, one that could save hundreds of thousands of lives every year.

Collective Decision-Making with AHP
How the NYT Identity team used the Analytic Hierarchy Process (AHP) to decide collectively on a user ID format.

Android Image Loading at SoundCloud
Exploring the new image loading features in the recent SoundCloud Android app redesign.

Measuring perception in AI models
Introducing the Perception Test, a multimodal benchmark using real-world videos to evaluate perception capabilities of AI models.

From Hot Wheels to handling content: How brands are using Microsoft AI to be more productive and imaginative
Brands are leveraging Microsoft AI to increase productivity and creativity.

How undesired goals can arise with correct rewards
Understanding how AI systems can unintentionally pursue undesired goals due to goal misgeneralisation, even when trained with correct specifications.

Microsoft open sources its ‘farm of the future’ toolkit
Microsoft open sources its ‘farm of the future’ toolkit.