
Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration
The blogpost explores the importance of retaining emergent behaviors from curious exploration in maximizing the potential and usefulness of curiosity-based learning techniques.

Challenges in Detoxifying Language Models
Challenges in Detoxifying Language Models: This blogpost discusses the challenges in mitigating toxic language generation in language models and highlights the unintended consequences of detoxification.

Challenges in Detoxifying Language Models
Exploring methods to mitigate toxicity in language models and evaluating their effectiveness and limitations using classifier-based automatic toxicity evaluation.

TruthfulQA: Measuring how models mimic human falsehoods
Exploring model accuracy in replicating human misconceptions through TruthfulQA

Helen Toner joins OpenAI’s board of directors
Announcement of Helen Toner's inclusion in OpenAI's board of directors.

Backchannel: A relationship-based digital identity system
An alternative approach to digital identity that replaces user profiles with trusted digital relationships.

Goodbye Core_kernel
Restructuring standard libraries to eliminate difference between Core_kernel and Core at Jane Street with open source release anticipated by end of year

Service Architecture at SoundCloud — Part 2: Value-Added Services
Examining the evolution of service architecture at SoundCloud with a focus on value-added services.

Downstream Evaluations of Rotary Position Embeddings
Comparing Rotary Position Embeddings with GPT-style learned position embeddings in downstream evaluations.

OpenAI Codex
Exploring the capabilities of OpenAI Codex in automation and natural language processing

What the interns have wrought, 2021 edition
Internship season at Jane Street in 2021, including in-person interactions and insights into the company's operations.

Building architectures that can handle the world’s data
Building a general-purpose architecture, Perceiver IO, that can handle diverse inputs and produce a wide variety of outputs, enabling the processing of various types of data.