OpenAIImproving mathematical reasoning with process supervision
Training a model to improve mathematical reasoning by rewarding correct steps of reasoning instead of just the final answer, leading to better performance and aligning the model with human thinking.
Google AILarge sequence models for software development activities
Large sequence models for software development activities that use the process of software development as the source of training data, resulting in promising results for professional software developers and the potential for developing agents that can assist across the software development process.
The New York TimesDay in the Life of a Senior Software Engineer on the Photo Publishing Team
A glimpse into the daily responsibilities and projects of a senior software engineer on the Photo Publishing Team at the New York Times.
Snorkel AISnowflake Snowpark: cloud SQL and Python ML pipelines
Snowflake Snowpark enables organizations to take ML experiments into production by providing production-grade ML data pipelines, data preparation in Python, and leveraging Scikit-Learn for feature engineering and ML model training in Snowflake.
Jane StreetOxidizing OCaml: Locality
Comparing the Rust and OCaml programming languages' system for tracking lifetime and ownership
MIT AICelebrating the impact of IDSS
Reflections on the impact of the Institute for Data, Systems, and Society at MIT during a two-day conference, ahead of the departure of its founding Director Munther Dahleh.
Google AIBarkour: Benchmarking animal-level agility with quadruped robots
Introducing the Barkour agility benchmark for quadruped robots, which measures robot agility and mobility through a diverse and challenging obstacle course, encouraging the development of fast and versatile locomotion controllers.
Google AIFoundation models for reasoning on charts
This blogpost discusses the challenges of reasoning on charts in visual language and proposes a foundation model called MatCha, which is trained on chart de-rendering and math reasoning tasks, surpassing previous state-of-the-art models in ChartQA.
OpenAIDemocratic inputs to AI
How to incorporate democratic inputs into AI development
Snorkel AILarge language models: their history, capabilities and limitations
Exploring the history, capabilities, and limitations of large language models, from the emergence of ChatGPT to the evolution of models like BERT and GPT-3, discussing key terms like attention, embeddings, and transformers, along with ways to adapt and use these models for specific tasks.
Google DeepMindAn early warning system for novel AI risks
A research framework for evaluating general-purpose AI models against novel threats, including extreme risks, to ensure responsible development and deployment.
MIT AIProbabilistic AI that knows how well it’s working
Probabilistic AI that provides accurate estimations of data explanations.