engblogs

summaries of the latest blog articles from your favorite tech companies.
OpenAIOpenAI

Strengthening our safety ecosystem with external testing

Balancing transparency and safety in frontier AI through independent external third-party assessments that evaluate capability claims, safeguards, and governance across risk domains.

11/19/2025
OpenAIOpenAI

How evals drive the next chapter in AI for businesses

Evaluation frameworks ('evals') enable businesses to translate AI objectives into measurable, reliable outcomes through iterative testing, error analysis, and continuous improvement tailored to specific workflows and goals.

11/19/2025
OpenAIOpenAI

Strengthening our safety ecosystem with external testing

OpenAI enhances frontier AI safety by partnering with third party experts for independent evaluations, methodological reviews, and subject-matter expert probing, ensuring transparency, rigor, and trust in deployment decisions.

11/19/2025
Snorkel AISnorkel AI

A chat with the Terminal-Bench team

Terminal-Bench 2.0 and the Harbor framework offer robust, interactive, and scalable containerized benchmarks and execution environments designed to evaluate and optimize general-purpose CLI-based AI agents, driven by a community-focused, impact-oriented research philosophy.

11/19/2025
MIT AIMIT AI

New AI agent learns to use CAD to create 3D objects from sketches

MIT develops an AI agent trained on the VideoCAD dataset to operate CAD software via user-interface actions, transforming 2D sketches into 3D models and aiming to simplify CAD design for engineers and beginners alike.

11/19/2025
MIT AIMIT AI

The cost of thinking

Researchers find that advanced large language models (reasoning models) exhibit a 'cost of thinking' similar to humans by taking more time (tokens) on complex problems, revealing human-like stepwise problem-solving and internal computation parallels.

11/19/2025
OpenAIOpenAI

Intuit and OpenAI join forces on new AI-powered experiences

Intuit partners with OpenAI in a $100M+ multi-year deal to integrate AI-powered financial services and personalized experiences directly within ChatGPT, enhancing productivity and decision-making for consumers and businesses.

11/18/2025
MIT AIMIT AI

MIT Energy Initiative conference spotlights research priorities amidst a changing energy landscape

The MIT Energy Initiative conference highlights critical research priorities and collaborative strategies to address emerging energy challenges, focusing on grid resiliency, energy storage, sustainable fuels, carbon capture, vehicle electrification, and geopolitical factors influencing the energy transition.

11/18/2025
MetaMeta

Announcing the Completion of the Core 2Africa System: Building the Future of Connectivity Together

The completion of the 2Africa subsea cable system establishes unprecedented connectivity across Africa, Europe, and Asia, driving economic growth and digital transformation through advanced infrastructure and open access collaboration.

11/18/2025
MetaMeta

Efficient Optimization With Ax, an Open Platform for Adaptive Experimentation

Ax 1.0 is an open-source adaptive experimentation platform leveraging Bayesian optimization to efficiently guide complex, resource-intensive experiments across AI, infrastructure, and engineering domains.

11/18/2025
Lambda LabsLambda Labs

Lambda Raises Over $1.5B from TWG Global, USIT to Build Superintelligence Cloud Infrastructure

Lambda secures $1.5B funding from TWG Global and USIT to accelerate development of gigawatt-scale AI factories and superintelligence cloud infrastructure.

11/18/2025
CloudflareCloudflare

Cloudflare outage on November 18, 2025

A permissions change in Cloudflare's ClickHouse database caused a bot management feature file to exceed size limits, triggering widespread 5xx errors and service outages across Cloudflare's network on November 18, 2025, which were resolved through rollback and mitigation efforts by 17:06 UTC.

11/18/2025