engblogs

summaries of the latest blog articles from your favorite tech companies.
CloudflareCloudflare

Route leak incident on January 22, 2026

Misconfigured routing policy automation caused a 25-minute IPv6 BGP route leak at Cloudflare’s Miami data center, exposing prefixes to unintended peers and workloads, followed by rapid rollback, incident response, and security-focused mitigations.

1/23/2026
Google CloudGoogle Cloud

Improving workflow orchestration with Apache Airflow 3.1 in Cloud Composer

Concise overview of Cloud Composer's Airflow 3.1 preview, detailing a decoupled architecture, DAG versioning, managed backfills, event-driven scheduling, HITL workflows, and multi-language extensibility to advance workflow orchestration.

1/23/2026
Google CloudGoogle Cloud

Monitoring Google ADK agentic applications with Datadog LLM Observability

Shows how Datadog LLM Observability automatically instruments Google ADK agentic applications to trace agent decisions, monitor token usage and latency, evaluate response quality and safety, and run offline and online experiments to optimize performance before production.

1/23/2026
AWS MLAWS ML

How the Amazon.com Catalog Team built self-learning generative AI at scale with Amazon Bedrock

A scalable, self-learning generative AI system for catalog enrichment that uses multiple lightweight worker models and a supervisor agent on Amazon Bedrock to extract and improve product attributes at scale.

1/23/2026
AWS MLAWS ML

Build AI agents with Amazon Bedrock AgentCore using AWS CloudFormation

Demonstrates provisioning and orchestrating production-grade AI agents with Amazon Bedrock AgentCore via AWS CloudFormation and Infrastructure as Code, highlighting modular templates, observability, and end-to-end weather-focused workflows.

1/23/2026
OpenAIOpenAI

Inside GPT-5 for Work: How Businesses Use GPT-5

A technical look at how GPT-5 enables workplace AI, with deployment patterns, integration approaches, and real-world business use cases.

1/22/2026
DatabricksDatabricks

Building Responsible and Calibrated AI Agents with Databricks and MLflow: A Real-World Use Case Deep Dive

Explores how to architect, evaluate, and govern production-grade AI agents on Databricks MLflow (with LangGraph) to ensure responsible, calibrated performance in telecom use cases through guardrails, monitoring, and guideline-driven evaluation.

1/22/2026
Google CloudGoogle Cloud

Getting Started with Gemini 3: Deploy Your First Gemini 3 App to Google Cloud Run

An end-to-end guide to building, testing, and deploying a Gemini 3 app on Google Cloud Run using Google AI Studio's Build mode, turning ideas into a shareable URL with API access.

1/22/2026
OpenAIOpenAI

Scaling PostgreSQL to power 800 million ChatGPT users

A concise exploration of scaling PostgreSQL to support 800 million ChatGPT users, focusing on performance, reliability, and operational scalability.

1/22/2026
SalesforceSalesforce

How Agentforce, Data, and Apps Turned the Salesforce Stack into Agentforce 360

How Agentforce 360 unifies Salesforce’s data, agent, and applications into a single real-time, AI-driven platform powering enterprise experiences across service, sales, and marketing.

1/22/2026
Google CloudGoogle Cloud

How Google SREs Use Gemini CLI to Solve Real-World Outages

Gemini CLI and Gemini 3 empower Google SREs to orchestrate AI-assisted, human-in-the-loop outage response—from paging to postmortem—dramatically shortening Mean Time to Mitigation.

1/22/2026
Google CloudGoogle Cloud

Scaling WideEP Mixture-of-Experts inference with Google Cloud A4X (GB200) and NVIDIA Dynamo

Scaling WideEP MoE inference on Google Cloud's A4X (GB200 NVL72) with NVIDIA Dynamo to deliver rack-scale, disaggregated MoE serving that balances throughput and latency via GPUDirect RDMA, KV caching, and GKE orchestration.

1/22/2026