OpenAIIntroducing GPT-5.4 mini and nano
Concise introduction to GPT-5.4 mini and nano, highlighting their compact architectures and practical implications for developers.
Apple MLAMES: Approximate Multi-modal Enterprise Search via Late Interaction Retrieval
AMES presents a backend-agnostic, two-stage late-interaction framework that unifies text, image, and video into a shared space for approximate multimodal enterprise search, employing ANN-based parallel token search with per-document Top-M MaxSim re-ranking in production Solr deployments.
From vendors to vanguard: Airbnb’s hard-won lessons in observability ownership
Airbnb's evolution from vendor-driven observability to internal ownership, sharing hard-won lessons on reliability and platform visibility.
DropboxHow we optimized Dash's relevance judge with DSPy
Case study of using DSPy to adapt and optimize Dash's relevance judge across models, yielding lower NMSE, improved reliability, and scalable, cost-efficient production.
Google CloudIntroducing multi-cluster GKE Inference Gateway: Scale AI workloads around the world
Previewing multi-cluster GKE Inference Gateway enables globally scalable, fault-tolerant AI inference by intelligently routing across multiple clusters and regions with model-aware load balancing and shared accelerator resources.
AWS MLAWS AI League: Atos fine-tunes approach to AI education
Atos partners with AWS to scale AI education through the AWS AI League, combining hands-on, gamified learning with SageMaker JumpStart to fine-tune LLMs for real-world insurance underwriting and deliver measurable AI fluency at scale.
Two Sigma5 Career Myths From Women in Engineering
Five women in engineering debunk career myths by sharing winding, cross-domain paths that emphasize curiosity, adaptability, and relationships over a fixed plan.
Dear Duolingo: How did the days of the week get their names?
A technical explainer tracing how the names of the days of the week arose, from Babylonian celestial naming to Latin and Germanic adaptations, and including numerical systems across languages.
DatabricksTalking to the Ground
Databricks' AI-powered, NLP-driven Genie Research Agent unifies subsurface, IoT, and ERP data in a lakehouse to enable real-time, cross-domain insights that cut NPT and boost EBITDA.
DatabricksHow TetraScience accelerates biopharma with production-ready data and scientific intelligence
How TetraScience's production-ready Scientific AI platform unifies heterogeneous lab data and automates end-to-end workflows across discovery, development, manufacturing, and quality to accelerate biopharma with explainable, audit-ready AI built on Databricks and NVIDIA infrastructure.
Lambda LabsLambda at NVIDIA GTC 2026: building the Superintelligence Cloud
Lambda's NVIDIA GTC 2026 debut outlines the Superintelligence Cloud architecture, pairing NVIDIA Vera CPUs with NVL72 Superclusters, Bare Metal Instances, NVIDIA Quantum-X Photonics, and STX-powered storage to accelerate agentic AI and scalable AI factories.
AWS MLAWS and NVIDIA deepen strategic collaboration to accelerate AI from pilot to production
AWS and NVIDIA expand a production-ready AI stack by scaling GPU-backed infrastructure, accelerating interconnects for disaggregated LLM inference, and enabling Reinforcement Fine-Tuning on Bedrock to move AI from pilot to production.