MetaFFmpeg at Meta: Media Processing at Scale
Meta details their migration from an internal FFmpeg fork to upstream FFmpeg, enabling parallel, multi-lane transcoding and real-time quality metrics to scale media processing and DASH playback across billions of uploads.
AWS MLBuilding specialized AI without sacrificing intelligence: Nova Forge data mixing in action
Nova Forge enables enterprise AI to specialize on proprietary data through data mixing, achieving strong in-domain performance while preserving general capabilities, as demonstrated by VOC classification and MMLU benchmarks.
SalesforceDelivering Accurate, Low-Latency Voice-to-Form AI in Real-World Field Conditions
Hybrid on-device speech-to-text paired with cloud semantic mapping enables accurate, low-latency voice-to-form data capture for Field Service Mobile in real-world conditions while preserving privacy and controlling costs.
AWS MLBuild safe generative AI applications like a Pro: Best Practices with Amazon Bedrock Guardrails
A practical guide to configuring Amazon Bedrock Guardrails for safe, production-ready generative AI, covering content filtering, prompt attack prevention, contextual grounding, and strategies for multi-turn conversations and detection-mode testing.
AWS MLBuild a serverless conversational AI agent using Claude with LangGraph and managed MLflow on Amazon SageMaker AI
Design and deploy a serverless, memory-enabled conversational AI agent that orchestrates tool calls and maintains context using Claude via Bedrock, LangGraph, and managed MLflow on Amazon SageMaker AI to handle multi-step customer-service workflows with observability.
DatabricksReal-Time Mode: Ultra-low latency streaming on Spark APIs without a second engine
Real-Time Mode (RTM) in Spark Structured Streaming delivers sub-second, ultra-low-latency real-time analytics within a single Spark API, eliminating the need for a second engine and simplifying architecture for live feature computation and inference.
Can AI agents build real Stripe integrations? We built a benchmark to find out
A benchmark-driven exploration of whether AI agents can autonomously build real Stripe integrations, covering backend, frontend, and browser-based end-to-end workflows, and highlighting strengths, failure modes, and the need for rigorous verification for production-grade payments.
Google CloudDesigning private network connectivity for RAG-capable gen AI apps
A reference architecture for private, internet-free connectivity of Retrieval-Augmented Generation (RAG)–enabled generative AI apps on Google Cloud, detailing end-to-end network design, data flow, and security controls.
Google CloudUnified Maintenance: A new, unified way to manage maintenance across Google Cloud
General Availability of Unified Maintenance introduces a centralized dashboard to view and manage cross-service maintenance events across Google Cloud, with standardized alerts via Cloud Logging and clear user controls.
DatabricksActivate first-party data with Meta Conversions API on Databricks
Leverages Meta Conversions API on Databricks to activate first-party signals from a governed Lakehouse for real-time, AI-enabled marketing optimization and scalable campaign activation.
DatabricksJefferies modernizes equity research at scale with Databricks and agentic analytics
Jefferies scales equity research with Databricks-powered agentic analytics and Jefferies Data Intelligence (JDI) to deliver fast, governance-backed, multi-source insights through a conversational analyst workflow.
MetaInvesting in Infrastructure: Meta’s Renewed Commitment to jemalloc
Meta renews its commitment to jemalloc to modernize the codebase, reduce maintenance and technical debt, improve memory efficiency, and deepen collaboration with the open-source community to adapt jemalloc to new hardware and workloads.