SalesforceShattering AWS’s 250K-IP Ceiling: How Data 360 Reached 1 Million IPs with Zero-Downtime Migration
Data 360 overcame AWS's 250K IP limit by implementing prefix delegation combined with custom AI-driven observability and a dedicated SFSG, enabling zero-downtime scaling to 1 million IPs within Hyperforce's architectural constraints.
MetaPython Typing Survey 2025: Code Quality and Flexibility As Top Reasons for Typing Adoption
The 2025 Python Typing Survey reveals widespread adoption of type hints across experience levels, highlights key reasons like code quality and flexibility for their use, identifies challenges in tooling and third-party support, and outlines community-driven requests for enhanced features and better ecosystem consistency.
Lambda LabsHow to serve Kimi-K2-Instruct on Lambda with vLLM
A step-by-step guide to deploying and benchmarking the trillion-parameter Kimi-K2-Instruct MoE language model on Lambda's multi-GPU setup using vLLM for efficient large-scale inference.
CloudflareHow Workers powers our internal maintenance scheduling pipeline
Cloudflare uses a sophisticated maintenance scheduler powered by Cloudflare Workers and graph processing to automate and optimize global data center maintenance without disrupting customer traffic.
Solving database contention with optimistic locking
Implementing optimistic locking at Duolingo dramatically reduced database lock contention, improving notification timeliness and resource efficiency through careful testing and monitoring.
AWS MLBuild a multimodal generative AI assistant for root cause diagnosis in predictive maintenance using Amazon Bedrock
Leverage Amazon Bedrock's multimodal generative AI assistant with advanced sensor data analysis, guided troubleshooting, and multimodal retrieval to enhance root cause diagnosis in predictive maintenance, reducing downtime and improving operational efficiency across industries.
AWS MLEnhance document analytics with Strands AI Agents for the GenAI IDP Accelerator
Leverage the Analytics Agent, powered by Strands AI Agents within the GenAI IDP Accelerator, to enable non-technical users to perform natural language document analytics and generate actionable visual insights at scale without SQL expertise.
AWS MLDeploy Mistral AI’s Voxtral on Amazon SageMaker AI
Guide to deploying Mistral AI’s multimodal Voxtral models on Amazon SageMaker using vLLM and BYOC for advanced audio-text processing and function calling capabilities.
AWS MLMove Beyond Chain-of-Thought with Chain-of-Draft on Amazon Bedrock
Explore how Chain-of-Draft prompting on Amazon Bedrock significantly reduces token usage and latency compared to Chain-of-Thought, optimizing large language model reasoning for cost-effective, high-performance AI applications.
MetaDrP: Meta’s Root Cause Analysis Platform at Scale
DrP is Meta’s scalable root cause analysis platform that automates incident investigations using a flexible SDK, ML algorithms, and seamless workflow integrations to significantly reduce MTTR and improve on-call productivity across large-scale systems.
Google CloudFrom Code to Cloud: Three Labs for Deploying Your AI Agent
Explore three hands-on labs demonstrating how to deploy AI agents on Google Cloud using Vertex AI Agent Engine, Cloud Run, and Google Kubernetes Engine for scalable, secure, and production-ready applications.
Google CloudWhy Stochastic Rounding is Essential for Modern Generative AI
Stochastic rounding, a probabilistic rounding technique supported by modern hardware like Google Cloud TPUs and NVIDIA Blackwell GPUs, enables stable and efficient low-precision training for large generative AI models by preserving small gradient updates.