engblogs

Powering the agents: Workers AI now runs large models, starting with Kimi K2.5

Powerful agent development on Cloudflare with frontier open-source models like Kimi K2.5, detailing the end-to-end large-model inference stack, serverless deployment, and cost-efficient asynchronous workflows for enterprise agents.

3/19/2026

Modular AI

Modular: Modular 26.2: State-of-the-Art Image Generation and Upgraded AI Coding with Mojo

Modular 26.2 unifies text, audio, and now image generation/editing under MAX, delivers 4x FLUX.2 speedups, and advances Mojo for AI-assisted, high-performance GPU-kernel development with new coding skills and open-source tooling.

3/19/2026

Databricks

Introducing AI Runtime: Scalable, Serverless NVIDIA GPUs on Databricks for Training and Finetuning

AI Runtime enables on-demand, serverless GPU training and fine-tuning at scale on the Databricks Lakehouse, unifying interactive notebooks, distributed training across A10s and H100s, and end-to-end governance and observability with Lakeflow and Genie Code.

3/19/2026

OpenAI

How we monitor internal coding agents for misalignment

A concise guide to detecting and preventing misalignment in internal coding agents through monitoring, telemetry, governance, and automated safety controls.

3/19/2026

AWS ML

Run NVIDIA Nemotron 3 Super on Amazon Bedrock

A detailed look at NVIDIA Nemotron 3 Super on Amazon Bedrock, covering its Hybrid Transformer-Mamba Mixture of Experts architecture, Latent MoE, serverless fully managed inference, open weights and datasets, and real-world use cases across software development, finance, cybersecurity, search, and retail, plus getting started with AWS CLI/SDK.

3/19/2026

Building an MCP Ecosystem at Pinterest

A concise technical overview of building and scaling the MCP ecosystem at Pinterest, detailing architecture, integration patterns, and developer tooling for interoperable components.

3/19/2026

AWS ML

Introducing V-RAG: revolutionizing AI-powered video production with Retrieval Augmented Generation

V-RAG introduces a retrieval-augmented approach to AI-powered video production that grounds generated videos in retrieved reference imagery via a vector database, enhancing accuracy, customization, scalability, and multimodal capabilities while reducing hallucination.

3/19/2026

AWS ML

Use RAG for video generation using Amazon Bedrock and Amazon Nova Reel

A VRAG-powered, multimodal pipeline that combines image retrieval, prompt-based video generation, and batch processing with Amazon Bedrock, Amazon Nova Reel, OpenSearch vector engine, and S3 to transform structured text and reference images into scalable, AI-generated videos.

3/19/2026

Databricks

Announcing General Availability of Real-Time Mode for Apache Spark Structured Streaming on Databricks

GA of Real-Time Mode (RTM) in Spark Structured Streaming on Databricks delivers millisecond-level latency with a unified Spark engine, eliminating the need for a separate streaming engine like Flink for real-time workloads.

3/19/2026

OpenAI

OpenAI to acquire Astral

Technical overview of the strategic and architectural implications of OpenAI's planned acquisition of Astral, focusing on integration, data flow, and platform interoperability.

3/19/2026

Stripe

Testing the impact of Adaptive Pricing across 1.5M subscription checkout sessions

An empirical analysis of Adaptive Pricing for subscriptions, showing how local currency pricing across 1.5M checkout sessions improves conversion, authorization, and lifetime value while stabilizing renewals amid fluctuating exchange rates.

3/19/2026

Duolingo

4 must-know particles for ending Japanese sentences

A concise, technically oriented guide to four Japanese sentence-final particles (ka, ne, yo, yo ne) and how their subtle nuances, tones, and social functions shape questions, agreement, and everyday conversation.

3/19/2026