engblogs

Modular: Modverse #53: Community Builds, Research Milestones, and a Growing Ecosystem

A community-driven update from Modular covering research milestones and the growing Modverse ecosystem, highlighting easy MAX installs for local GenAI deployment and access to 500+ optimized models.

3/6/2026

Unified Context-Intent Embeddings for Scalable Text-to-SQL

A unified embedding framework that combines context and user intent to enable scalable Text-to-SQL conversion.

3/6/2026

Apple ML

Multi-Frequency Fusion for Robust Video Face Forgery Detection

A lightweight two-branch fusion of handcrafted cues (a Wavelet-Denoised Feature paired with either phase-only SPSL or Local Binary Patterns), built on an Xception baseline, that makes video face forgery detection more robust, with higher AUC and a smaller model footprint.

3/6/2026

Apple ML

Flow Matching with Semidiscrete Couplings

A semidiscrete optimal transport (SD-OT) approach to flow matching that replaces costly batch OT with a dual potential learned via SGD, enabling efficient semidiscrete flow matching (SD-FM) for training time-dependent velocity fields.

3/6/2026

Databricks

LogSentinel: How Databricks uses Databricks for LLM-Powered PII Detection and Governance

LogSentinel is an LLM-driven data classification and governance workflow built on Databricks. It tracks schema evolution and labeling drift to automatically label PII and trigger remediation tickets, and it feeds improvements back into the Databricks Data Classification product via MLflow and a Mixture-of-Experts approach.

3/6/2026

OpenAI

How Descript enables multilingual video dubbing at scale

Explores how Descript enables scalable multilingual video dubbing, detailing the architecture, tooling, and end-to-end workflows required to deliver synchronized multilingual voiceovers.

3/6/2026

OpenAI

Codex Security: now in research preview

A concise analysis of Codex Security as it enters a research preview, outlining its security posture, potential use cases, and considerations for developers and researchers.

3/6/2026

Databricks

Building a near real-time application with Zerobus Ingest and Lakebase

Build a near real-time operational app by ingesting events directly into Delta tables with Zerobus Ingest and surfacing live analytics through Lakebase and Databricks Apps on the Databricks Data Intelligence Platform.

3/6/2026

Google Cloud

Calling all devs: Build the future of Multimodal AI in the Gemini Live Agent Challenge

Guide to building and deploying multimodal AI agents for the Gemini Live Agent Challenge, using Gemini models, the ADK, and Google Cloud services to deliver real-time, immersive cross-modal experiences across Live Agent, Creative Storyteller, and UI Navigator.

3/6/2026

Google Cloud

Proactive Preparation and Hardening Against Destructive Attacks: 2026 Edition

Proactive, defense-in-depth playbook for hardening IT/OT, cloud, and virtualization against destructive cyberattacks in 2026, covering external-facing risk, segmentation, strong authentication, privileged access workstations, and immutable backups.

3/6/2026

Apple ML

GenCtrl -- A Formal Controllability Toolkit for Generative Models

A formal framework for the controllability of generative models, treating human–model interaction as a control process and providing PAC-style guarantees on controllable-set estimation, with Plan-then-Generate approaches for data-to-text.

3/6/2026

OpenAI

How Balyasny Asset Management built an AI research engine for investing

Technical exploration of how Balyasny Asset Management built an AI-driven research engine to accelerate data-driven investment insights and quantitative decision-making.

3/6/2026