engblogs

summaries of the latest blog articles from your favorite tech companies.
Google CloudGoogle Cloud

Hex-LLM: High-efficiency large language model serving on TPUs in Vertex AI Model Garden

Hex-LLM: High-efficiency large language model serving on TPUs in Vertex AI Model Garden, delivering competitive performance with high throughput and low latency

7/26/2024
DatabricksDatabricks

Announcing General Availability of Lakehouse Federation

Introducing Lakehouse Federation for unified data discovery, query, and governance across multiple cloud platforms

7/26/2024
DatabricksDatabricks

A Framework for Multi-Model Forecasting on Databricks

A comprehensive framework for evaluating and deploying multiple forecasting models at scale on Databricks.

7/26/2024
Google CloudGoogle Cloud

Leverage enterprise data with Denodo and Vertex AI for generative AI applications

Unlock generative AI potential by harmonizing enterprise data with Denodo and Vertex AI technologies

7/25/2024
Snorkel AISnorkel AI

Meta’s Llama 3.1 405B is the new Mr. Miyagi, now what?

Explores the potential of Meta’s new Llama 3.1 405B model, Larry, as a teacher, judge, and for distillation in AI applications.

7/25/2024
SalesforceSalesforce

How Salesforce’s New Speech-to-Text Service Uses OpenAI Whisper Models for Real-Time Transcriptions

Exploring how Salesforce's new Speech-to-Text service utilizes OpenAI Whisper models for real-time transcriptions

7/25/2024
AWS MLAWS ML

Amazon SageMaker inference launches faster auto scaling for generative AI models

Enhance generative AI models with faster auto scaling using Amazon SageMaker inference

7/25/2024
Google CloudGoogle Cloud

Powering a sustainable future with ChromeOS

Empowering businesses to embrace sustainability with ChromeOS and ChromeOS Flex

7/25/2024
Google CloudGoogle Cloud

Shiseido: building a data analysis platform using BigQuery for 80% cost savings

Optimizing data analysis platform with BigQuery for substantial cost savings

7/25/2024
Google CloudGoogle Cloud

Leverage enterprise data with Denodo and Vertex AI for generative AI applications

Unlock generative AI potential by leveraging enterprise data with Denodo and Vertex AI

7/25/2024
Google CloudGoogle Cloud

Understanding Airflow DAG and task concurrency on Google Cloud Composer

Optimizing concurrency levels in Apache Airflow DAG and task execution on Google Cloud Composer

7/25/2024
AWS MLAWS ML

Node problem detection and recovery for AWS Neuron nodes within Amazon EKS clusters

Detect and recover from AWS Neuron node issues in Amazon EKS clusters for fault-tolerant ML training

7/25/2024