Apple MLAccelerating LLM Inference on NVIDIA GPUs with ReDrafter
Optimizing LLM inference acceleration on NVIDIA GPUs with ReDrafter
DatabricksČeská spořitelna: How GenAI is Transforming Call Centers in the Financial Services Industry
Explore how Česká spořitelna implemented GenAI in call centers to enhance quality control and reduce operational costs while managing approximately 2 million calls annually.
MetaHow we think about Threads’ iOS performance
Improving iOS performance in the Threads app by monitoring key metrics, including %FIRE, TTNC, and cPSR, while tackling performance challenges and case studies on publish reliability and navigation latency.
AWS MLUsing natural language in Amazon Q Business: From searching and creating ServiceNow incidents and knowledge articles to generating insights
Integrating Amazon Q Business with ServiceNow for enhanced user productivity and insights through generative AI-powered assistants and custom plugins
AWS MLHow Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model
Fastweb fine-tuned Mistral model using SageMaker HyperPod as first step to build Italian large language model
Duologues: the conversations shaping Duolingo Design
Exploring conversations shaping Duolingo Design through Duologues speaker series
Google DeepMindFACTS Grounding: A new benchmark for evaluating the factuality of large language models
FACTS Grounding introduces a new benchmark and leaderboard to rigorously evaluate and improve the factual accuracy and grounding of large language model responses using comprehensive, multi-judge assessment across diverse, real-world documents.
Snorkel AIHow LLM evaluation drives better models in Snorkel Flow
Enhancing large language model (LLM) evaluation using Snorkel Flow for better models and faster results
DatabricksIntroducing Git Support for Queries in Databricks
Manage queries in version control with Git support in Databricks' New SQL Editor.
OpenAIOpenAI o1 and new tools for developers
Exploring the latest features of OpenAI o1 and innovative developer tools
DatabricksBenchmarking Domain Intelligence
Evaluating models performance on Domain Intelligence Benchmark Suite (DIBS) in comparison to academic benchmarks to highlight the need for domain-specific testing and choosing models based on specific needs.
NetflixTitle Launch Observability at Netflix Scale
Empowering successful title launches and discoverability at Netflix through robust observability solutions.