Modular AIModular: MAX GPU: State of the Art Throughput on a New GenAI platform
Analyzing performance metrics and comparisons of MAX GPU and vLLM on AI inference workloads - exploring throughput and productivity enhancements with a focus on benchmarking
Modular AIModular: Introducing MAX 24.6: A GPU Native Generative AI Platform
MAX 24.6: A GPU Native Generative AI Platform revolutionizing AI infrastructure with unique technologies and improved performance.
Modular AIModular: Build a Continuous Chat Interface with Llama 3 and MAX Serve
Create a chat application using Llama 3 and MAX Serve with efficient token management and concurrent request handling.
Google CloudReach beyond the IDE with tools for Gemini Code Assist
Leverage Gemini Code Assist tools to access information and tools outside the IDE for uninterrupted flow and enhanced development experience
Google CloudHow Virgin Media O2 uses Privileged Access Manager to achieve principle of least privilege
Virgin Media O2 leverages Google Cloud's Privileged Access Manager to implement least-privilege principle for secure access management
AWS MLHow TUI uses Amazon Bedrock to scale content creation and enhance hotel descriptions in under 10 seconds
Enhancing content creation and hotel descriptions at TUI using Amazon Bedrock for generative AI in under 10 seconds
AWS MLSimplify multimodal generative AI with Amazon Bedrock Data Automation
Automate multimodal generative AI tasks with Amazon Bedrock Data Automation for media analysis and intelligent document processing workflows
DatabricksSeven West Media: Pioneering the Future of Personalized Digital Engagement
Pioneering personalized digital engagement through AI-driven strategies and data democratization with Seven West Media and Databricks
Snorkel AIHow LLM evaluation drives better models in Snorkel Flow
Optimizing large language model (LLM) evaluation process in Snorkel Flow for improved model performance
SalesforceScaling AI Systems: Secrets for Managing 100,000 Training and Metadata Requests Per Minute
Managing high traffic volumes and database-heavy operations while scaling AI systems for seamless metadata consistency and robust data integrity.
Google DeepMindFACTS Grounding: A new benchmark for evaluating the factuality of large language models
Introducing FACTS Grounding: A new benchmark for evaluating factuality in large language models with an online leaderboard for tracking industry progress.
DatabricksDatabricks at NRF 2025: The future of retail runs on the Data Intelligence Platform
Discover the future of retail through intelligent data solutions at Databricks' booth at NRF 2025.