
Cut Your Losses in Large-Vocabulary Language Models
Optimize memory footprint in language models with Cut Cross-Entropy method for efficient training and reduced memory consumption.

Protect your DeepSeek model deployments with Amazon Bedrock Guardrails
Implement robust safety protections for DeepSeek-R1 and other open weight models using Amazon Bedrock Guardrails

How Untold Studios empowers artists with an AI assistant built on Amazon Bedrock
Empowering artists with an AI assistant built on Amazon Bedrock at Untold Studios

Building the future of construction analytics: CONXAI’s AI inference on Amazon EKS
Empowering construction experts with AI analytics for GDPR-compliant insights using Amazon EKS

Accelerate your Amazon Q implementation: starter kits for SMBs
Learn how to accelerate Amazon Q implementation with starter kits tailored for SMBs

Governing the ML lifecycle at scale, Part 4: Scaling MLOps with security and governance controls
Managing ML lifecycle with security and governance controls for scaled MLOps

Governance Risk & Compliance: Essential Strategies
Implementing comprehensive governance, risk management and compliance frameworks for AI is crucial for safely realizing its benefits and navigating potential risks and regulatory requirements.

Modular: PagedAttention & Prefix Caching Now Available in MAX Serve
State-of-the-art LLM inference optimizations with PagedAttention & Prefix Caching now available in MAX Serve

Streamlining data collection for improved salmon population management
Utilizing cutting-edge computer vision methods to automate salmon monitoring for improved fisheries management.

Empowering women with cloud and AI skills: Register for the Google Launchpad for Women series
Empower women in cloud and AI with Google Launchpad series

Using capa Rules for Android Malware Detection
Enhancing Android malware detection using capa rules and Gemini summarization for analyzing native ARM ELF files

Announcing public beta of Gen AI Toolbox for Databases
Announcing public beta of Gen AI Toolbox for Databases - A server empowering application developers to connect generative AI applications to databases with secure access, scalability, and manageability