
LLM in a Flash: Efficient Large Language Model Inference with Limited Memory
Efficiently deploying large language models on devices with limited memory using flash memory optimization techniques.

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
Proposing a method for aligning large language models with human expectations through self-rewarding contrastive prompt distillation.

Investigation of a Cross-regional Network Performance Issue
Analysis of a cross-regional network performance issue traces the root cause to a Linux kernel upgrade that affected TCP receive window size calculations.

Build an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and AWS CloudFormation
Automate the deployment of an end-to-end RAG solution using Knowledge Bases for Amazon Bedrock and AWS CloudFormation.

Surprising findings from our analysis of 3DS transactions in the US
An analysis of 3DS transactions in the US reveals surprising differences in authentication behavior between regions.

Continuous Delivery on Google Cloud with GitLab CI/CD and Cloud Deploy
Automate software delivery from code commit to production release on Google Cloud using GitLab CI/CD and Cloud Deploy.

Catalog, query, and search audio programs with Amazon Transcribe and Knowledge Bases for Amazon Bedrock
Catalog, query, and search audio programs efficiently using Amazon Transcribe and Knowledge Bases for Amazon Bedrock.

Faster LLMs with speculative decoding and AWS Inferentia2
Accelerate large language model inference with speculative decoding on AWS Inferentia2.

DCPerf: An open source benchmark suite for hyperscale compute applications
DCPerf, an open source benchmark suite for hyperscale compute applications, aimed at improving hardware and software optimization and platform design.

A RoCE network for distributed AI training at scale
Building a robust RoCE network for large-scale distributed AI training workloads at Meta.

A new generation of African talent brings cutting-edge AI to scientific challenges
A new generation of African talent leverages cutting-edge AI for scientific challenges through the AI for Science Master’s program at AIMS.

Friend Streak: a new way to stay motivated together
Introducing Friend Streak, a social motivation feature for learning together with friends on Duolingo.