Build More Capable LLMs with Retrieval Augmented Generation

Increase Llama 2’s Latency and Throughput Performance by Up to 4X

Enhancing RAG Pipelines in Haystack: Introducing DiversityRanker and LostInTheMiddleRanker

Towards Green AI: How to Make Deep Learning Models More Efficient in Production

Unraveling the Power of Data Fabric: Transforming Businesses with Unified Data Insights

Regulating Generative AI

Decoding Auto-GPT

Deploy thousands of model ensembles with Amazon SageMaker multi-model endpoints on GPU to minimize your hosting costs

This Week in AI, August 7: Generative AI Comes to Jupyter & Stack Overflow • ChatGPT Updates

Revolutionizing Supply Chain Management: The Power of AI in Strategic Sourcing and Inventory Optimization