Dec 01, 2024, Global – Amazon Bedrock has unveiled two new preview capabilities for evaluating and optimizing generative AI applications: RAG evaluation for Knowledge Bases and LLM-as-a-judge for model assessment. Both promise to simplify testing and shorten the time it takes to move generative AI solutions into production.
RAG evaluation automatically assesses Retrieval Augmented Generation (RAG) applications, with evaluation metrics computed by large language models, so developers can compare configurations and tune applications for specific use cases. LLM-as-a-judge, in turn, uses a model to deliver humanlike evaluations of other models at far lower cost and in far less time than traditional human assessments.
Both capabilities measure quality dimensions such as correctness, helpfulness, and adherence to responsible AI principles, returning natural-language explanations and normalized scores for interpretability. They are available in preview in multiple AWS Regions, with no charge for evaluation jobs beyond standard Amazon Bedrock pricing.
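Programmatically, both evaluation types are exposed through Bedrock's control-plane API. The sketch below shows how an LLM-as-a-judge job might be started with boto3's create_evaluation_job; the role ARN, S3 URIs, model identifiers, and built-in metric names are illustrative assumptions rather than values from the announcement.

```python
import boto3

# Control-plane client ("bedrock", not "bedrock-runtime").
bedrock = boto3.client("bedrock", region_name="us-east-1")

# Minimal sketch of an LLM-as-a-judge evaluation job. The IAM role,
# S3 locations, model IDs, and metric identifiers below are
# illustrative assumptions; check the preview docs for exact values.
response = bedrock.create_evaluation_job(
    jobName="llm-judge-demo",
    roleArn="arn:aws:iam::123456789012:role/BedrockEvalRole",  # hypothetical
    applicationType="ModelEvaluation",
    evaluationConfig={
        "automated": {
            "datasetMetricConfigs": [{
                "taskType": "Generation",
                "dataset": {
                    "name": "my-prompts",
                    "datasetLocation": {"s3Uri": "s3://my-bucket/prompts.jsonl"},
                },
                # Quality dimensions named in the announcement; the exact
                # built-in metric identifiers are assumptions.
                "metricNames": ["Builtin.Correctness", "Builtin.Helpfulness"],
            }],
            # The judge model that assigns scores and writes the
            # natural-language explanations.
            "evaluatorModelConfig": {
                "bedrockEvaluatorModels": [
                    {"modelIdentifier": "anthropic.claude-3-5-sonnet-20240620-v1:0"}
                ]
            },
        }
    },
    # The model whose responses are being judged.
    inferenceConfig={
        "models": [{
            "bedrockModel": {"modelIdentifier": "amazon.titan-text-express-v1"}
        }]
    },
    outputDataConfig={"s3Uri": "s3://my-bucket/eval-results/"},
)
print(response["jobArn"])
```

A RAG evaluation job would follow the same pattern, with the application type set to the RAG variant and the inference configuration pointing at a Knowledge Base rather than a model; per-prompt scores and explanations are written to the configured output location.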
Amazon Bedrock’s evaluation tools are designed to fast-track the deployment of generative AI applications by surfacing clear insights and shortening feedback loops. Developers can access them directly from the Amazon Bedrock console.
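Jobs launched from the console or the API can also be monitored programmatically, which is where the shorter feedback loop plays out in practice. Below is a small sketch assuming the get_evaluation_job operation; the status strings used here are assumptions based on typical Bedrock job lifecycles.

```python
import time
import boto3

bedrock = boto3.client("bedrock", region_name="us-east-1")

def wait_for_evaluation(job_arn: str, poll_seconds: int = 60) -> str:
    """Poll an evaluation job until it reaches a terminal state.

    The status values ("InProgress", "Completed", "Failed") are
    assumptions, not confirmed by the announcement.
    """
    while True:
        status = bedrock.get_evaluation_job(jobIdentifier=job_arn)["status"]
        if status not in ("InProgress", "Stopping"):
            return status
        time.sleep(poll_seconds)

# Hypothetical usage: once the job completes, normalized scores and
# explanations land in the S3 prefix given in outputDataConfig.
# print(wait_for_evaluation("arn:aws:bedrock:us-east-1:123456789012:evaluation-job/abc123"))
```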