Amazon Web Services

27 posts

amazon-web-services

Amazon Bedrock AWS Audit Manager Customer Solutions Generative AI Responsible AI Technical How-to

Align and monitor your Amazon Bedrock powered insurance assistance chatbot to responsible AI principles with AWS Audit Manager

Generative AI applications should be developed with adequate controls for steering the behavior of FMs. Responsible AI considerations such as privacy, security, safety, controllability, fairness, explainability, transparency and governance help ensure that AI systems are trustworthy. In this post, we demonstrate how to use the AWS generative AI best practices framework on AWS Audit Manager to evaluate this insurance claim agent from a responsible AI lens.

Bharathi Srinivasan1/7/2025

amazon-web-services

Amazon Bedrock Amazon Q Amazon Q Business Customer Solutions Financial Services Industries Technical How-to

London Stock Exchange Group uses Amazon Q Business to enhance post-trade client services

In this blog post, we explore a client services agent assistant application developed by the London Stock Exchange Group (LSEG) using Amazon Q Business. We will discuss how Amazon Q Business saved time in generating answers, including summarizing documents, retrieving answers to complex Member enquiries, and combining information from different data sources (while providing in-text citations to the data sources used for each answer).

Ben Doughton1/7/2025

amazon-web-services

Amazon Bedrock Amazon OpenSearch Service Customer Solutions

Evaluate large language models for your machine translation tasks on AWS

This blog post with accompanying code presents a solution to experiment with real-time machine translation using foundation models (FMs) available in Amazon Bedrock. It can help collect more data on the value of LLMs for your content translation use cases.

Narcisse Zekpa1/7/2025

amazon-web-services

Amazon Bedrock Customer Solutions Generative AI

Parameta accelerates client email resolution with Amazon Bedrock Flows

In this post, we show you how Parameta used Amazon Bedrock Flows to transform their manual client email processing into an automated, intelligent workflow that reduced resolution times from weeks to days while maintaining high accuracy and operational control.

Siokhan Kouassi1/7/2025

amazon-web-services

Amazon Machine Learning Amazon SageMaker Amazon SageMaker AI Innovation and Reinvention Intermediate (200)Monitoring and observability Python Security Security & Governance Technical How-to

Efficiently build and tune custom log anomaly detection models with Amazon SageMaker

In this post, we walk you through the process to build an automated mechanism using Amazon SageMaker to process your log data, run training iterations over it to obtain the best-performing anomaly detection model, and register it with the Amazon SageMaker Model Registry for your customers to use it.

Nitesh Sehwani1/6/2025

amazon-web-services

Amazon Bedrock Architecture Best Practices Generative AI Thought Leadership

Optimizing costs of generative AI applications on AWS

Optimizing costs of generative AI applications on AWS is critical for realizing the full potential of this transformative technology. The post outlines key cost optimization pillars, including model selection and customization, token usage, inference pricing plans, and vector database considerations.

Vinnie Saini12/26/2024

amazon-web-services

Amazon SageMaker HyperPod Artificial Intelligence AWS Trainium Generative AI Technical How-to

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

In this blog post, we showcase how you can perform efficient supervised fine tuning for a Meta Llama 3 model using PEFT on AWS Trainium with SageMaker HyperPod. We use HuggingFace’s Optimum-Neuron software development kit (SDK) to apply LoRA to fine-tuning jobs, and use SageMaker HyperPod as the primary compute cluster to perform distributed training on Trainium. Using LoRA supervised fine-tuning for Meta Llama 3 models, you can further reduce your cost to fine tune models by up to 50% and reduce the training time by 70%.

Georgios Ioannides12/24/2024

amazon-web-services

Amazon-Lex Artificial-Intelligence Intermediate-(200)

Using transcription confidence scores to improve slot filling in Amazon Lex

When building voice-enabled chatbots with Amazon Lex, one of the biggest challenges is accurately capturing user speech input for slot values. Transcription confidence scores can help ensure reliable slot filling. This blog post outlines strategies like progressive confirmation, adaptive re-prompting, and branching logic to create more robust slot filling experiences.

Alex Buckhurst12/23/2024

amazon-web-services

Amazon-Bedrock Amazon-Neptune Generative-AI Partner-solutions

Improving Retrieval Augmented Generation accuracy with GraphRAG

Lettria, an AWS Partner, demonstrated that integrating graph-based structures into RAG workflows improves answer precision by up to 35% compared to vector-only retrieval methods. In this post, we explore why GraphRAG is more comprehensive and explainable than vector RAG alone, and how you can use this approach using AWS services and Lettria.

Denise Gosnell12/23/2024

amazon-web-services

Amazon-Q Amazon-Q-Business Generative-AI

Add a generative AI experience to your website or web application with Amazon Q embedded

Amazon Q embedded is a feature that lets you embed a hosted Amazon Q Business assistant on your website or application to create more personalized experiences that boost end-users’ productivity. In this post, we demonstrate how to use the Amazon Q embedded feature to add an Amazon Q Business assistant to your website or web application using basic HTML or React.

Bobby Williams12/19/2024

amazon-web-services

Amazon-Bedrock Amazon-Machine-Learning Amazon-SageMaker AIML Data-Preparation

An introduction to preparing your own dataset for LLM training

In this blog post, we provide an introduction to preparing your own dataset for LLM training. Whether your goal is to fine-tune a pre-trained model for a specific task or to continue pre-training for domain-specific applications, having a well-curated dataset is crucial for achieving optimal performance.

Simon Zamarin12/19/2024

amazon-web-services

Amazon-Bedrock Amazon-Bedrock-Agents Amazon-Bedrock-Knowledge-Bases Generative-AI AIML

Design multi-agent orchestration with reasoning using Amazon Bedrock and open source frameworks

This post provides step-by-step instructions for creating a collaborative multi-agent framework with reasoning capabilities to decouple business applications from FMs. It demonstrates how to combine Amazon Bedrock Agents with open source multi-agent frameworks, enabling collaborations and reasoning among agents to dynamically execute various tasks. The exercise will guide you through the process of building a reasoning orchestration system using Amazon Bedrock, Amazon Bedrock Knowledge Bases, Amazon Bedrock Agents, and FMs. We also explore the integration of Amazon Bedrock Agents with open source orchestration frameworks LangGraph and CrewAI for dispatching and reasoning.

Alfred Shen12/19/2024

amazon-web-services

Amazon-SageMaker-HyperPod Customer-Solutions Generative-AI

How Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model

Fastweb, one of Italy’s leading telecommunications operators, recognized the immense potential of AI technologies early on and began investing in this area in 2019. In this post, we explore how Fastweb used cutting-edge AI and ML services to embark on their LLM journey, overcoming challenges and unlocking new opportunities along the way.

Marta Cavalleri12/18/2024

amazon-web-services

Amazon-Machine-Learning Amazon-Q-Business Generative-AI AIML artificial-intelligence automation machine-learning Natural-Language-Processing re:Invent

Using natural language in Amazon Q Business: From searching and creating ServiceNow incidents and knowledge articles to generating insights

In this post, we’ll demonstrate how to configure an Amazon Q Business application and add a custom plugin that gives users the ability to use a natural language interface provided by Amazon Q Business to query real-time data and take actions in ServiceNow.

Siddhartha Angara12/18/2024

amazon-web-services

Amazon-Bedrock Amazon-Machine-Learning Artificial-Intelligence Generative-AI Launch

Simplify multimodal generative AI with Amazon Bedrock Data Automation

Amazon Bedrock Data Automation in public preview, offers a unified experience for developers of all skillsets to easily automate the extraction, transformation, and generation of relevant insights from documents, images, audio, and videos to build generative AI–powered applications. In this post, we demonstrate how to use Amazon Bedrock Data Automation in the AWS Management Console and the AWS SDK for Python (Boto3) for media analysis and intelligent document processing (IDP) workflows.

Ian Lodge12/17/2024

amazon-web-services

Advanced-(300)Amazon-Bedrock Amazon-Machine-Learning Amazon-SageMaker Amazon-SageMaker-JumpStart Architecture Customer-Solutions Europe Generative-AI Technical-How-to Travel-and-Hospitality

How TUI uses Amazon Bedrock to scale content creation and enhance hotel descriptions in under 10 seconds

TUI Group is one of the world’s leading global tourism services, providing 21 million customers with an unmatched holiday experience in 180 regions. The TUI content teams are tasked with producing high-quality content for its websites, including product details, hotel information, and travel guides, often using descriptions written by hotel and third-party partners. In this post, we discuss how we used Amazon SageMaker and Amazon Bedrock to build a content generator that rewrites marketing content following specific brand and style guidelines.

Hin Yee Liu12/17/2024

amazon-web-services

Amazon-SageMaker Amazon-SageMaker-JumpStart Announcements Artificial-Intelligence

Llama 3.3 70B now available in Amazon SageMaker JumpStart

Today, we are excited to announce that the Llama 3.3 70B from Meta is available in Amazon SageMaker JumpStart. Llama 3.3 70B marks an exciting advancement in large language model (LLM) development, offering comparable performance to larger Llama versions with fewer computational resources. In this post, we explore how to deploy this model efficiently on Amazon SageMaker AI, using advanced SageMaker AI features for optimal performance and cost management.

Marc Karp12/17/2024

amazon-web-services

AWS re:Invent 2024 Highlights: Top takeaways from Swami Sivasubramanian to help customers manage generative AI at scale

We spoke with Dr. Swami Sivasubramanian, Vice President of Data and AI, shortly after AWS re:Invent 2024 to hear his impressions—and to get insights on how the latest AWS innovations help meet the real-world needs of customers as they build and scale transformative generative AI applications.

Swami Sivasubramanian12/16/2024

amazon-web-services

Amazon-Bedrock Amazon-Bedrock-Knowledge-Bases Amazon-OpenSearch-Service Best-Practices Technical-How-to

Multi-tenant RAG with Amazon Bedrock Knowledge Bases

Organizations are continuously seeking ways to use their proprietary knowledge and domain expertise to gain a competitive edge. With the advent of foundation models (FMs) and their remarkable natural language processing capabilities, a new opportunity has emerged to unlock the value of their data assets. As organizations strive to deliver personalized experiences to customers using […]

Emanuele Levi12/16/2024

amazon-web-services

Amazon-SageMaker Artificial-Intelligence Customer-Solutions

How Amazon trains sequential ensemble models at scale with Amazon SageMaker Pipelines

Ensemble models are becoming popular within the ML communities. They generate more accurate predictions through combining the predictions of multiple models. Pipelines can quickly be used to create and end-to-end ML pipeline for ensemble models. This enables developers to build highly accurate models while maintaining efficiency, and reproducibility. In this post, we provide an example of an ensemble model that was trained and deployed using Pipelines.

Bikram Singh12/13/2024