APIs, Examples and Metrics - Customer Contact Central

Amazon Bedrock launches Session Management APIs for generative AI applications (Preview)

AWS Machine Learning

MARCH 25, 2025

Amazon Bedrock announces the preview launch of Session Management APIs, a new capability that enables developers to simplify state and context management for generative AI applications built with popular open source frameworks such as LangGraph and LlamaIndex. Building generative AI applications requires more than model API calls.

APIs

APIs Management Government Healthcare

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

AWS Machine Learning

NOVEMBER 14, 2024

Customers can use the SageMaker Studio UI or APIs to specify the SageMaker Model Registry model to be shared and grant access to specific AWS accounts or to everyone in the organization. With this launch, customers can now seamlessly share and access ML models registered in SageMaker Model Registry between different AWS accounts.

Government

Government Management APIs Accountability

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval

AWS Machine Learning

JANUARY 28, 2025

Evaluation algorithm Computes evaluation metrics to model outputs. Different algorithms have different metrics to be specified. It functions as a standalone HTTP server that provides various REST API endpoints for monitoring, recording, and visualizing experiment runs. This allows you to keep track of your ML experiments.

Management

Management APIs Engineering Metrics

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

AWS Machine Learning

DECEMBER 4, 2024

They have structured data such as sales transactions and revenue metrics stored in databases, alongside unstructured data such as customer reviews and marketing reports collected from various channels. This includes setting up Amazon API Gateway , AWS Lambda functions, and Amazon Athena to enable querying the structured sales data.

APIs

APIs Sales Surveys Analytics

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. It contains services used to onboard, manage, and operate the environment, for example, to onboard and off-board tenants, users, and models, assign quotas to different tenants, and authentication and authorization microservices.

Enterprise

Enterprise APIs Government Accountability

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning

NOVEMBER 15, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

APIs

APIs Government Best practices Metrics

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning

OCTOBER 29, 2024

All of this data is centralized and can be used to improve metrics in scenarios such as sales or call centers. We walk through the key components and services needed to build the end-to-end architecture, offering example code snippets and explanations for each critical element that help achieve the core functionality.

Engineering

Engineering APIs Sales Enterprise

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Evaluation, on the other hand, involves assessing the quality and relevance of the generated outputs, enabling continual improvement.

Best practices

Best practices Feedback Metrics APIs

LLM-as-a-judge on Amazon Bedrock Model Evaluation

AWS Machine Learning

FEBRUARY 12, 2025

This approach allows organizations to assess their AI models effectiveness using pre-defined metrics, making sure that the technology aligns with their specific needs and objectives. The introduction of an LLM-as-a-judge framework represents a significant step forward in simplifying and streamlining the model evaluation process.

Metrics

Metrics Engineering Benchmark APIs

Benchmarking Amazon Nova and GPT-4o models with FloTorch

AWS Machine Learning

MARCH 11, 2025

How do Amazon Nova Micro and Amazon Nova Lite perform against GPT-4o mini in these same metrics? The following table provides example questions with their domain and question type. Amazon Bedrock APIs make it straightforward to use Amazon Titan Text Embeddings V2 for embedding data. get("message", {}).get("content")

Benchmark

Benchmark APIs Enterprise Scripts

Evaluate RAG responses with Amazon Bedrock, LlamaIndex and RAGAS

AWS Machine Learning

MARCH 6, 2025

Current RAG pipelines frequently employ similarity-based metrics such as ROUGE , BLEU , and BERTScore to assess the quality of the generated responses, which is essential for refining and enhancing the models capabilities. More sophisticated metrics are needed to evaluate factual alignment and accuracy.

Metrics

Metrics Enterprise APIs Engineering

Unlock cost savings with the new scale down to zero feature in SageMaker Inference

AWS Machine Learning

DECEMBER 2, 2024

Use faster auto scaling metrics – Take advantage of more granular auto scaling metrics like ConcurrentRequestsPerCopy to more accurately monitor and react to changes in inference traffic. It’s a dynamic policy that adjusts the number of copies based on a specified metric, such as CPU utilization or request count.

APIs

APIs Best practices Metrics Engineering

Enhance deployment guardrails with inference component rolling updates for Amazon SageMaker AI inference

AWS Machine Learning

MARCH 25, 2025

Then we deep dive into the new rolling update feature for inference components and provide practical examples using DeepSeek distilled models to demonstrate this feature. Consider an example where a customer has 10 copies of an inference component spread across 5 ml.p4d.24xlarge You can find the example notebook in the GitHub repo.

APIs

APIs Engineering Accountability Metrics

GraphStorm 0.3: Scalable, multi-task learning on graphs with user-friendly APIs

AWS Machine Learning

AUGUST 2, 2024

adds new APIs to customize GraphStorm pipelines: you now only need 12 lines of code to implement a custom node classification training loop. To help you get started with the new API, we have published two Jupyter notebook examples: one for node classification, and one for a link prediction task. Specifically, GraphStorm 0.3

APIs

APIs Benchmark Construction Enterprise

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow

AWS Machine Learning

DECEMBER 9, 2024

However, keeping track of numerous experiments, their parameters, metrics, and results can be difficult, especially when working on complex projects simultaneously. For example, you can give users access permission to download popular packages and customize the development environment. config_yaml = f""" SchemaVersion: '1.0'

Metrics

Metrics APIs Engineering Accountability

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning

APRIL 10, 2025

Fine-tune an Amazon Nova model using the Amazon Bedrock API In this section, we provide detailed walkthroughs on fine-tuning and hosting customized Amazon Nova models using Amazon Bedrock. To do so, we create a knowledge base. The following diagram illustrates the solution architecture. For Job name , enter a name for the fine-tuning job.

APIs

APIs Metrics Best practices Construction

Generate training data and cost-effectively train categorical models with Amazon Bedrock

AWS Machine Learning

MARCH 27, 2025

We also showcase a real-world example for predicting the root cause category for support cases. For the use case of labeling the support root cause categories, its often harder to source examples for categories such as Software Defect, Feature Request, and Documentation Improvement for labeling than it is for Customer Education.

Education

Education Engineering APIs Enterprise

Orchestrate an intelligent document processing workflow using tools in Amazon Bedrock

AWS Machine Learning

FEBRUARY 21, 2025

This serves as an example of how generative AI can streamline operations that involve diverse data types and formats. The solution uses the FMs tool use capabilities, accessed through the Amazon Bedrock Converse API. Use case and dataset For our example use case, we examine a patient intake process at a healthcare institution.

APIs

APIs Healthcare Consulting Consulting

Pixtral Large is now available in Amazon Bedrock

AWS Machine Learning

APRIL 10, 2025

For instance, Pixtral Large is highly effective at spotting irregularities or insightful trends within training loss curves or performance metrics, enhancing the accuracy of data-driven decision-making. By choosing View API , you can also access the model using code examples in the AWS Command Line Interface (AWS CLI) and AWS SDKs.

APIs

APIs Management Marketing Engineering

Optimizing AI responsiveness: A practical guide to Amazon Bedrock latency-optimized inference

AWS Machine Learning

JANUARY 28, 2025

To effectively optimize AI applications for responsiveness, we need to understand the key metrics that define latency and how they impact user experience. These metrics differ between streaming and nonstreaming modes and understanding them is crucial for building responsive AI applications.

Benchmark

Benchmark APIs Engineering Metrics

Speed up your time series forecasting by up to 50 percent with Amazon SageMaker Canvas UI and AutoML APIs

AWS Machine Learning

SEPTEMBER 28, 2023

As an example, time-series forecasting allows retailers to predict future sales demand and plan for inventory levels, logistics, and marketing campaigns. In this post, we describe the enhancements to the forecasting capabilities of SageMaker Canvas and guide you on using its user interface (UI) and AutoML APIs for time-series forecasting.

APIs

APIs Construction Finance Enterprise

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

AWS Machine Learning

NOVEMBER 20, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Construction

Construction APIs Metrics Accountability

How Formula 1® uses generative AI to accelerate race-day issue resolution

AWS Machine Learning

FEBRUARY 18, 2025

During these live events, F1 IT engineers must triage critical issues across its services, such as network degradation to one of its APIs. This impacts downstream services that consume data from the API, including products such as F1 TV, which offer live and on-demand coverage of every race as well as real-time telemetry.

APIs

APIs Engineering Metrics Big data

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

AWS Machine Learning

MARCH 3, 2025

Designed for both image and document comprehension, Pixtral demonstrates advanced capabilities in vision-related tasks, including chart and figure interpretation, document question answering, multimodal reasoning, and instruction followingseveral of which are illustrated with examples later in this post. Lets explore an example.

Benchmark

Benchmark APIs Enterprise Construction

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning

NOVEMBER 13, 2024

For example, searching for a specific red leather handbag with a gold chain using text alone can be cumbersome and imprecise, often yielding results that don’t directly match the user’s intent. Amazon Titan FMs provide customers with a breadth of high-performing image, multimodal, and text model choices, through a fully managed API.

Engineering

Engineering Management APIs Healthcare

Integrate dynamic web content in your generative AI application using a web search API and Amazon Bedrock Agents

AWS Machine Learning

SEPTEMBER 20, 2024

Amazon Bedrock agents use LLMs to break down tasks, interact dynamically with users, run actions through API calls, and augment knowledge using Amazon Bedrock Knowledge Bases. In this post, we demonstrate how to use Amazon Bedrock Agents with a web search API to integrate dynamic web content in your generative AI application.

APIs

APIs Chatbots Construction Engineering

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning

NOVEMBER 14, 2024

The user’s request is sent to AWS API Gateway , which triggers a Lambda function to interact with Amazon Bedrock using Anthropic’s Claude Instant V1 FM to process the user’s request and generate a natural language response of the place location. Here is an example from LangChain.

APIs

APIs Engineering Personalization Transportation

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

AWS Machine Learning

MARCH 14, 2025

Although automated metrics are fast and cost-effective, they can only evaluate the correctness of an AI response, without capturing other evaluation dimensions or providing explanations of why an answer is problematic. Human evaluation, although thorough, is time-consuming and expensive at scale.

Metrics

Metrics Best practices Engineering APIs

How to decide between Amazon Rekognition image and video API for video moderation

AWS Machine Learning

FEBRUARY 1, 2023

Amazon Rekognition has two sets of APIs that help you moderate images or videos to keep digital communities safe and engaged. Some customers have asked if they could use this approach to moderate videos by sampling image frames and sending them to the Amazon Rekognition image moderation API.

APIs

APIs Scripts Metrics Surveys

Enable pod-based GPU metrics in Amazon CloudWatch

AWS Machine Learning

SEPTEMBER 7, 2023

In February 2022, Amazon Web Services added support for NVIDIA GPU metrics in Amazon CloudWatch , making it possible to push metrics from the Amazon CloudWatch Agent to Amazon CloudWatch and monitor your code for optimal GPU utilization. Then we explore two architectures. already installed.

Metrics

Metrics APIs Management Engineering

Mistral-Small-24B-Instruct-2501 is now available on SageMaker Jumpstart and Amazon Bedrock Marketplace

AWS Machine Learning

FEBRUARY 24, 2025

Performance metrics and benchmarks According to Mistral, the instruction-tuned version of the model achieves over 81% accuracy on Massive Multitask Language Understanding (MMLU) with 150 tokens per second latency, making it currently the most efficient model in its category. It doesnt support Converse APIs or other Amazon Bedrock tooling.

APIs

APIs Enterprise Benchmark Feedback

How Untold Studios empowers artists with an AI assistant built on Amazon Bedrock

AWS Machine Learning

FEBRUARY 7, 2025

The implementation uses Slacks event subscription API to process incoming messages and Slacks Web API to send responses. The following screenshot shows an example. The incoming event from Slack is sent to an endpoint in API Gateway, and Slack expects a response in less than 3 seconds, otherwise the request fails.

Entertainment

Entertainment APIs Technology Feedback

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning

NOVEMBER 15, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Chatbots

Chatbots Engineering Enterprise Government

From RAG to fabric: Lessons learned from building real-world RAGs at GenAIIC – Part 1

AWS Machine Learning

OCTOBER 24, 2024

Anatomy of RAG RAG is an efficient way to provide an FM with additional knowledge by using external data sources and is depicted in the following diagram: Retrieval : Based on a user’s question (1), relevant information is retrieved from a knowledge base (2) (for example, an OpenSearch index).

Chatbots

Chatbots Metrics Scripts APIs

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning

NOVEMBER 19, 2024

Hear from customer speakers with real-world examples of how they’ve used data to support a variety of use cases, including generative AI, to create unique customer experiences. Discover how to create and manage evaluation jobs, use automatic and human reviews, and analyze critical metrics like accuracy, robustness, and toxicity.

APIs

APIs Enterprise Best practices Government

Unlocking insights and enhancing customer service: Intact’s transformative AI journey with AWS

AWS Machine Learning

OCTOBER 16, 2024

Frontend and API The CQ application offers a robust search interface specially crafted for call quality agents, equipping them with powerful auditing capabilities for call analysis. Call Quality Trend Dashboard The following figure is an example of the Call Quality Trend Dashboard, showing the information available to agents.

Customer Service

Customer Service Scripts Coaching Finance

Evaluate large language models for your machine translation tasks on AWS

AWS Machine Learning

JANUARY 7, 2025

The translation playground could be adapted into a scalable serverless solution as represented by the following diagram using AWS Lambda , Amazon Simple Storage Service (Amazon S3), and Amazon API Gateway. For this example, the translated text, although accurate, is close to a literal translation, which is not a common phrasing in French.

Engineering

Engineering Metrics industry standards Analytics

Build a loyalty points anomaly detector using Amazon Lookout for Metrics

AWS Machine Learning

JANUARY 25, 2023

For example, a fast food chain has launched its earn and burn loyalty pilot program in some locations. This post shows you how to use an integrated solution with Amazon Lookout for Metrics to break these barriers by quickly and easily detecting anomalies in the key performance indicators (KPIs) of your interest. Choose Upload.

Metrics

Metrics APIs Enterprise Analytics

Combine keyword and semantic search for text and images using Amazon Bedrock and Amazon OpenSearch Service

AWS Machine Learning

APRIL 24, 2025

A seamless search journey not only enhances the overall user experience, but also directly impacts key business metrics such as conversion rates, average order value, and customer loyalty. Send the text, images, and metadata to Amazon Bedrock using its API to generate embeddings using the Amazon Titan Multimodal Embeddings G1 model.

Engineering

Engineering APIs Enterprise Accountability

Improve AI assistant response accuracy using Knowledge Bases for Amazon Bedrock and a reranking model

AWS Machine Learning

AUGUST 7, 2024

For example, retrieving responses from its database before generating a response could provide more relevant and coherent responses. We then retrieve answers using standard RAG and a two-stage RAG, which involves a reranking API. The framework provides a suite of metrics to evaluate different dimensions. join(batch_text_arr) s3.put_object(

APIs

APIs Chatbots Metrics Education

How Veritone uses Amazon Bedrock, Amazon Rekognition, Amazon Transcribe, and information retrieval to update their video search pipeline

AWS Machine Learning

MAY 7, 2024

Amazon Transcribe The transcription for the entire video is generated using the StartTranscriptionJob API. The raw transcripts are further processed to be stored using timestamps, as shown in the following example. The metadata generated for each video by the APIs is processed and stored with timestamps.

APIs

APIs Advertising Entertainment Metrics

Achieve operational excellence with well-architected generative AI solutions using Amazon Bedrock

AWS Machine Learning

OCTOBER 2, 2024

It’s a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like Anthropic, Cohere, Meta, Mistral AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Government

Government APIs Enterprise Best practices

Derive generative AI powered insights from Alation Cloud Services using Amazon Q Business Custom Connector

AWS Machine Learning

FEBRUARY 25, 2025

For example, What are the top sections of the HR benefits policies? Amazon Q Business only provides metric information that you can use to monitor your data source sync jobs. Create sample Alation policies In our example, you would create three different sets of Alation policies for a fictional organization named Unicorn Rentals.

Enterprise

Enterprise Engineering APIs Accountability

How IDIADA optimized its intelligent chatbot with Amazon Bedrock

AWS Machine Learning

FEBRUARY 25, 2025

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Chatbots

Chatbots Engineering Automotive Metrics

Amazon Bedrock launches Session Management APIs for generative AI applications (Preview)

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

Trending Sources

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval

Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio

Build a multi-tenant generative AI environment for your enterprise on AWS

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Empower your generative AI application with a comprehensive custom observability solution

LLM-as-a-judge on Amazon Bedrock Model Evaluation

Benchmarking Amazon Nova and GPT-4o models with FloTorch

Evaluate RAG responses with Amazon Bedrock, LlamaIndex and RAGAS

Unlock cost savings with the new scale down to zero feature in SageMaker Inference

Enhance deployment guardrails with inference component rolling updates for Amazon SageMaker AI inference

GraphStorm 0.3: Scalable, multi-task learning on graphs with user-friendly APIs

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow

Model customization, RAG, or both: A case study with Amazon Nova

Generate training data and cost-effectively train categorical models with Amazon Bedrock

Orchestrate an intelligent document processing workflow using tools in Amazon Bedrock

Pixtral Large is now available in Amazon Bedrock

Optimizing AI responsiveness: A practical guide to Amazon Bedrock latency-optimized inference

Speed up your time series forecasting by up to 50 percent with Amazon SageMaker Canvas UI and AutoML APIs

Streamline RAG applications with intelligent metadata filtering using Amazon Bedrock

How Formula 1® uses generative AI to accelerate race-day issue resolution

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

Integrate dynamic web content in your generative AI application using a web search API and Amazon Bedrock Agents

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

How to decide between Amazon Rekognition image and video API for video moderation

Enable pod-based GPU metrics in Amazon CloudWatch

Mistral-Small-24B-Instruct-2501 is now available on SageMaker Jumpstart and Amazon Bedrock Marketplace

How Untold Studios empowers artists with an AI assistant built on Amazon Bedrock

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

From RAG to fabric: Lessons learned from building real-world RAGs at GenAIIC – Part 1

Your guide to generative AI and ML at AWS re:Invent 2024

Unlocking insights and enhancing customer service: Intact’s transformative AI journey with AWS

Evaluate large language models for your machine translation tasks on AWS

Build a loyalty points anomaly detector using Amazon Lookout for Metrics

Combine keyword and semantic search for text and images using Amazon Bedrock and Amazon OpenSearch Service

Improve AI assistant response accuracy using Knowledge Bases for Amazon Bedrock and a reranking model

How Veritone uses Amazon Bedrock, Amazon Rekognition, Amazon Transcribe, and information retrieval to update their video search pipeline

Achieve operational excellence with well-architected generative AI solutions using Amazon Bedrock

Derive generative AI powered insights from Alation Cloud Services using Amazon Q Business Custom Connector

How IDIADA optimized its intelligent chatbot with Amazon Bedrock

Stay Connected