APIs, Engineering and Metrics - Customer Contact Central

Amazon Bedrock launches Session Management APIs for generative AI applications (Preview)

AWS Machine Learning

MARCH 25, 2025

Amazon Bedrock announces the preview launch of Session Management APIs, a new capability that enables developers to simplify state and context management for generative AI applications built with popular open source frameworks such as LangGraph and LlamaIndex. Building generative AI applications requires more than model API calls.

APIs

APIs Management Government Healthcare

Build a video insights and summarization engine using generative AI with Amazon Bedrock

AWS Machine Learning

OCTOBER 29, 2024

This post presents a solution where you can upload a recording of your meeting (a feature available in most modern digital communication services such as Amazon Chime ) to a centralized video insights and summarization engine. All of this data is centralized and can be used to improve metrics in scenarios such as sales or call centers.

Engineering

Engineering APIs Sales Enterprise

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

AWS Machine Learning

NOVEMBER 13, 2024

A reverse image search engine enables users to upload an image to find related information instead of using text-based queries. The Amazon Bedrock single API access, regardless of the models you choose, gives you the flexibility to use different FMs and upgrade to the latest model versions with minimal code changes.

Engineering

Engineering Management APIs Healthcare

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning

NOVEMBER 15, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

APIs

APIs Government Best practices Metrics

Build a multi-tenant generative AI environment for your enterprise on AWS

AWS Machine Learning

NOVEMBER 7, 2024

It also uses a number of other AWS services such as Amazon API Gateway , AWS Lambda , and Amazon SageMaker. API Gateway is serverless and hence automatically scales with traffic. API Gateway also provides a WebSocket API. Incoming requests to the gateway go through this point.

Enterprise

Enterprise APIs Government Accountability

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval

AWS Machine Learning

JANUARY 28, 2025

By documenting the specific model versions, fine-tuning parameters, and prompt engineering techniques employed, teams can better understand the factors contributing to their AI systems performance. Evaluation algorithm Computes evaluation metrics to model outputs. Different algorithms have different metrics to be specified.

Management

Management APIs Engineering Metrics

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

AWS Machine Learning

NOVEMBER 14, 2024

Customers can use the SageMaker Studio UI or APIs to specify the SageMaker Model Registry model to be shared and grant access to specific AWS accounts or to everyone in the organization. With this launch, customers can now seamlessly share and access ML models registered in SageMaker Model Registry between different AWS accounts.

Government

Government Management APIs Accountability

LLM-as-a-judge on Amazon Bedrock Model Evaluation

AWS Machine Learning

FEBRUARY 12, 2025

This approach allows organizations to assess their AI models effectiveness using pre-defined metrics, making sure that the technology aligns with their specific needs and objectives. Curated judge models : Amazon Bedrock provides pre-selected, high-quality evaluation models with optimized prompt engineering for accurate assessments.

Metrics

Metrics Engineering Benchmark APIs

Generate training data and cost-effectively train categorical models with Amazon Bedrock

AWS Machine Learning

MARCH 27, 2025

This requirement translates into time and effort investment of trained personnel, who could be support engineers or other technical staff, to review tens of thousands of support cases to arrive at an even distribution of 3,000 per category. If the use case doesnt yield discrete outputs, task-specific metrics are more appropriate.

Education

Education Engineering APIs Enterprise

Enhance deployment guardrails with inference component rolling updates for Amazon SageMaker AI inference

AWS Machine Learning

MARCH 25, 2025

Automated safety guards Integrated Amazon CloudWatch alarms monitor metrics on an inference component. AlarmName This CloudWatch alarm is configured to monitor metrics on an InferenceComponent. For more information about the SageMaker AI API, refer to the SageMaker AI API Reference.

APIs

APIs Engineering Accountability Metrics

How Formula 1® uses generative AI to accelerate race-day issue resolution

AWS Machine Learning

FEBRUARY 18, 2025

During these live events, F1 IT engineers must triage critical issues across its services, such as network degradation to one of its APIs. This impacts downstream services that consume data from the API, including products such as F1 TV, which offer live and on-demand coverage of every race as well as real-time telemetry.

APIs

APIs Engineering Metrics Big data

Few-shot prompt engineering and fine-tuning for LLMs in Amazon Bedrock

AWS Machine Learning

AUGUST 2, 2024

Investors and analysts closely watch key metrics like revenue growth, earnings per share, margins, cash flow, and projections to assess performance against peers and industry trends. Draft a comprehensive earnings call script that covers the key financial metrics, business highlights, and future outlook for the given quarter.

Engineering

Engineering Scripts Metrics APIs

Evaluate RAG responses with Amazon Bedrock, LlamaIndex and RAGAS

AWS Machine Learning

MARCH 6, 2025

Current RAG pipelines frequently employ similarity-based metrics such as ROUGE , BLEU , and BERTScore to assess the quality of the generated responses, which is essential for refining and enhancing the models capabilities. More sophisticated metrics are needed to evaluate factual alignment and accuracy.

Metrics

Metrics Enterprise APIs Engineering

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning

NOVEMBER 15, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Chatbots

Chatbots Engineering Enterprise Government

Unlock cost savings with the new scale down to zero feature in SageMaker Inference

AWS Machine Learning

DECEMBER 2, 2024

Use faster auto scaling metrics – Take advantage of more granular auto scaling metrics like ConcurrentRequestsPerCopy to more accurately monitor and react to changes in inference traffic. It’s a dynamic policy that adjusts the number of copies based on a specified metric, such as CPU utilization or request count.

APIs

APIs Best practices Metrics Engineering

Optimizing AI responsiveness: A practical guide to Amazon Bedrock latency-optimized inference

AWS Machine Learning

JANUARY 28, 2025

To effectively optimize AI applications for responsiveness, we need to understand the key metrics that define latency and how they impact user experience. These metrics differ between streaming and nonstreaming modes and understanding them is crucial for building responsive AI applications.

Benchmark

Benchmark APIs Engineering Metrics

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow

AWS Machine Learning

DECEMBER 9, 2024

However, keeping track of numerous experiments, their parameters, metrics, and results can be difficult, especially when working on complex projects simultaneously. SageMaker is a comprehensive, fully managed ML service designed to provide data scientists and ML engineers with the tools they need to handle the entire ML workflow.

Metrics

Metrics APIs Engineering Accountability

Empower your generative AI application with a comprehensive custom observability solution

AWS Machine Learning

OCTOBER 29, 2024

Observability refers to the ability to understand the internal state and behavior of a system by analyzing its outputs, logs, and metrics. Evaluation, on the other hand, involves assessing the quality and relevance of the generated outputs, enabling continual improvement.

Best practices

Best practices Feedback Metrics APIs

Benchmarking Amazon Nova and GPT-4o models with FloTorch

AWS Machine Learning

MARCH 11, 2025

How do Amazon Nova Micro and Amazon Nova Lite perform against GPT-4o mini in these same metrics? Amazon Bedrock APIs make it straightforward to use Amazon Titan Text Embeddings V2 for embedding data. Vector database FloTorch selected Amazon OpenSearch Service as a vector database for its high-performance metrics.

Benchmark

Benchmark APIs Enterprise Scripts

Unleashing the power of generative AI: Verisk’s journey to an Instant Insight Engine for enhanced customer support

AWS Machine Learning

MAY 9, 2024

Verisk has embraced this technology and has developed their own Instant Insight Engine, or AI companion, that provides an enhanced self-service capability to their FAST platform. First, they used the Amazon Kendra Retrieve API to get multiple relevant passages and excerpts based on keyword search.

Engineering

Engineering Customer Support APIs Government

Build an image search engine with Amazon Kendra and Amazon Rekognition

AWS Machine Learning

MAY 5, 2023

To address the problems associated with complex searches, this post describes in detail how you can achieve a search engine that is capable of searching for complex images by integrating Amazon Kendra and Amazon Rekognition. Users may have to manually filter out unsuitable image results when dealing with complex searches.

Engineering

Engineering APIs Scripts Enterprise

Model customization, RAG, or both: A case study with Amazon Nova

AWS Machine Learning

APRIL 10, 2025

Fine-tune an Amazon Nova model using the Amazon Bedrock API In this section, we provide detailed walkthroughs on fine-tuning and hosting customized Amazon Nova models using Amazon Bedrock. We first provided a detailed walkthrough on how to fine-tune, host, and conduct inference with customized Amazon Nova through the Amazon Bedrock API.

APIs

APIs Metrics Best practices Construction

Integrate dynamic web content in your generative AI application using a web search API and Amazon Bedrock Agents

AWS Machine Learning

SEPTEMBER 20, 2024

Amazon Bedrock agents use LLMs to break down tasks, interact dynamically with users, run actions through API calls, and augment knowledge using Amazon Bedrock Knowledge Bases. In this post, we demonstrate how to use Amazon Bedrock Agents with a web search API to integrate dynamic web content in your generative AI application.

APIs

APIs Chatbots Construction Engineering

Foundational vision models and visual prompt engineering for autonomous driving applications

AWS Machine Learning

NOVEMBER 15, 2023

Prompt engineering has become an essential skill for anyone working with large language models (LLMs) to generate high-quality and relevant texts. Although text prompt engineering has been widely discussed, visual prompt engineering is an emerging field that requires attention.

Engineering

Engineering APIs Automotive Entertainment

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

AWS Machine Learning

NOVEMBER 14, 2024

It enables you to privately customize the FM of your choice with your data using techniques such as fine-tuning, prompt engineering, and retrieval augmented generation (RAG) and build agents that run tasks using your enterprise systems and data sources while adhering to security and privacy requirements.

APIs

APIs Engineering Personalization Transportation

Evaluate large language models for your machine translation tasks on AWS

AWS Machine Learning

JANUARY 7, 2025

The solution proposed in this post relies on LLMs context learning capabilities and prompt engineering. The translation playground could be adapted into a scalable serverless solution as represented by the following diagram using AWS Lambda , Amazon Simple Storage Service (Amazon S3), and Amazon API Gateway.

Engineering

Engineering Metrics industry standards Finance

Transitioning off Amazon Lookout for Metrics

AWS Machine Learning

OCTOBER 9, 2024

Amazon Lookout for Metrics is a fully managed service that uses machine learning (ML) to detect anomalies in virtually any time-series business or operational metrics—such as revenue performance, purchase transactions, and customer acquisition and retention rates—with no ML experience required.

Metrics

Metrics APIs Engineering Accountability

Speed up your time series forecasting by up to 50 percent with Amazon SageMaker Canvas UI and AutoML APIs

AWS Machine Learning

SEPTEMBER 28, 2023

In this post, we describe the enhancements to the forecasting capabilities of SageMaker Canvas and guide you on using its user interface (UI) and AutoML APIs for time-series forecasting. While the SageMaker Canvas UI offers a code-free visual interface, the APIs empower developers to interact with these features programmatically.

APIs

APIs Construction Finance Enterprise

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

AWS Machine Learning

MARCH 14, 2025

Although automated metrics are fast and cost-effective, they can only evaluate the correctness of an AI response, without capturing other evaluation dimensions or providing explanations of why an answer is problematic. Human evaluation, although thorough, is time-consuming and expensive at scale.

Metrics

Metrics Best practices Engineering APIs

Pixtral Large is now available in Amazon Bedrock

AWS Machine Learning

APRIL 10, 2025

For instance, Pixtral Large is highly effective at spotting irregularities or insightful trends within training loss curves or performance metrics, enhancing the accuracy of data-driven decision-making. By choosing View API , you can also access the model using code examples in the AWS Command Line Interface (AWS CLI) and AWS SDKs.

APIs

APIs Management Marketing Engineering

How Vericast optimized feature engineering using Amazon SageMaker Processing

AWS Machine Learning

MAY 3, 2023

One aspect of this data preparation is feature engineering. Feature engineering refers to the process where relevant variables are identified, selected, and manipulated to transform the raw data into more useful and usable forms for use with the ML algorithm used to train a model and perform inference against it.

Engineering

Engineering Metrics APIs Big data

Enable pod-based GPU metrics in Amazon CloudWatch

AWS Machine Learning

SEPTEMBER 7, 2023

In February 2022, Amazon Web Services added support for NVIDIA GPU metrics in Amazon CloudWatch , making it possible to push metrics from the Amazon CloudWatch Agent to Amazon CloudWatch and monitor your code for optimal GPU utilization. Then we explore two architectures. already installed. eks-create.sh 19 private:192.168.128.0/19

Metrics

Metrics APIs Management Engineering

Derive generative AI powered insights from Alation Cloud Services using Amazon Q Business Custom Connector

AWS Machine Learning

FEBRUARY 25, 2025

Amazon Q Business only provides metric information that you can use to monitor your data source sync jobs. With the connector ready, move over to the SageMaker Studio notebook and perform data synchronization operations by invoking Amazon Q Business APIs. secrets_manager_client = boto3.client('secretsmanager')

Enterprise

Enterprise Engineering APIs Accountability

Enhance conversational AI with advanced routing techniques with Amazon Bedrock

AWS Machine Learning

APRIL 24, 2024

Conversational artificial intelligence (AI) assistants are engineered to provide precise, real-time responses through intelligent routing of queries to the most suitable AI functions. With AWS generative AI services like Amazon Bedrock , developers can create systems that expertly manage and respond to user requests. What is an AI assistant?

APIs

APIs Engineering Metrics Management

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning

NOVEMBER 19, 2024

Leave the session inspired to bring Amazon Q Apps to supercharge your teams’ productivity engines. Gain insights into training strategies, productivity metrics, and real-world use cases to empower your developers to harness the full potential of this game-changing technology.

APIs

APIs Enterprise Best practices Government

Build an Amazon SageMaker Model Registry approval and promotion workflow with human intervention

AWS Machine Learning

JANUARY 10, 2024

Specialist Data Engineering at Merck, and Prabakaran Mathaiyan, Sr. ML Engineer at Tiger Analytics. The solution uses AWS Lambda , Amazon API Gateway , Amazon EventBridge , and SageMaker to automate the workflow with human approval intervention in the middle. API Gateway invokes a Lambda function to initiate model updates.

APIs

APIs Construction Engineering Analytics

Achieve operational excellence with well-architected generative AI solutions using Amazon Bedrock

AWS Machine Learning

OCTOBER 2, 2024

It’s a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like Anthropic, Cohere, Meta, Mistral AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Government

Government APIs Enterprise Best practices

Achieve DevOps maturity with BMC AMI zAdviser Enterprise and Amazon Bedrock

AWS Machine Learning

MARCH 27, 2024

In software engineering, there is a direct correlation between team performance and building robust, stable applications. The data community aims to adopt the rigorous engineering principles commonly used in software development into their own practices, which includes systematic approaches to design, development, testing, and maintenance.

Enterprise

Enterprise APIs Metrics Engineering

Fine-tune Anthropic’s Claude 3 Haiku in Amazon Bedrock to boost model accuracy and quality

AWS Machine Learning

JULY 10, 2024

This process enhances task-specific model performance, allowing the model to handle custom use cases with task-specific performance metrics that meet or surpass more powerful models like Anthropic Claude 3 Sonnet or Anthropic Claude 3 Opus. Under Output data , for S3 location , enter the S3 path for the bucket storing fine-tuning metrics.

APIs

APIs Airlines Metrics Engineering

Mistral-Small-24B-Instruct-2501 is now available on SageMaker Jumpstart and Amazon Bedrock Marketplace

AWS Machine Learning

FEBRUARY 24, 2025

Performance metrics and benchmarks According to Mistral, the instruction-tuned version of the model achieves over 81% accuracy on Massive Multitask Language Understanding (MMLU) with 150 tokens per second latency, making it currently the most efficient model in its category. It doesnt support Converse APIs or other Amazon Bedrock tooling.

APIs

APIs Enterprise Benchmark Feedback

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

AWS Machine Learning

MAY 30, 2024

AWS Prototyping successfully delivered a scalable prototype, which solved CBRE’s business problem with a high accuracy rate (over 95%) and supported reuse of embeddings for similar NLQs, and an API gateway for integration into CBRE’s dashboards. The following diagram illustrates the web interface and API management layer.

Real estate

Real estate APIs Metrics Construction

Redacting PII data at The Very Group with Amazon Comprehend

AWS Machine Learning

JANUARY 12, 2023

This is guest post by Andy Whittle, Principal Platform Engineer – Application & Reliability Frameworks at The Very Group. The overriding goal for The Very Group’s engineering team was to prevent any PII data from reaching documents within Elasticsearch. Overview of solution. Some decisions had to be made to enable the solution.

Engineering

Engineering APIs Accountability Metrics

Is your model good? A deep dive into Amazon SageMaker Canvas advanced metrics

AWS Machine Learning

JULY 31, 2023

It also enables you to evaluate the models using advanced metrics as if you were a data scientist. In this post, we show how a business analyst can evaluate and understand a classification churn model created with SageMaker Canvas using the Advanced metrics tab. The F1 score provides a balanced evaluation of the model’s performance.

Metrics

Metrics Engineering Accountability Telecommunications

How IDIADA optimized its intelligent chatbot with Amazon Bedrock

AWS Machine Learning

FEBRUARY 25, 2025

In 2021, Applus+ IDIADA , a global partner to the automotive industry with over 30 years of experience supporting customers in product development activities through design, engineering, testing, and homologation services, established the Digital Solutions department. The batch size is set to 64.

Chatbots

Chatbots Engineering Automotive Metrics

Amazon Bedrock launches Session Management APIs for generative AI applications (Preview)

Build a video insights and summarization engine using generative AI with Amazon Bedrock

Trending Sources

Build a reverse image search engine with Amazon Titan Multimodal Embeddings in Amazon Bedrock and AWS managed services

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

Build a multi-tenant generative AI environment for your enterprise on AWS

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

LLM-as-a-judge on Amazon Bedrock Model Evaluation

Generate training data and cost-effectively train categorical models with Amazon Bedrock

Enhance deployment guardrails with inference component rolling updates for Amazon SageMaker AI inference

How Formula 1® uses generative AI to accelerate race-day issue resolution

Few-shot prompt engineering and fine-tuning for LLMs in Amazon Bedrock

Evaluate RAG responses with Amazon Bedrock, LlamaIndex and RAGAS

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

Unlock cost savings with the new scale down to zero feature in SageMaker Inference

Optimizing AI responsiveness: A practical guide to Amazon Bedrock latency-optimized inference

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow

Empower your generative AI application with a comprehensive custom observability solution

Benchmarking Amazon Nova and GPT-4o models with FloTorch

Unleashing the power of generative AI: Verisk’s journey to an Instant Insight Engine for enhanced customer support

Build an image search engine with Amazon Kendra and Amazon Rekognition

Model customization, RAG, or both: A case study with Amazon Nova

Integrate dynamic web content in your generative AI application using a web search API and Amazon Bedrock Agents

Foundational vision models and visual prompt engineering for autonomous driving applications

Revolutionize trip planning with Amazon Bedrock and Amazon Location Service

Evaluate large language models for your machine translation tasks on AWS

Transitioning off Amazon Lookout for Metrics

Speed up your time series forecasting by up to 50 percent with Amazon SageMaker Canvas UI and AutoML APIs

Evaluating RAG applications with Amazon Bedrock knowledge base evaluation

Pixtral Large is now available in Amazon Bedrock

How Vericast optimized feature engineering using Amazon SageMaker Processing

Enable pod-based GPU metrics in Amazon CloudWatch

Derive generative AI powered insights from Alation Cloud Services using Amazon Q Business Custom Connector

Enhance conversational AI with advanced routing techniques with Amazon Bedrock

Your guide to generative AI and ML at AWS re:Invent 2024

Build an Amazon SageMaker Model Registry approval and promotion workflow with human intervention

Achieve operational excellence with well-architected generative AI solutions using Amazon Bedrock

Achieve DevOps maturity with BMC AMI zAdviser Enterprise and Amazon Bedrock

Fine-tune Anthropic’s Claude 3 Haiku in Amazon Bedrock to boost model accuracy and quality

Mistral-Small-24B-Instruct-2501 is now available on SageMaker Jumpstart and Amazon Bedrock Marketplace

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

Redacting PII data at The Very Group with Amazon Comprehend

Is your model good? A deep dive into Amazon SageMaker Canvas advanced metrics

How IDIADA optimized its intelligent chatbot with Amazon Bedrock

Stay Connected