How do Amazon Nova Micro and Amazon Nova Lite perform against GPT-4o mini on these same metrics? Amazon Bedrock APIs make it straightforward to use Amazon Titan Text Embeddings V2 for embedding data. As its vector database, FloTorch selected Amazon OpenSearch Service because of its high performance.
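As an illustration of the embedding step, here is a minimal sketch of calling Titan Text Embeddings V2 through the Bedrock Runtime API; the Region and input text are assumptions, and the model ID shown is the publicly documented amazon.titan-embed-text-v2:0.

import json
import boto3

# Minimal sketch: invoke Amazon Titan Text Embeddings V2 through the Bedrock Runtime API.
# The Region and input text are assumptions for illustration.
bedrock_runtime = boto3.client("bedrock-runtime", region_name="us-east-1")

response = bedrock_runtime.invoke_model(
    modelId="amazon.titan-embed-text-v2:0",
    body=json.dumps({"inputText": "FloTorch benchmarks RAG pipelines on Amazon Bedrock."}),
)

embedding = json.loads(response["body"].read())["embedding"]
print(len(embedding))  # Titan Text Embeddings V2 returns a 1,024-dimensional vector by default

The resulting vectors can then be indexed into an OpenSearch Service vector index for retrieval.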
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.
Amazon Bedrock agents use LLMs to break down tasks, interact dynamically with users, run actions through API calls, and augment knowledge using Amazon Bedrock Knowledge Bases. In this post, we demonstrate how to use Amazon Bedrock Agents with a web search API to integrate dynamic web content in your generative AI application.
Amazon Rekognition has two sets of APIs that help you moderate images or videos to keep digital communities safe and engaged. Some customers have asked if they could use this approach to moderate videos by sampling image frames and sending them to the Amazon Rekognition image moderation API.
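For reference, a sampled frame can be sent to the image moderation API roughly as follows. This is a sketch: OpenCV is used here only as an assumed way to grab a frame locally, and the video file name is a placeholder.

import boto3
import cv2  # assumption: OpenCV is used only to sample frames from a local file

rekognition = boto3.client("rekognition")

# Grab one frame from a local video file and submit it to the image moderation API
capture = cv2.VideoCapture("sample_video.mp4")
success, frame = capture.read()
if success:
    _, jpeg = cv2.imencode(".jpg", frame)
    response = rekognition.detect_moderation_labels(
        Image={"Bytes": jpeg.tobytes()},
        MinConfidence=60,
    )
    for label in response["ModerationLabels"]:
        print(label["Name"], label["Confidence"])
capture.release()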
Investors and analysts closely watch key metrics like revenue growth, earnings per share, margins, cash flow, and projections to assess performance against peers and industry trends. Traditionally, earnings call scripts have followed similar templates, making it a repeatable task to generate them from scratch each time.
The retrieve_and_generate API does both the retrieval and a call to an FM (Amazon Titan or Anthropic's Claude family of models on Amazon Bedrock), for a fully managed solution. Mean Reciprocal Rank (MRR) – This metric considers the ranking of the retrieved documents. More advanced models such as Anthropic's Claude 3.5 Sonnet
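The call itself is brief; here is a minimal sketch using the Bedrock agent runtime client, where the knowledge base ID and model ARN are placeholders, not values from the post.

import boto3

# Sketch of a fully managed RAG call with the RetrieveAndGenerate API.
# The knowledge base ID and model ARN below are placeholders.
client = boto3.client("bedrock-agent-runtime")

response = client.retrieve_and_generate(
    input={"text": "What were the key revenue drivers last quarter?"},
    retrieveAndGenerateConfiguration={
        "type": "KNOWLEDGE_BASE",
        "knowledgeBaseConfiguration": {
            "knowledgeBaseId": "KB1234567890",
            "modelArn": "arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-5-sonnet-20240620-v1:0",
        },
    },
)
print(response["output"]["text"])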
Where discrete outcomes with labeled data exist, standard ML metrics such as precision and recall, or other classic measures, can be used. These metrics provide high precision but are limited to specific use cases due to limited ground truth data. If the use case doesn't yield discrete outputs, task-specific metrics are more appropriate.
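For the labeled-data case, a minimal sketch of computing precision and recall with scikit-learn follows; the labels are toy values, not taken from any real workload.

from sklearn.metrics import precision_score, recall_score

# Toy ground truth and predictions for a binary classification use case
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]

# precision = 3 TP / (3 TP + 1 FP) = 0.75; recall = 3 TP / (3 TP + 1 FN) = 0.75
print("precision:", precision_score(y_true, y_pred))
print("recall:", recall_score(y_true, y_pred))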
This post shows you how to use an integrated solution with Amazon Lookout for Metrics and Amazon Kinesis Data Firehose to break these barriers by quickly and easily ingesting streaming data, and subsequently detecting anomalies in the key performance indicators that interest you. You don't need ML experience to use Lookout for Metrics.
Image 2: Hugging Face NLP model inference performance improvement with torch.compile on an AWS Graviton3-based c7g instance, using Hugging Face example scripts. This section shows how to run inference in eager and torch.compile modes using torch Python wheels and benchmarking scripts from the Hugging Face and TorchBench repositories.
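A minimal sketch of comparing the two modes is shown below; the model choice, warmup, and timing loop are illustrative and are not the benchmarking scripts the figure refers to.

import time
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Illustrative model; the blog's numbers come from Hugging Face example scripts
model_name = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name).eval()

inputs = tokenizer("torch.compile speeds up inference on Graviton3", return_tensors="pt")

def benchmark(m, iters=50):
    with torch.no_grad():
        m(**inputs)                      # warmup (triggers compilation in compile mode)
        start = time.time()
        for _ in range(iters):
            m(**inputs)
    return (time.time() - start) / iters

eager_latency = benchmark(model)
compiled_model = torch.compile(model)    # default TorchInductor backend
compiled_latency = benchmark(compiled_model)
print(f"eager: {eager_latency*1000:.1f} ms, compiled: {compiled_latency*1000:.1f} ms")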
AWS Prototyping successfully delivered a scalable prototype that solved CBRE's business problem with a high accuracy rate (over 95%), supported reuse of embeddings for similar NLQs, and provided an API gateway for integration into CBRE's dashboards. The following diagram illustrates the web interface and API management layer.
The first allows you to run a Python script from any server or instance including a Jupyter notebook; this is the quickest way to get started. In the following sections, we first describe the script solution, followed by the AWS CDK construct solution. The following diagram illustrates the sequence of events within the script.
All the training and evaluation metrics were inspected manually from Amazon Simple Storage Service (Amazon S3). The code to invoke the pipeline script is available in the Studio notebooks, and we can change the hyperparameters and input/output when invoking the pipeline.
The main AWS services used are SageMaker, Amazon EMR, AWS CodeBuild, Amazon Simple Storage Service (Amazon S3), Amazon EventBridge, AWS Lambda, and Amazon API Gateway. Real-time recommendation inference: the inference phase consists of the following steps, beginning with the client application making an inference request to the API gateway.
Consequently, no other testing solution can provide the range and depth of testing metrics and analytics. And testingRTC offers multiple ways to export these metrics, from direct collection from webhooks, to downloading results in CSV format using the REST API. Happy days! You can check framerate information for video here too.
Lastly, the model is tested against a set of known genome sequences using some inference API calls. Training on SageMaker: we use PyTorch and Amazon SageMaker script mode to train this model. Script mode's compatibility with PyTorch was crucial, allowing us to use our existing scripts with minimal modifications.
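A hedged sketch of launching a script mode training job is shown below; the entry point name, framework version, role, instance settings, and S3 paths are assumptions for illustration, not values from the post.

from sagemaker.pytorch import PyTorch

# Sketch of a SageMaker script mode training job with placeholder settings
estimator = PyTorch(
    entry_point="train.py",            # existing PyTorch training script, minimally modified
    source_dir="src",
    role="arn:aws:iam::111122223333:role/SageMakerExecutionRole",
    framework_version="2.1",
    py_version="py310",
    instance_count=1,
    instance_type="ml.g5.2xlarge",
    hyperparameters={"epochs": 3, "batch-size": 32},
)

estimator.fit({"train": "s3://my-bucket/genome-sequences/train/"})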
Metrics allow teams to understand workload behavior and optimize resource allocation and utilization, diagnose anomalies, and increase overall infrastructure efficiency. Metrics are exposed to Amazon Managed Service for Prometheus by the neuron-monitor DaemonSet, which deploys a minimal container, with the Neuron tools installed.
Amazon Rekognition makes it easy to add image analysis capability to your applications without any machine learning (ML) expertise and comes with various APIs to fulfill use cases such as object detection, content moderation, face detection and analysis, and text and celebrity recognition, which we use in this example.
Amazon Q Business only provides metric information that you can use to monitor your data source sync jobs. We recommend running similar scripts only on your own data sources after consulting with the team that manages them, or being sure to follow the terms of service for the sources that you're trying to fetch data from.
Here are some of the features we will cover: AWS CloudFormation support; private network policies for Amazon OpenSearch Serverless; multiple S3 buckets as data sources; Service Quotas support; and hybrid search, metadata filters, custom prompts for the RetrieveAndGenerate API, and maximum number of retrievals.
From there, we dive into how you can track and understand the metrics and performance of the SageMaker endpoint using Amazon CloudWatch metrics. Metrics to track: before we can get into load testing, it's essential to know which metrics to track in order to understand the performance breakdown of your SageMaker endpoint.
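As an example of pulling those CloudWatch metrics programmatically, the sketch below queries invocation counts and model latency over the last hour; the endpoint and variant names are placeholders.

import boto3
from datetime import datetime, timedelta

cloudwatch = boto3.client("cloudwatch")

# Placeholder endpoint and variant names
dimensions = [
    {"Name": "EndpointName", "Value": "my-endpoint"},
    {"Name": "VariantName", "Value": "AllTraffic"},
]

for metric, stat in [("Invocations", "Sum"), ("ModelLatency", "Average")]:
    response = cloudwatch.get_metric_statistics(
        Namespace="AWS/SageMaker",
        MetricName=metric,
        Dimensions=dimensions,
        StartTime=datetime.utcnow() - timedelta(hours=1),
        EndTime=datetime.utcnow(),
        Period=60,
        Statistics=[stat],
    )
    print(metric, [point[stat] for point in response["Datapoints"]])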
Continuous integration and continuous delivery (CI/CD) pipeline – Using the customer's GitHub repository enabled code versioning and automated scripts to launch pipeline deployment whenever new versions of the code are committed. Wipro has used the input filter and join functionality of the SageMaker batch transform API.
Solution overview: in this section, we present a generic architecture that is similar to the one we use for our own workloads, which allows elastic deployment of models using efficient auto scaling based on custom metrics. The reverse proxy collects metrics about calls to the service and exposes them via a standard metrics API to Prometheus.
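To illustrate the idea of exposing call metrics in a Prometheus-friendly way, here is a small sketch using the prometheus_client library; the metric names and the simulated model call are made up and not taken from the post.

import random
import time
from prometheus_client import Counter, Histogram, start_http_server

# Hypothetical metric names; Prometheus scrapes them from the /metrics endpoint on port 8000
REQUEST_COUNT = Counter("model_requests_total", "Calls forwarded to the model service")
REQUEST_LATENCY = Histogram("model_request_latency_seconds", "Latency of model calls")

def handle_request():
    REQUEST_COUNT.inc()
    with REQUEST_LATENCY.time():
        time.sleep(random.uniform(0.01, 0.05))  # stand-in for the proxied model call

if __name__ == "__main__":
    start_http_server(8000)  # exposes metrics at http://localhost:8000/metrics
    while True:
        handle_request()

An auto scaling policy can then key off these custom metrics rather than generic CPU utilization.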
For a quantitative analysis of the generated impression, we use ROUGE (Recall-Oriented Understudy for Gisting Evaluation), the most commonly used metric for evaluating summarization. This metric compares an automatically produced summary against a reference (human-produced) summary or a set of reference summaries or translations.
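A quick sketch of computing ROUGE with the rouge_score package follows; the reference and candidate strings are toy examples, not from the study.

from rouge_score import rouge_scorer

# Toy example: compare a generated impression against a human-written reference
reference = "no acute cardiopulmonary abnormality"
candidate = "no acute cardiopulmonary process"

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)
scores = scorer.score(reference, candidate)

for name, score in scores.items():
    print(f"{name}: precision={score.precision:.2f} recall={score.recall:.2f} f1={score.fmeasure:.2f}")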
Together with the implementation details in a corresponding example Jupyter notebook, you will have tools available to perform model selection by exploring Pareto frontiers, where improving one performance metric, such as accuracy, is not possible without worsening another metric, such as throughput.
An asynchronous API and Amazon OpenSearch Service connector make it easy to integrate the model into your neural search applications. Before you can write scripts that use the Amazon Bedrock API, you need to install the appropriate version of the AWS SDK in your environment. The vectors power speedy, accurate search experiences.
Let's delve into a basic Colang script to see how it works:

define user express greeting
  "hello"
  "hi"
  "what's up?"

define flow greeting
  user express greeting
  bot express greeting
  bot ask how are you

In this script, we see the three fundamental types of blocks in Colang: User Message Blocks (define user): these define possible user inputs.
If the model changes on the server side, the client has to know and change its API call to the new endpoint accordingly. Based on these metrics, an informed decision can be made. Clone the GitHub repository: the GitHub repo provides all the scripts necessary to deploy models using FastAPI on NeuronCores on AWS Inferentia instances.
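For orientation, a stripped-down FastAPI serving app might look like the sketch below; the route, request model, and placeholder inference function are assumptions, and the actual repo loads compiled Neuron models per NeuronCore instead of the stub used here.

from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class InferenceRequest(BaseModel):
    text: str

# In the real solution, a compiled Neuron model would be loaded per NeuronCore;
# this placeholder function stands in for that model call.
def run_model(text: str) -> str:
    return f"echo: {text}"

@app.post("/predict")
def predict(request: InferenceRequest):
    return {"prediction": run_model(request.text)}

# Run with: uvicorn app:app --host 0.0.0.0 --port 8080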
Autopilot training jobs start their own dedicated SageMaker backend processes, and dedicated SageMaker API calls are required to start new training jobs, monitor training job statuses, and invoke trained Autopilot models. We use a Lambda step because the API call to Autopilot is lightweight. The script creates an Autopilot job.
For this, we use AWS Step Functions, a serverless workflow service that provides us with API integrations to quickly orchestrate and visualize the steps in our workflow. Use the scripts created in step one as part of the processing and training steps. We started by creating command line scripts from the experiment code.
The ML components for data ingestion, preprocessing, and model training were available as disjointed Python scripts and notebooks, which required a lot of manual heavy lifting on the part of engineers. This step produces an expanded report containing the model’s metrics.
It provides a suite of tools for visualizing training metrics, examining model architectures, exploring embeddings, and more. When they create a SageMaker training job, domain users can access TensorBoard through the SageMaker Python SDK or the Boto3 API. The training script is paired with simple_tensorboard.ipynb, which launches the SageMaker training job.
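A hedged sketch of wiring TensorBoard output into a training job with the SageMaker Python SDK follows; the S3 paths, role, entry point, and instance settings are placeholders.

from sagemaker.debugger import TensorBoardOutputConfig
from sagemaker.pytorch import PyTorch

# Write TensorBoard event files to S3 so they can be visualized from SageMaker
tensorboard_output_config = TensorBoardOutputConfig(
    s3_output_path="s3://my-bucket/tensorboard-logs",
    container_local_output_path="/opt/ml/output/tensorboard",
)

estimator = PyTorch(
    entry_point="train.py",
    role="arn:aws:iam::111122223333:role/SageMakerExecutionRole",
    framework_version="2.1",
    py_version="py310",
    instance_count=1,
    instance_type="ml.m5.xlarge",
    tensorboard_output_config=tensorboard_output_config,
)
estimator.fit("s3://my-bucket/training-data/")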
Dataset collection We followed the methodology outlined in the PMC-Llama paper [6] to assemble our dataset, which includes PubMed papers sourced from the Semantic Scholar API and various medical texts cited within the paper, culminating in a comprehensive collection of 88 billion tokens. Create and launch ParallelCluster in the VPC.
SageMaker services, such as Processing, Training, and Hosting, collect metrics and logs from the running instances and push them to users’ Amazon CloudWatch accounts. One example is performing a metric query on the SageMaker job host’s utilization metrics when a job completion event is received.
A new optional parameter TableFormat can be set either interactively using Amazon SageMaker Studio or through code using the API or the SDK. The following code snippet shows you how to create a feature group using the Iceberg format and FeatureGroup.create API of the SageMaker SDK. You can find the sample script in GitHub.
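The snippet itself is not included in this excerpt; a minimal sketch, assuming placeholder names, data, and buckets, might look like this:

import pandas as pd
import sagemaker
from sagemaker.feature_store.feature_group import FeatureGroup, TableFormatEnum

session = sagemaker.Session()

# Placeholder data frame; feature definitions are inferred from its dtypes
df = pd.DataFrame({
    "record_id": pd.Series(["1", "2"], dtype="string"),
    "event_time": pd.Series([1700000000.0, 1700000001.0], dtype="float64"),
    "score": pd.Series([0.7, 0.9], dtype="float64"),
})

feature_group = FeatureGroup(name="my-iceberg-feature-group", sagemaker_session=session)
feature_group.load_feature_definitions(data_frame=df)

feature_group.create(
    s3_uri="s3://my-bucket/feature-store",
    record_identifier_name="record_id",
    event_time_feature_name="event_time",
    role_arn="arn:aws:iam::111122223333:role/SageMakerExecutionRole",
    enable_online_store=False,
    table_format=TableFormatEnum.ICEBERG,   # the new optional TableFormat parameter
)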
If it detects error messages specifically related to the Neuron device (which is the Trainium or AWS Inferentia chip), it will change NodeCondition to NeuronHasError on the Kubernetes API server. The node recovery agent is a separate component that periodically checks the Prometheus metrics exposed by the node problem detector.
Amazon SageMaker Automatic Model Tuning runs the training and collects metrics and logs; defining the right objective metric to match your task is key. When our tuning job is complete, we look at some of the methods available to explore the results, both via the AWS Management Console and programmatically via the AWS SDKs and APIs.
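To make the objective metric concrete, here is a hedged sketch of a tuning job definition; the estimator settings, metric regex, hyperparameter ranges, and S3 paths are assumptions for illustration.

from sagemaker.pytorch import PyTorch
from sagemaker.tuner import HyperparameterTuner, ContinuousParameter, IntegerParameter

# Placeholder estimator; any SageMaker estimator that emits the metric works
estimator = PyTorch(
    entry_point="train.py",
    role="arn:aws:iam::111122223333:role/SageMakerExecutionRole",
    framework_version="2.1",
    py_version="py310",
    instance_count=1,
    instance_type="ml.m5.xlarge",
)

tuner = HyperparameterTuner(
    estimator=estimator,
    objective_metric_name="validation:accuracy",
    objective_type="Maximize",
    # The regex must match a line the training script prints, for example: val_accuracy=0.91
    metric_definitions=[{"Name": "validation:accuracy", "Regex": "val_accuracy=([0-9\\.]+)"}],
    hyperparameter_ranges={
        "learning-rate": ContinuousParameter(1e-5, 1e-2),
        "epochs": IntegerParameter(2, 10),
    },
    max_jobs=20,
    max_parallel_jobs=4,
)
tuner.fit({"train": "s3://my-bucket/train/", "validation": "s3://my-bucket/validation/"})
print(tuner.analytics().dataframe().head())  # explore results programmatically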
In this post, we address these limitations by implementing the access control outside of the MLflow server and offloading authentication and authorization tasks to Amazon API Gateway, where we implement fine-grained access control mechanisms at the resource level using Identity and Access Management (IAM). Adds an IAM authorizer.
This text-to-video API generates high-quality, realistic videos quickly from text and images. Customizable environment – SageMaker HyperPod offers the flexibility to customize your cluster environment using lifecycle scripts. Video generation has become the latest frontier in AI research, following the success of text-to-image models.
The goal of NAS is to find the optimal architecture for a given problem by searching over a large set of candidate architectures using techniques such as gradient-free optimization or by optimizing the desired metrics. The performance of the architecture is typically measured using metrics such as validation loss.
After you upload a small set of training images, Amazon Rekognition automatically loads and inspects the training data, selects the right ML algorithms, trains a model, and provides model performance metrics. A Python script is used to aid in the process of uploading the datasets and generating the manifest file.
Triton with the PyTorch backend: the PyTorch backend is designed to run TorchScript models using the PyTorch C++ API. Alternatively, you can use ensemble models or business logic scripting. A file in the workspace directory contains scripts to load and save a PyTorch model. The example also creates the Boto3 clients it needs:

sm_client = boto3.client(service_name="sagemaker")
runtime_sm_client = boto3.client("sagemaker-runtime")
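Building on those clients, a call to a deployed Triton endpoint might look like the following sketch; the endpoint name, tensor shape, and payload format are assumptions based on the KServe V2-style request that Triton accepts, not values from the post.

import json
import boto3

runtime_sm_client = boto3.client("sagemaker-runtime")

# Hypothetical endpoint name and input shape; the Triton PyTorch backend expects
# V2-style tensor names such as INPUT__0 / OUTPUT__0.
payload = {
    "inputs": [
        {"name": "INPUT__0", "shape": [1, 3, 224, 224], "datatype": "FP32",
         "data": [0.0] * (3 * 224 * 224)}
    ]
}

response = runtime_sm_client.invoke_endpoint(
    EndpointName="triton-pytorch-endpoint",
    ContentType="application/octet-stream",
    Body=json.dumps(payload),
)
result = json.loads(response["Body"].read().decode())
print(result["outputs"][0]["shape"])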
This post walks you through an example of how to track your experiments across code, data, artifacts, and metrics by using Amazon SageMaker Experiments in conjunction with Data Version Control (DVC). In each individual experiment, we track input and output artifacts, code, and metrics using SageMaker Experiments.
Evaluate model performance on the hold-out test data with various evaluation metrics. This notebook demonstrates how to use the JumpStart API for text classification. After the fine-tuning job is complete, we deploy the model, run inference on the hold-out test dataset, and compute evaluation metrics.
The data scientist then needs to review and manually approve the latest version of the model in the Amazon SageMaker Studio UI or via an API call using the AWS Command Line Interface (AWS CLI) or the AWS SDK for Python (Boto3) before the new version of the model can be used for inference.