2024, APIs and Metrics - Customer Contact Central

Unlock cost savings with the new scale down to zero feature in SageMaker Inference

AWS Machine Learning

DECEMBER 2, 2024

Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon SageMaker inference endpoints: the ability to scale SageMaker inference endpoints to zero instances. These sub-minute metrics can help trigger scale-out actions more precisely, reducing the number of NoCapacityInvocationFailures your users might experience.

APIs

APIs Best practices Metrics Engineering

Benchmarking Amazon Nova and GPT-4o models with FloTorch

AWS Machine Learning

MARCH 11, 2025

OpenAI launched GPT-4o in May 2024, and Amazon introduced Amazon Nova models at AWS re:Invent in December 2024. How do Amazon Nova Micro and Amazon Nova Lite perform against GPT-4o mini in these same metrics? Amazon Bedrock APIs make it straightforward to use Amazon Titan Text Embeddings V2 for embedding data.

Benchmark

Benchmark APIs Enterprise Scripts

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning

NOVEMBER 19, 2024

Gain insights into training strategies, productivity metrics, and real-world use cases to empower your developers to harness the full potential of this game-changing technology. Discover how to create and manage evaluation jobs, use automatic and human reviews, and analyze critical metrics like accuracy, robustness, and toxicity.

APIs

APIs Enterprise Best practices Government

Principal Financial Group uses QnABot on AWS and Amazon Q Business to enhance workforce productivity with generative AI

AWS Machine Learning

NOVEMBER 15, 2024

In business for 145 years, Principal is helping approximately 64 million customers (as of Q2, 2024) plan, protect, invest, and retire, while working to support the communities where it does business and build a diverse, inclusive workforce. The platform has delivered strong results across several key metrics.

Chatbots

Chatbots Engineering Enterprise Government

GraphStorm 0.3: Scalable, multi-task learning on graphs with user-friendly APIs

AWS Machine Learning

AUGUST 2, 2024

Specifically, GraphStorm 0.3 adds new APIs to customize GraphStorm pipelines: you now only need 12 lines of code to implement a custom node classification training loop. To help you get started with the new API, we have published two Jupyter notebook examples: one for node classification, and one for a link prediction task.

APIs

APIs Benchmark Construction Enterprise

How Formula 1® uses generative AI to accelerate race-day issue resolution

AWS Machine Learning

FEBRUARY 18, 2025

During these live events, F1 IT engineers must triage critical issues across its services, such as network degradation to one of its APIs. This impacts downstream services that consume data from the API, including products such as F1 TV, which offer live and on-demand coverage of every race as well as real-time telemetry.

APIs

APIs Engineering Metrics Big data

Build your gen AI–based text-to-SQL application using RAG, powered by Amazon Bedrock (Claude 3 Sonnet and Amazon Titan for embedding)

AWS Machine Learning

MARCH 18, 2025

Additionally, the complexity increases due to the presence of synonyms for columns and internal metrics available. I am creating a new metric and need the sales data. Firstly, LLMs don’t have access to enterprise databases, and the models need to be customized to understand the specific database of an enterprise.

Enterprise

Enterprise Chatbots Engineering Metrics

Optimizing AI responsiveness: A practical guide to Amazon Bedrock latency-optimized inference

AWS Machine Learning

JANUARY 28, 2025

During re:Invent 2024, we launched latency-optimized inference for foundation models (FMs) in Amazon Bedrock. To effectively optimize AI applications for responsiveness, we need to understand the key metrics that define latency and how they impact user experience. These metrics are shown in the following diagram.

Benchmark

Benchmark APIs Engineering Metrics

Mistral-Small-24B-Instruct-2501 is now available on SageMaker Jumpstart and Amazon Bedrock Marketplace

AWS Machine Learning

FEBRUARY 24, 2025

The 2501 version follows previous iterations (Mistral-Small-2409 and Mistral-Small-2402) released in 2024, incorporating improvements in instruction-following and reliability. At the time of writing this post, you can use the InvokeModel API to invoke the model. It doesn’t support Converse APIs or other Amazon Bedrock tooling.

APIs

APIs Enterprise Benchmark Feedback

Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker

AWS Machine Learning

DECEMBER 2, 2024

At re:Invent 2024, we are excited to announce new capabilities to speed up your AI inference workloads with NVIDIA accelerated computing and software offerings on Amazon SageMaker. This post is co-written with Abhishek Sawarkar, Eliuth Triana, Jiahong Liu and Kshitiz Gupta from NVIDIA.

Enterprise

Enterprise Benchmark Technology APIs

Node problem detection and recovery for AWS Neuron nodes within Amazon EKS clusters

AWS Machine Learning

JULY 25, 2024

If it detects error messages specifically related to the Neuron device (which is the Trainium or AWS Inferentia chip), it will change NodeCondition to NeuronHasError on the Kubernetes API server. The node recovery agent is a separate component that periodically checks the Prometheus metrics exposed by the node problem detector.

Construction

Construction Metrics Scripts Engineering

Governing the ML lifecycle at scale: Centralized observability with Amazon SageMaker and Amazon CloudWatch

AWS Machine Learning

OCTOBER 29, 2024

SageMaker Model Monitor emits per-feature metrics to Amazon CloudWatch, which you can use to set up dashboards and alerts. You can use cross-account observability in CloudWatch to search, analyze, and correlate cross-account telemetry data stored in CloudWatch such as metrics, logs, and traces from one centralized account.

Government

Government Metrics Accountability APIs

Secure AccountantAI Chatbot: Lili’s journey with Amazon Bedrock

AWS Machine Learning

JULY 18, 2024

The ingestion workflow transforms these curated questions into vector embeddings using Amazon Titan Text Embeddings model API. Specific accounting knowledge that is relevant to the question and the model is not familiar with, such as updated data for 2024. The vector embeddings are persisted in the application in-memory vector store.

Chatbots

Chatbots APIs Accountability Finance

Introducing Amazon EKS support in Amazon SageMaker HyperPod

AWS Machine Learning

SEPTEMBER 11, 2024

Amazon EKS creates a highly available endpoint for the managed Kubernetes API server that you use to communicate with your cluster (using tools like kubectl). The managed endpoint uses Network Load Balancer to load balance Kubernetes API servers. This VPC doesn’t appear in the customer account. Replace the Instance.

APIs

APIs Accountability Metrics Management

Best practices for building robust generative AI applications with Amazon Bedrock Agents – Part 1

AWS Machine Learning

OCTOBER 2, 2024

In addition, they use the developer-provided instruction to create an orchestration plan and then carry out the plan by invoking company APIs and accessing knowledge bases using Retrieval Augmented Generation (RAG) to provide an answer to the user’s request. user id 111 Today: 09/03/2024 Certainly! Your appointment ID is XXXX.

Best practices

Best practices APIs Metrics Accountability

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

AWS Machine Learning

APRIL 29, 2024

In 2024, however, organizations are using large language models (LLMs), which require relatively little focus on NLP, shifting research and development from modeling to the infrastructure needed to support LLM workflows. This often means the method of using a third-party LLM API won’t do for security, control, and scale reasons.

APIs

APIs Engineering Scripts Management

Automating model customization in Amazon Bedrock with AWS Step Functions workflow

AWS Machine Learning

JULY 11, 2024

The workflow invokes the Amazon Bedrock CreateModelCustomizationJob API synchronously to fine-tune the base model with the training data from the S3 bucket and the passed-in hyperparameters. An AWS Lambda function is called to evaluate the quality of the summarization done by the custom model and the base model using the BERTScore metric.

APIs

APIs Accountability Enterprise Best practices

AWS empowers sales teams using generative AI solution built on Amazon Bedrock

AWS Machine Learning

AUGUST 26, 2024

From the period of September 2023 to March 2024, sellers leveraging GenAI Account Summaries saw a 4.9% This involves benchmarking new models against our current selections across various metrics, running A/B tests, and gradually incorporating high-performing models into our production pipeline. The impact goes beyond just efficiency.

Sales

Sales Accountability Feedback Metrics

Transform customer engagement with no-code LLM fine-tuning using Amazon SageMaker Canvas and SageMaker JumpStart

AWS Machine Learning

MAY 10, 2024

Validation loss and validation perplexity – Similar to the training metrics, but measured during the validation stage. On an order of magnitude, the two models performed about the same along these metrics on the provided data. Training perplexity – Measures the model’s surprise when encountering text during training.

APIs

APIs Metrics SaaS Accountability

How Twitch used agentic workflow with RAG on Amazon Bedrock to supercharge ad sales

AWS Machine Learning

DECEMBER 13, 2024

In early 2024, Amazon launched a major push to harness the power of Twitch for advertisers globally. It evaluates each user query to determine the appropriate course of action, whether refusing to answer off-topic queries, tapping into the LLM, or invoking APIs and data sources such as the vector database.

Sales

Sales Advertising Engineering APIs

Unleashing the power of generative AI: Verisk’s journey to an Instant Insight Engine for enhanced customer support

AWS Machine Learning

MAY 9, 2024

FAST has earned a fourth consecutive leader ranking in the 2024 ISG Provider Lens report for its seamless integration with Verisk’s data, analytics, and claims tools. Through some slick prompt engineering and using Claude’s latest capabilities to invoke APIs, Verisk seamlessly accessed their database to procure real-time information.

Engineering

Engineering Customer Support APIs Government

Mixtral 8x22B is now available in Amazon SageMaker JumpStart

AWS Machine Learning

MAY 17, 2024

SageMaker has seamless logging, monitoring, and auditing enabled for deployed models with native integrations with services like AWS CloudTrail for logging and monitoring to provide insights into API calls and Amazon CloudWatch to collect metrics, logs, and event data to provide information into the model’s resource utilization.

Benchmark

Benchmark APIs Personalization Enterprise

Top 5 Call Center Quality Assurance Software for 2024

Balto

AUGUST 23, 2024

Here’s the good news: in 2024, we have a wide array of capable call center quality assurance software solutions that can streamline QA processes, automate manual tasks, and deliver insightful reports to support decision-making. The post Top 5 Call Center Quality Assurance Software for 2024 appeared first on Balto.

Call Center

Call Center Coaching Surveys Contact Center

Get started with Amazon Titan Text Embeddings V2: A new state-of-the-art embeddings model on Amazon Bedrock

AWS Machine Learning

MAY 2, 2024

As new embedding models are released with incremental quality improvements, organizations must weigh the potential benefits against the associated costs of upgrading, considering factors like computational resources, data reprocessing, integration efforts, and projected performance gains impacting business metrics.

Benchmark

Benchmark Metrics Enterprise APIs

Anthropic Claude 3.5 Sonnet ranks number 1 for business and finance in S&P AI Benchmarks by Kensho

AWS Machine Learning

JULY 9, 2024

These applications require the LLMs to have requisite domain knowledge and be able to reason about numeric data to calculate metrics and extract insights. Anthropic Claude 3.5 Sonnet is currently ranked number one (as of July 2024), demonstrating Anthropic’s strengths in the business and finance domain.

Finance

Finance Benchmark industry standards Accountability

The executive’s guide to generative AI for sustainability

AWS Machine Learning

APRIL 22, 2024

Figure 1: Examples of generative AI for sustainability use cases across the value chain. According to KPMG’s 2024 ESG Organization Survey, investment in ESG capabilities is another top priority for executives as organizations face increasing regulatory pressure to disclose information about ESG impacts, risks, and opportunities.

Best practices

Best practices Benchmark Transportation Engineering

Your guide to generative AI and ML at AWS re:Invent 2023

AWS Machine Learning

NOVEMBER 22, 2023

Also learn how prompts can be integrated with your architecture and how to use API parameters for tuning the model parameters using Amazon Bedrock. See demos on how to build analytics dashboards and integrations between LLMs and Amazon QuickSight to visualize your key metrics. Reserve your seat now!

Engineering

Engineering Best practices APIs Government

Fine-tune Meta Llama 3.1 models using torchtune on Amazon SageMaker

AWS Machine Learning

SEPTEMBER 19, 2024

Users initiate the process by calling the SageMaker control plane through APIs or command line interface (CLI) or using the SageMaker SDK for each individual step. Create a Weights & Biases API key to access the Weights & Biases dashboard for logging and monitoring Request a SageMaker service quota for 1x ml.p4d.24xlarge

Engineering

Engineering APIs Scripts Metrics

Boost inference performance for Mixtral and Llama 2 models with new Amazon SageMaker containers

AWS Machine Learning

APRIL 8, 2024

In January 2024, Amazon SageMaker launched a new version (0.26.0) You can enable your desired strategy (shard-over-heads, for example) with the following code: option.group_query_attention=shard-over-heads Additionally, the new implementation of NeuronX DLC introduces a cache API for TransformerNeuronX that enables access to the KV cache.

Engineering

Engineering Calibration APIs Enterprise

Philips accelerates development of AI-enabled healthcare solutions with an MLOps platform built on Amazon SageMaker

AWS Machine Learning

NOVEMBER 16, 2023

Users from several business units were trained and onboarded to the platform, and that number is expected to grow in 2024. Another important metric is the efficiency for data science users. The following outcomes were achieved: User adoption is one of the key leading indicators for Philips.

Healthcare

Healthcare Government Engineering APIs

Patient Engagement Mobile Apps: On Guard of Health

CSM Magazine

MARCH 5, 2024

from 2024, fueled by several trends, including the use of smartphones. Enhanced self-management: the patient mobile app can help patients track their symptoms, adherence to treatment, and other health metrics. Health data tracking: integrate with wearables and devices to track vital signs, medications, and other health metrics.

Healthcare

Healthcare Education Gamification Consulting

A review of purpose-built accelerators for financial services

AWS Machine Learning

SEPTEMBER 11, 2024

In terms of resulting speedups, the approximate order is programming hardware, then programming against PBA APIs, then programming in an unmanaged language such as C++, then a managed language such as Python. In March 2024, AWS announced it will offer the new NVIDIA Blackwell platform, featuring the new GB200 Grace Blackwell chip.

Benchmark

Benchmark Banking Analytics Big data

How Clearwater Analytics is revolutionizing investment management with generative AI and Amazon SageMaker JumpStart

AWS Machine Learning

DECEMBER 13, 2024

With SageMaker JumpStart, you can evaluate, compare, and select foundation models (FMs) quickly based on predefined quality and responsibility metrics to perform tasks such as article summarization and image generation. Crystal shares CWIC’s core functionalities but benefits from broader data sources and API access.

Analytics

Analytics Management Accountability Engineering

A Comprehensive Guide to Virtual Call Center and Contact Centers

Hodusoft

MAY 10, 2024

As per recent stats, the number of remote call center agents is expected to grow by 60 percent from 2022 to 2024. Consider APIs and third-party integrations available to extend functionality as needed. It just takes one natural disaster or pandemic to stop the entire call center operations for quite some time.

virtual call center

virtual call center Call Center Contact Center call center software

7 Ways to Automate Customer Service (Without Sacrificing Quality)

JivoChat

APRIL 13, 2021

Chatbots will drive $142 billion in consumer spending by 2024, a meteoric surge from $2.8 billion in 2019. Live chat could nudge them to decide based on a current performance metric that might be relevant to their needs. You can even build custom automated support solutions with an API. Source: JivoChat.

Customer Service

Customer Service Chatbots Self service Abandon rate

Safeguard a generative AI travel agent with prompt engineering and Guardrails for Amazon Bedrock

AWS Machine Learning

JUNE 18, 2024

Guardrail objectives At the core of the architecture is Amazon Bedrock serving foundation models (FMs) with an API interface; the FM powers the conversational capabilities of the virtual agent. Designing a personalized CloudWatch dashboard involves the use of metric filters to extract targeted insights from logs. Virginia) AWS Region.

Engineering

Engineering Virtual Agent Finance Government

Amazon Bedrock Custom Model Import now generally available

AWS Machine Learning

OCTOBER 21, 2024

This feature empowers customers to import and use their customized models alongside existing foundation models (FMs) through a single, unified API. Having a unified developer experience when accessing custom models or base models through Amazon Bedrock’s API. Ease of deployment through a fully managed, serverless, service. 2, 3, 3.1,

APIs

APIs Scripts Finance Real estate

A guide to Amazon Bedrock Model Distillation (preview)

AWS Machine Learning

DECEMBER 4, 2024

In a production environment, you continue to use the existing Amazon Bedrock Inference APIs, such as the InvokeModel or Converse API, and turn on invocation logs that store model input data (prompts) and model output data (responses). The record can optionally include a system prompt that indicates the role assigned to the model.

APIs

APIs Metrics Healthcare Chatbots

Customize Amazon Nova models to improve tool usage

AWS Machine Learning

APRIL 28, 2025

However, as industries require more adaptive, decision-making AI, integrating tools and external APIs has become essential. Expanding LLM capabilities with tool use LLMs excel at natural language tasks but become significantly more powerful with tool integration, such as APIs and computational frameworks.

APIs

APIs Benchmark Scripts Metrics

Reducing hallucinations in large language models with custom intervention using Amazon Bedrock Agents

AWS Machine Learning

NOVEMBER 26, 2024

Amazon Bedrock Guardrails offer hallucination detection with contextual grounding checks, which can be seamlessly applied using Amazon Bedrock APIs (such as Converse or InvokeModel) or embedded into workflows. The custom hallucination detector uses RAGAS metrics, which are generated using a CSV file containing question-answer pairs.

APIs

APIs Chatbots Metrics Finance

Parameta accelerates client email resolution with Amazon Bedrock Flows

AWS Machine Learning

JANUARY 7, 2025

Amazon Bedrock Flows provide a powerful, low-code solution for creating complex generative AI workflows with an intuitive visual interface and with a set of APIs in the Amazon Bedrock SDK. Orchestration Amazon Bedrock Flows serves as the central orchestrator, managing the entire email processing pipeline.

Technical Support

Technical Support APIs Government Analytics

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

AWS Machine Learning

NOVEMBER 22, 2024

Implementation details We spin up the cluster by calling the SageMaker control plane through APIs or the AWS Command Line Interface (AWS CLI) or using the SageMaker AWS SDK. (Optional) Create a Weights & Biases API key to access the Weights & Biases dashboard for logging and monitoring. on Hugging Face.

Scripts

Scripts APIs Construction Engineering

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

AWS Machine Learning

APRIL 30, 2025

Agent function calling represents a critical capability for modern AI applications, allowing models to interact with external tools, databases, and APIs by accurately determining when and how to invoke specific functions. Evaluation metric We use abstract syntax tree (AST) to evaluate the function calling performance.

APIs

APIs Construction Engineering Entertainment

DXC transforms data exploration for their oil and gas customers with LLM-powered tools

AWS Machine Learning

NOVEMBER 18, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies such as AI21 Labs, Anthropic, Cohere, Meta, Mistral, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

APIs

APIs Surveys Chatbots Construction
