Using its enterprise software, FloTorch conducted an extensive comparison between Amazon Nova models and OpenAI's GPT-4o models with the Comprehensive Retrieval Augmented Generation (CRAG) benchmark dataset. The following table provides example questions with their domain and question type.
Call on experienced managers for guidance in setting up benchmarks. “Experienced call center managers are helpful in setting up the initial performance benchmarks for a new outbound call center program. These benchmarks are, at first, estimated based on the past performance of similar outbound call center projects.
This sets a new benchmark for state-of-the-art performance in critical medical diagnostic tasks, from identifying cancerous cells to detecting genetic abnormalities in tumors. Through practical examples, we show you how to adapt this FM to these specific use cases while optimizing computational resources.
Chat scripts are a handy tool, especially for chat agents who find themselves often responding to related customer inquiries. Chat scripts, or canned responses, help companies ensure quality control, implement precise language for optimal results, and increase customer happiness. Not all companies implement chat scripts with success.
For the 45 models we benchmarked, there is a 1.35x latency improvement (geomean across the 45 models). For the 33 models we benchmarked, there is around a 2x performance improvement (geomean across the 33 models). We benchmarked the models using the scripts from the TorchBench repo.
Example: Campaign A has a high call volume, but campaign B has fewer calls and the agents assigned to campaign B are not busy. Bill Dettering is the CEO and Founder of Zingtree, a SaaS solution for building interactive decision trees and agent scripts for contact centers (and many other industries).
For example, when tested on the MT-Bench dataset, the paper reports that Medusa-2 (the second version of Medusa) speeds up inference time by 2.8 times. For example, you can still use an ml.g5.4xlarge instance with 24 GB of GPU memory to host your 7-billion-parameter Llama or Mistral model with extra Medusa heads.
Drexler said, “I’m looking for best practices constantly.” He gives the example of Apple’s products, which are noted for their beautiful design, functionality, and aesthetics. Performance in a contact center refers to how effectively agents manage calls, resolve issues, and meet established benchmarks.
That’s where a customer service script comes into play. Call handling scripts outline the customer’s journey and prompt the person representing your company to create a memorable and consistent interaction with a client, customer, or business associate. One simple example: “Thank you for calling (name of company).
In our example, the organization is willing to approve a model for deployment if it passes their checks for model quality, bias, and feature importance prior to deployment. For this example, we provide a centralized model. You can create and run the pipeline by following the example provided in the following GitHub repository.
We also showcase a real-world example for predicting the root cause category for support cases. For the use case of labeling the support root cause categories, it's often harder to source examples for categories such as Software Defect, Feature Request, and Documentation Improvement for labeling than it is for Customer Education.
The prospect of fine-tuning open source multimodal models like LLaVA is highly appealing because of their cost effectiveness, scalability, and impressive performance on multimodal benchmarks. For example, instead of simply asking the model to describe the image, ask specific questions about the image and its content.
When you select the option to use the SDK, you will see example code that you can use in the notebook editor of your choice in SageMaker Studio. In this section, we provide some example prompts and sample output. Code generation DBRX models demonstrate benchmarked strengths for coding tasks.
For example, if a customer is waiting in line to speak with an agent for 30 minutes, that number isn't figured into the final AHT. Setting an Average Handle Time Benchmark: What is a Good AHT? For example, a caller may be asked “On a scale of one to ten, how likely are you to recommend to a friend?”
For example, for PyTorch this would be a model.pth. Note that your model artifacts also include an inference script for preprocessing and postprocessing. If you don’t provide an inference script, the default inference handlers for the container you have chosen will be implemented.
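As a concrete illustration, an inference script for the SageMaker PyTorch container typically defines model_fn, input_fn, predict_fn, and output_fn handlers. The sketch below is hypothetical: the file name, the TorchScript artifact assumption, and the JSON request format are illustrations rather than values from the original post.

# inference.py -- minimal sketch of custom pre/post-processing handlers (assumptions noted above)
import json
import os
import torch

def model_fn(model_dir):
    # Load the packaged artifact; assumes model.pth was saved with torch.jit.save
    model = torch.jit.load(os.path.join(model_dir, "model.pth"), map_location="cpu")
    model.eval()
    return model

def input_fn(request_body, content_type="application/json"):
    # Preprocessing: turn the JSON payload into a tensor
    data = json.loads(request_body)
    return torch.tensor(data["inputs"], dtype=torch.float32)

def predict_fn(inputs, model):
    # Run inference without tracking gradients
    with torch.no_grad():
        return model(inputs)

def output_fn(predictions, accept="application/json"):
    # Postprocessing: serialize predictions back to JSON
    return json.dumps({"predictions": predictions.tolist()})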
It would also be helpful to give new hires information on which KPIs managers will assess, how these are tied to performance evaluations, and practical tips on how to hit their KPI benchmarks. Allow them to listen to recordings and also provide online scripts. Choose recordings that will help you demonstrate a specific point (i.e.,
The following code shows an example of how a query is configured within the config.yml file. For more information on the TPC-H data, its database entities, relationships, and characteristics, refer to TPC Benchmark H. The sklearn_processor is used in the ProcessingStep to run the scikit-learn script that preprocesses data.
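For context, a ProcessingStep that runs a scikit-learn preprocessing script in a SageMaker pipeline is usually wired up roughly like the sketch below; the IAM role, S3 input location, instance type, and preprocess.py script name are placeholders rather than values from the original post.

# Hypothetical sketch of a scikit-learn processing step in a SageMaker pipeline
from sagemaker.processing import ProcessingInput, ProcessingOutput
from sagemaker.sklearn.processing import SKLearnProcessor
from sagemaker.workflow.steps import ProcessingStep

role = "arn:aws:iam::111122223333:role/SageMakerExecutionRole"  # placeholder role
input_s3_uri = "s3://my-bucket/tpch/raw/"                       # placeholder input location

sklearn_processor = SKLearnProcessor(
    framework_version="1.2-1",
    role=role,
    instance_type="ml.m5.xlarge",
    instance_count=1,
)

step_process = ProcessingStep(
    name="PreprocessData",
    processor=sklearn_processor,
    inputs=[ProcessingInput(source=input_s3_uri, destination="/opt/ml/processing/input")],
    outputs=[ProcessingOutput(output_name="train", source="/opt/ml/processing/train")],
    code="preprocess.py",  # the scikit-learn script that preprocesses the data
)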
For example, a B2C customer might prioritize user experience, while a B2B client might emphasize return on investment. SaaS success outcomes can be defined in terms of measurable digital benchmarks. For example, a customer who is slow to complete the onboarding process can be sent an email prompt with a link to tutorial tips.
We have seen a similar trend in the price-performance advantage for other workloads on Graviton, for example video encoding with FFmpeg.
# Install the wheels and set the previously mentioned environment variables
# Clone PyTorch benchmark repo
git clone [link]
# Setup Resnet50 benchmark
cd benchmark
python3 install.py
You will also find a Deploy button, which takes you to a landing page where you can test inference with an example payload. To deploy Gemma with the SageMaker Python SDK, you can find the code showing the deployment of Gemma on JumpStart and an example of how to use the deployed model in this GitHub notebook.
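For reference, deploying a JumpStart model with the SageMaker Python SDK generally follows the pattern sketched below; the exact Gemma model_id and the request payload are assumptions, so check the JumpStart catalog and model card before using them.

# Hypothetical sketch of deploying a JumpStart-hosted Gemma model
from sagemaker.jumpstart.model import JumpStartModel

model = JumpStartModel(model_id="huggingface-llm-gemma-7b-instruct")  # model_id is an assumption
predictor = model.deploy(accept_eula=True)  # Gemma requires accepting the license EULA

payload = {
    "inputs": "Write a short haiku about benchmarking.",
    "parameters": {"max_new_tokens": 64},
}
print(predictor.predict(payload))

predictor.delete_endpoint()  # clean up when done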
For example, establishing expected follow-up time and communications format when an IT department responds to a technical support call. In such cases, standards provide a useful benchmark, especially for new employees learning how to do the job. Consider this example. Standards can also support your brand.
If you're running this code using an Amazon SageMaker notebook instance, edit the IAM role that's attached to the notebook (for example, AmazonSageMaker-ExecutionRole-XXX) instead of creating a new role. The following table shows an example comparison against the fine-tuned model with the original 500 examples (76.4% preference).
In the following example figure, we show INT8 inference performance in C6i for a BERT-base model. Refer to the appendix for instance details and benchmark data. Use the supplied Python scripts for quantization. Run the provided Python test scripts to invoke the SageMaker endpoint for both INT8 and FP32 versions.
Our benchmarks show up to a 46% price-performance benefit after enabling heterogeneous clusters in a CPU-bound TensorFlow computer vision model training job. Performance benchmark results: for example, consider a powerful GPU instance type such as ml.p4d.24xlarge. For example, if you’re training the model on ml.trn1.32xlarge or ml.p4d.24xlarge instances.
The team’s early benchmarking results show 7.3. The baseline model used in this benchmarking is a multi-layer perceptron neural network with seven dense fully connected layers and over 200 parameters. The following table summarizes the benchmarking results on ml.p3.16xlarge SageMaker training instances.
For example, a product that has a description that includes words such as “long sleeve” and “cotton neck” will be returned if a consumer is looking for a “long sleeve cotton shirt.” To fine-tune our model, we need to convert our structured examples into a collection of question and answer pairs. We prepared entrypoint_vqa_finetuning.py
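A simple, hypothetical way to do that conversion is sketched below; the product fields, the question template, and the output file name are purely illustrative.

# Convert structured product records into question-answer pairs for fine-tuning
import json

products = [
    {"title": "Classic Tee", "description": "long sleeve cotton neck shirt in navy"},
]

qa_pairs = []
for product in products:
    qa_pairs.append({
        "question": f"Describe the {product['title']}, including its sleeves and material.",
        "answer": product["description"],
    })

# Write one JSON object per line, a common layout for fine-tuning datasets
with open("train.jsonl", "w") as f:
    for pair in qa_pairs:
        f.write(json.dumps(pair) + "\n")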
The Neuron SDK is the software stack that provides the driver, compiler, runtime, framework integration (for example, PyTorch Neuron), and user tools that allow you to access the benefits of the Trainium accelerators. For example, the eksctl cluster configuration starts with apiVersion: eksctl.io/v1alpha5. An ECR repository is used to store the training container images.
We have examples available for the Stable Diffusion 2.1 model on the GitHub repo. This notebook presents an end-to-end example of how to compile a Stable Diffusion model, save the compiled Neuron models, and load them into the runtime for inference. The emb tensor is used as example input for the torch_neuronx.trace function.
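A stripped-down sketch of the torch_neuronx.trace pattern is shown below; the placeholder module and tensor shape stand in for the actual Stable Diffusion components and are assumptions.

# Minimal sketch of compiling a module for Neuron with torch_neuronx.trace
import torch
import torch_neuronx

model = torch.nn.Linear(768, 768).eval()  # stand-in module, not the real Stable Diffusion block
emb = torch.randn(1, 77, 768)             # example input tensor; shape is an assumption

# Compile the module for Neuron using the example input
neuron_model = torch_neuronx.trace(model, emb)

# Save the compiled model and load it back into the runtime for inference
torch.jit.save(neuron_model, "model_neuron.pt")
loaded = torch.jit.load("model_neuron.pt")
output = loaded(emb)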
We’ll cover fine-tuning your foundation models, evaluating recent techniques, and understanding how to run these with your scripts and models. All of the example notebooks and supporting code will ship in a public repository, which you can use to step through on your own. Want to jump right into the code?
The code to invoke the pipeline script is available in the Studio notebooks, and we can change the hyperparameters and input/output when invoking the pipeline. This is quite different from our earlier method where we had all the parameters hard coded within the scripts and all the processes were inextricably linked.
In this post, we use a Hugging Face BERT-Large model pre-training workload as a simple example to explain how to use Trn1 UltraClusters. The following diagram shows an example. You can invoke neuron-top during the training script run to inspect NeuronCore utilization at each node. Complete instructions can be found on GitHub.
We cover computer vision (CV), natural language processing (NLP), classification, and ranking scenarios for models, and we benchmark on ml.c6g, ml.c7g, ml.c5, and ml.c6i SageMaker instances. You can use the sample notebook to run the benchmarks and reproduce the results. Create an endpoint configuration.
We first benchmark the performance of our model on a single instance to identify the TPS it can handle within our acceptable latency requirements. The entire set of code for the example is available in the following GitHub repository. Overview of solution. For CPUUtilization, you may see percentages above 100% at first in CloudWatch.
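One rough way to measure single-instance throughput and latency is a timing loop around invoke_endpoint, as in the sketch below; the endpoint name and payload are placeholders for whatever model you deployed.

# Rough single-endpoint TPS/latency check using boto3
import json
import time
import boto3

runtime = boto3.client("sagemaker-runtime")
endpoint_name = "my-endpoint"  # placeholder endpoint name
payload = json.dumps({"inputs": "sample request"})

latencies = []
start = time.time()
for _ in range(100):
    t0 = time.time()
    runtime.invoke_endpoint(
        EndpointName=endpoint_name,
        ContentType="application/json",
        Body=payload,
    )
    latencies.append(time.time() - t0)
elapsed = time.time() - start

latencies.sort()
print(f"TPS: {len(latencies) / elapsed:.1f}")
print(f"p95 latency: {latencies[int(0.95 * len(latencies))] * 1000:.0f} ms")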
It can be applied even when there are only a few available training examples, or even none at all. AlexaTM 20B has shown competitive performance on common natural language processing (NLP) benchmarks and tasks, such as machine translation, data generation and summarization. This is known as in-context learning.
In this post, we show a high-level overview of how SMDDP works, how you can enable SMDDP in your Amazon SageMaker training scripts, and the performance improvements you can expect. You can find more SMDDP examples with sharded data parallel training in the Amazon SageMaker Examples GitHub repository.
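As a rough illustration, turning on SMDDP from the SageMaker Python SDK is typically a matter of passing a distribution setting to the PyTorch estimator, as in the sketch below; the entry point, role, data location, and framework versions are assumptions, not values from the post.

# Hypothetical sketch of launching a training job with SMDDP enabled
from sagemaker.pytorch import PyTorch

estimator = PyTorch(
    entry_point="train.py",                                        # your existing training script
    role="arn:aws:iam::111122223333:role/SageMakerExecutionRole",  # placeholder role
    instance_type="ml.p4d.24xlarge",
    instance_count=2,
    framework_version="2.0.1",
    py_version="py310",
    # Enable the SageMaker distributed data parallel (SMDDP) library
    distribution={"smdistributed": {"dataparallel": {"enabled": True}}},
)

estimator.fit({"training": "s3://my-bucket/train/"})               # placeholder dataset location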
Read Email Response Times: Benchmarks and Tips for Support for practical advice. It was the bots that seemed to offer help but failed to deliver it, like this Flowxo example: Be the best bot you can be: Look for alternatives to chat bots that will still meet your customer service goals. You’ll spot the rough edges more easily.
For example, 8 V100 GPUs (32 GB each) are sufficient to hold a replica of the model states for a 10B-parameter model, which needs about 200 GB of memory when training with the Adam optimizer using mixed precision. To get started, follow Modify a PyTorch Training Script to adapt SMP's APIs in your training script. Benchmarking performance.
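The kind of script changes involved look roughly like the following sketch of the SMP-style PyTorch API, using a toy model and synthetic data as stand-ins; treat it as an outline rather than the library's exact usage for your version.

# Hedged sketch of adapting a PyTorch training loop to the SMP APIs
import torch
import smdistributed.modelparallel.torch as smp

smp.init()  # initialize the model parallel runtime

# Toy model and synthetic data stand in for your real network and data loader
model = torch.nn.Sequential(torch.nn.Linear(32, 64), torch.nn.ReLU(), torch.nn.Linear(64, 10))
optimizer = torch.optim.Adam(model.parameters())

# Wrap the model and optimizer so SMP can partition and coordinate them
model = smp.DistributedModel(model)
optimizer = smp.DistributedOptimizer(optimizer)

@smp.step
def train_step(model, inputs, targets):
    outputs = model(inputs)
    loss = torch.nn.functional.cross_entropy(outputs, targets)
    model.backward(loss)  # SMP uses model.backward instead of loss.backward
    return loss

for _ in range(10):
    inputs = torch.randn(8, 32)
    targets = torch.randint(0, 10, (8,))
    optimizer.zero_grad()
    loss = train_step(model, inputs, targets)
    optimizer.step()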
The former question addresses model selection across model architectures, while the latter question concerns benchmarking trained models against a test dataset. This post provides details on how to implement large-scale Amazon SageMaker benchmarking and model selection tasks. For example: swin-large-patch4-window7-224 (195.4M parameters), efficientnet-b5 (29.0M parameters).
Furthermore, as clusters scale to larger sizes (for example, more than 32 nodes), they require built-in resiliency mechanisms such as automated faulty node detection and replacement to improve cluster goodput and maintain efficient operations. For example, when working with a smaller backbone model like Stable Diffusion 1.5,
For a university or college, for example, intents might include the following. For example, a student wishing to pay fees might say “I’d like to pay my bill” or ask, “How can I pay my tuition online?” Over time, your chatbot will learn to correctly identify new examples of each intent outside of the original programming.
To achieve this multi-user environment, you can take advantage of Linux’s user and group mechanism and statically create multiple users on each instance through lifecycle scripts. For Directory DNS name, enter your preferred directory DNS name (for example, hyperpod.abc123.com). For Directory type, select AWS Managed Microsoft AD.
One example is an online retailer who deploys a large number of inference endpoints for text summarization, product catalog classification, and product feedback sentiment classification. We use the Recognizing Textual Entailment dataset from the GLUE benchmarking suite. Choose System terminal under Utilities and files.
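If you want to pull that dataset locally, the Hugging Face datasets library exposes it under the GLUE benchmark, as in this small sketch (assuming the datasets package is installed).

# Load the Recognizing Textual Entailment (RTE) task from the GLUE benchmark
from datasets import load_dataset

rte = load_dataset("glue", "rte")
print(rte["train"][0])        # one premise/hypothesis pair with its label
print(rte["train"].features)  # label names: entailment / not_entailment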
See the example below. Flip the script on your results and use that as a motivator. After inbound sales calls, for example, prospects can share how satisfied they were with the conversation. For example, a journey map may identify customers feel their invoices are too long and hard to understand. Ways to use CSAT .
The following figure shows a performance benchmark of fine-tuning a RoBERTa model on Amazon EC2 p4d.24xlarge. Refer to inference with AWS Graviton processors for details on AWS Graviton-based instance inference performance benchmarks for PyTorch 2.0. Run your DLC container with a model training script to fine-tune the RoBERTa model.