Using its enterprise software, FloTorch conducted an extensive comparison between Amazon Nova models and OpenAI's GPT-4o models with the Comprehensive Retrieval Augmented Generation (CRAG) benchmark dataset. FloTorch used queries from CRAG and their ground truth answers to create a subset benchmark dataset.
Based on customer feedback for the experimental APIs we released in GraphStorm 0.2, GraphStorm 0.3 introduces refactored graph ML pipeline APIs. In addition, GraphStorm 0.3 adds new APIs to customize GraphStorm pipelines: you now only need 12 lines of code to implement a custom node classification training loop.
Pixtral 12B, Mistral's inaugural VLM, delivers robust performance across a range of benchmarks, surpassing other open models and rivaling larger counterparts, according to Mistral's evaluation. Pixtral 12B is trained to understand both natural images and documents, achieving 52.5% on the MMMU reasoning benchmark.
Anthropic Claude 3.5 Sonnet currently ranks at the top of S&P AI Benchmarks by Kensho, which assesses large language models (LLMs) for finance and business. Kensho is the AI Innovation Hub for S&P Global. Public leaderboards carry caveats; for example, there could be leakage of benchmark datasets' questions and answers into training data.
This post explores these relationships via a comprehensive benchmarking of LLMs available in Amazon SageMaker JumpStart, including Llama 2, Falcon, and Mistral variants. We provide theoretical principles on how accelerator specifications impact LLM benchmarking. Additionally, models are fully sharded on the supported instance.
Consider benchmarking your user experience to find the best latency for your use case, considering that most humans can't read faster than 225 words per minute and that extremely fast responses can therefore hinder user experience. In such scenarios, you want to optimize for time to first token (TTFT). Users prefer accurate responses over quick but less reliable ones.
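As a rough sanity check (the ~1.3 tokens-per-word ratio is a common rule of thumb, not a figure from the post), a few lines of Python show why streaming much faster than roughly 5 tokens per second yields diminishing returns for a human reader:

```python
# Back-of-envelope reading-speed check.
WORDS_PER_MINUTE = 225   # typical adult reading speed cited above
TOKENS_PER_WORD = 1.3    # assumption: rough English tokens-per-word ratio

tokens_per_second = WORDS_PER_MINUTE * TOKENS_PER_WORD / 60
print(f"~{tokens_per_second:.1f} tokens/s")  # ~4.9 tokens/s
```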
In this post, we describe the enhancements to the forecasting capabilities of SageMaker Canvas and guide you on using its user interface (UI) and AutoML APIs for time-series forecasting. While the SageMaker Canvas UI offers a code-free visual interface, the APIs empower developers to interact with these features programmatically.
Amazon Bedrock, a fully managed service offering high-performing foundation models from leading AI companies through a single API, has recently introduced two significant evaluation capabilities: LLM-as-a-judge under Amazon Bedrock Model Evaluation and RAG evaluation for Amazon Bedrock Knowledge Bases.
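The excerpt originally trailed off into a stray code fragment that appears to be the tail of a job-naming f-string from the post's companion code. A hedged reconstruction of the pattern (the variable names and model IDs are guesses, not recovered from the source):

```python
from datetime import datetime

# Hypothetical model identifiers; the upstream variables in the original
# snippet are not recoverable, so these are assumptions.
generator_model = "anthropic.claude-3-5-sonnet-20240620-v1:0"
evaluator_model = "anthropic.claude-3-haiku-20240307-v1:0"

# Build a unique, timestamped evaluation job name from the model IDs.
job_name = (
    f"llm-judge-{generator_model.split('.')[0]}"
    f"-{evaluator_model.split('.')[0]}"
    f"-{datetime.now().strftime('%Y-%m-%d-%H-%M-%S')}"
)
print(job_name)
```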
Amazon Bedrock is a fully managed service that offers a choice of high-performing Foundation Models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Stability AI, and Amazon via a single API, along with a broad set of capabilities you need to build generative AI applications with security, privacy, and responsible AI.
According to Mistral, the instruction-tuned version of the model achieves over 81% accuracy on Massive Multitask Language Understanding (MMLU) with 150 tokens per second latency, making it currently the most efficient model in its category. It doesn't support the Converse API or other Amazon Bedrock tooling.
These include metrics such as ROUGE or cosine similarity for text similarity, and specific benchmarks for assessing toxicity (Detoxify), prompt stereotyping (cross-entropy loss), or factual knowledge (HELM, LAMA). Refer to Getting started with the API to set up your environment to make Amazon Bedrock requests through the AWS API.
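For the text-similarity metrics mentioned above, cosine similarity is simple to compute directly; a minimal sketch with toy vectors standing in for real sentence embeddings:

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine of the angle between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy 3-dimensional vectors; real embeddings have hundreds of dimensions.
print(cosine_similarity(np.array([1.0, 0.0, 1.0]),
                        np.array([1.0, 1.0, 0.0])))  # 0.5
```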
This integration provides a powerful multilingual model that excels in reasoning benchmarks, outperforming similarly sized models and demonstrating similar performance against larger ones such as Llama-2 34B on benchmarks like XCOPA and TyDiQA, highlighting its versatility and efficiency.
Amazon Bedrock is a fully managed service that makes FMs from leading AI startups and Amazon available through an API, so you can choose from a wide range of FMs to find the model that is best suited for your use case. Solution overview The solution comprises two main steps: Generate synthetic data using the Amazon Bedrock InvokeModel API.
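A minimal sketch of the synthetic data generation step via the InvokeModel API, assuming an Anthropic Claude model and its Messages request format (the prompt and model ID are illustrative, not taken from the post):

```python
import json
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

# Hypothetical generation prompt; swap in whichever FM fits your use case.
body = json.dumps({
    "anthropic_version": "bedrock-2023-05-31",
    "max_tokens": 512,
    "messages": [{
        "role": "user",
        "content": "Generate 5 synthetic customer-support questions as a JSON list.",
    }],
})

response = bedrock.invoke_model(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",
    body=body,
)
# The response body is a stream of JSON; the generated text sits in content[0].
print(json.loads(response["body"].read())["content"][0]["text"])
```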
An alternative approach to routing is to use the native tool use capability (also known as function calling) available within the Bedrock Converse API. In this scenario, each category or data source would be defined as a ‘tool’ within the API, enabling the model to select and use these tools as needed.
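A sketch of this routing pattern with the Converse API, where each data source is declared as a tool the model may select (the tool name and schema below are invented for illustration):

```python
import boto3

bedrock = boto3.client("bedrock-runtime")

# One 'tool' per routing target; the model picks which to call.
tool_config = {
    "tools": [{
        "toolSpec": {
            "name": "search_orders",
            "description": "Look up order status in the orders database.",
            "inputSchema": {"json": {
                "type": "object",
                "properties": {"order_id": {"type": "string"}},
                "required": ["order_id"],
            }},
        }
    }]
}

response = bedrock.converse(
    modelId="anthropic.claude-3-5-sonnet-20240620-v1:0",
    messages=[{"role": "user", "content": [{"text": "Where is order 42?"}]}],
    toolConfig=tool_config,
)

# If the model routed to a tool, its choice arrives as a toolUse content block.
for block in response["output"]["message"]["content"]:
    if "toolUse" in block:
        print(block["toolUse"]["name"], block["toolUse"]["input"])
```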
Seamlessly bring your fine-tuned models into a fully managed, serverless environment, and use the Amazon Bedrock standardized API and features like Amazon Bedrock Agents and Amazon Bedrock Knowledge Bases to accelerate generative AI application development.
A common way to select an embedding model (or any model) is to look at public benchmarks; an accepted benchmark for measuring embedding quality is the MTEB leaderboard. The Massive Text Embedding Benchmark (MTEB) evaluates text embedding models across a wide range of tasks and datasets.
We benchmarked 45 models using the scripts from the TorchBench repo. For the 45 models we benchmarked, there is a 1.35x latency improvement (geomean across the 45 models). For the 33 models we benchmarked, there is around a 2x performance improvement (geomean across the 33 models).
You can also either use the SageMaker Canvas UI, which provides a visual interface for building and deploying models without needing to write any code or have any ML expertise, or use its automated machine learning (AutoML) APIs for programmatic interactions.
These SageMaker endpoints are consumed in the Amplify React application through Amazon API Gateway and AWS Lambda functions. To protect the application and APIs from inadvertent access, Amazon Cognito is integrated into Amplify React, API Gateway, and Lambda functions. You access the React application from your computer.
They enable applications requiring very low latency or local data processing using familiar APIs and tool sets. Through comparative benchmarking tests, we illustrate how deploying FMs in Local Zones closer to end users can significantly reduce latency, a critical factor for real-time applications such as conversational AI assistants.
Acting as a model hub, JumpStart provided a large selection of foundation models and the team quickly ran their benchmarks on candidate models. Regarding the inference, customers using Amazon Ads now have a new API to receive these generated images. The Amazon API Gateway receives the PUT request (step 1).
SageMaker makes it easy to deploy models into production directly through API calls to the service. It’s a low-level API available for Java, C++, Go, JavaScript, Node.js, PHP, Ruby, and Python.
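A sketch of this direct-API deployment with boto3, following the usual three-call flow (model, endpoint config, endpoint); every name, ARN, image URI, and S3 path below is a placeholder:

```python
import boto3

sm = boto3.client("sagemaker")

# 1. Register the model artifact and its inference container.
sm.create_model(
    ModelName="demo-model",
    ExecutionRoleArn="arn:aws:iam::123456789012:role/SageMakerRole",
    PrimaryContainer={
        "Image": "<inference-image-uri>",
        "ModelDataUrl": "s3://my-bucket/model.tar.gz",
    },
)

# 2. Describe the fleet that will serve it.
sm.create_endpoint_config(
    EndpointConfigName="demo-config",
    ProductionVariants=[{
        "VariantName": "AllTraffic",
        "ModelName": "demo-model",
        "InstanceType": "ml.m5.xlarge",
        "InitialInstanceCount": 1,
    }],
)

# 3. Launch the HTTPS endpoint.
sm.create_endpoint(EndpointName="demo-endpoint", EndpointConfigName="demo-config")
```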
Automated API testing stands as a cornerstone in the modern software development cycle, ensuring that applications perform consistently and accurately across diverse systems and technologies. Continuous learning and adaptation are essential, as the landscape of API technology is ever-evolving.
The ingestion workflow transforms these curated questions into vector embeddings using the Amazon Titan Text Embeddings model API. The system first converts the query into a vector embedding using the Amazon Titan Text Embeddings model API, which is accessed securely through PrivateLink.
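A minimal sketch of that embedding call, assuming the v1 Titan Text Embeddings model (the post does not name the exact version, and the input text is invented):

```python
import json
import boto3

bedrock = boto3.client("bedrock-runtime")

# Embed a single query string; the ingestion path embeds documents the same way.
response = bedrock.invoke_model(
    modelId="amazon.titan-embed-text-v1",
    body=json.dumps({"inputText": "What is the refund policy?"}),
)
embedding = json.loads(response["body"].read())["embedding"]
print(len(embedding))  # vector dimensionality; 1536 for this model
```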
Although you can integrate the model directly into an application, the approach that works well for production-grade applications is to deploy the model behind an endpoint and then invoke the endpoint via a RESTful API call to obtain the inference. However, you can use any other benchmarking tool.
The solution uses the following services: Amazon API Gateway is a fully managed service that makes it easy for developers to publish, maintain, monitor, and secure APIs at any scale. Purina’s solution is deployed as an API Gateway HTTP endpoint, which routes the requests to obtain pet attributes.
For more information about Jamba-Instruct, including relevant benchmarks, refer to Built for the Enterprise: Introducing AI21’s Jamba-Instruct Model. Programmatic access You can also access Jamba-Instruct through an API, using Amazon Bedrock and AWS SDK for Python (Boto3).
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models from leading AI companies and Amazon via a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI. A limitation of the approach is its larger computational cost.
Together, these AI-driven tools and technologies aren’t just reshaping how brands perform marketing tasks; they’re setting new benchmarks for what’s possible in customer engagement. In our experience, the artifact server has some limitations, such as limits on artifact size (because artifacts are sent over a REST API).
We demonstrate how to use the AWS Management Console and Amazon Translate public API to deliver automatic machine batch translation, and analyze the translations between two language pairs: English and Chinese, and English and Spanish. In this post, we present a solution that D2L.ai
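A sketch of kicking off such a batch translation job through the public API with boto3 (bucket paths, role ARN, and job name are placeholders; a second job with TargetLanguageCodes=["es"] would cover the English-Spanish pair):

```python
import boto3

translate = boto3.client("translate")

# Asynchronous batch job: Translate reads documents from the input prefix
# and writes translations to the output prefix.
translate.start_text_translation_job(
    JobName="en-to-zh-batch",
    InputDataConfig={"S3Uri": "s3://my-bucket/input/", "ContentType": "text/plain"},
    OutputDataConfig={"S3Uri": "s3://my-bucket/output/"},
    DataAccessRoleArn="arn:aws:iam::123456789012:role/TranslateRole",
    SourceLanguageCode="en",
    TargetLanguageCodes=["zh"],
)
```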
How to add your competitors’ public reviews to the mix, so you can get a reliable benchmark and win in the market. Lumoa has very clear open APIs, and integrations can serve various purposes based on your goals. How to identify key pain points that you can immediately tackle to increase the score.
Queries are sent to the backend using a REST API defined in Amazon API Gateway , a fully managed service that makes it straightforward for developers to create, publish, maintain, monitor, and secure APIs at any scale, and implemented through an API Gateway private integration.
With the rise in popularity of mobile usage around the world, we are delighted to announce that from February 2020, our customers will be able to test sending an SMS message to a destination they specify via the Spearline API. Access real-time reporting and analytics via Spearline API polling.
Use APIs and middleware to bridge gaps between CPQ and existing enterprise systems, ensuring smooth data flow. Automate Price Calculations and Adjustments Utilize real-time pricing engines within CPQ to dynamically calculate prices based on market trends, cost fluctuations, and competitor benchmarks.
The team’s early benchmarking results show 7.3 The baseline model used in this benchmarking is a multi-layer perceptron neural network with seven dense fully connected layers and over 200 parameters. The following table summarizes the benchmarking results on ml.p3.16xlarge SageMaker training instances.
In addition, deployments are now as simple as calling Boto3 SageMaker APIs and attaching the proper auto scaling policies. We already had an API layer on top of our models for model management and inference. In keeping with the instance types we were already using, we did our benchmarking with ml.g4dn.xlarge and ml.g4dn.2xlarge.
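Attaching an auto scaling policy to a SageMaker endpoint goes through Application Auto Scaling; a sketch under assumed endpoint/variant names and capacity limits:

```python
import boto3

aas = boto3.client("application-autoscaling")

# Endpoint and variant names are placeholders.
resource_id = "endpoint/demo-endpoint/variant/AllTraffic"

# Register the variant's instance count as a scalable dimension.
aas.register_scalable_target(
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    MinCapacity=1,
    MaxCapacity=4,
)

# Target-tracking policy: scale to hold ~70 invocations/instance.
aas.put_scaling_policy(
    PolicyName="invocations-target-tracking",
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    PolicyType="TargetTrackingScaling",
    TargetTrackingScalingPolicyConfiguration={
        "TargetValue": 70.0,
        "PredefinedMetricSpecification": {
            "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
        },
    },
)
```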
An advanced job is a custom load test job that allows you to perform extensive benchmarks based on your ML application SLA requirements, such as latency, concurrency, and traffic pattern. Inference Recommender uses this information to run a performance benchmark load test; a sketch of starting one follows.
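A hedged sketch of creating such an advanced job with boto3 (the job name, ARNs, instance type, phase shape, and SLA thresholds are all placeholders; consult the CreateInferenceRecommendationsJob API reference for the full schema):

```python
import boto3

sm_client = boto3.client("sagemaker")

sm_client.create_inference_recommendations_job(
    JobName="advanced-benchmark",
    JobType="Advanced",
    RoleArn="arn:aws:iam::123456789012:role/SageMakerRole",
    InputConfig={
        "ModelPackageVersionArn": "<model-package-arn>",
        "JobDurationInSeconds": 7200,
        # Traffic pattern: ramp users in phases to probe concurrency.
        "TrafficPattern": {
            "TrafficType": "PHASES",
            "Phases": [
                {"InitialNumberOfUsers": 1, "SpawnRate": 1, "DurationInSeconds": 120},
            ],
        },
        "EndpointConfigurations": [{"InstanceType": "ml.c5.xlarge"}],
    },
    # SLA guardrails: stop once latency or throughput bounds are hit.
    StoppingConditions={
        "MaxInvocations": 300,
        "ModelLatencyThresholds": [
            {"Percentile": "P95", "ValueInMilliseconds": 100},
        ],
    },
)
```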
Jina Embeddings v2 is the preferred choice for experienced ML scientists for the following reasons: State-of-the-art performance – We have shown on various text embedding benchmarks that Jina Embeddings v2 models excel on tasks such as classification, reranking, summarization, and retrieval.
The former question addresses model selection across model architectures, while the latter question concerns benchmarking trained models against a test dataset. This post provides details on how to implement large-scale Amazon SageMaker benchmarking and model selection tasks, with candidate models such as swin-large-patch4-window7-224 (195.4M parameters) and efficientnet-b5 (29.0M parameters).
In terms of resulting speedups, the approximate order is programming hardware, then programming against PBA APIs, then programming in an unmanaged language such as C++, then a managed language such as Python. The CUDA API and SDK were first released by NVIDIA in 2007.
Red-teaming engages human testers to probe an AI system for flaws in an adversarial style, and complements our other testing techniques, which include automated benchmarking against publicly available and proprietary datasets, human evaluation of completions against proprietary datasets, and more.
On Hugging Face, the Massive Text Embedding Benchmark (MTEB) is provided as a leaderboard for diverse text embedding tasks. It currently provides 129 benchmarking datasets across 8 different tasks on 113 languages. The post uses a medium instance to demonstrate deploying the model as an API endpoint using an SDK through SageMaker JumpStart.
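A sketch of that JumpStart deployment path with the SageMaker Python SDK (the model ID, instance type, and request payload are illustrative; pick an embedding model from the JumpStart catalog and check its expected input format):

```python
from sagemaker.jumpstart.model import JumpStartModel

# Hypothetical embedding model from the JumpStart catalog.
model = JumpStartModel(model_id="huggingface-textembedding-gte-small")

# Deploy behind a real-time endpoint and get a predictor handle.
predictor = model.deploy(initial_instance_count=1, instance_type="ml.m5.xlarge")

# Payload shape varies by model; many HF text-embedding containers accept this.
print(predictor.predict({"inputs": "benchmark this sentence"}))
```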
You can save time, money, and labor by implementing classifications in your workflow, and documents go to downstream applications and APIs based on document type. This helps you avoid throttling limits on API calls due to polling the Get* APIs.
Examples of tools you can use to advance sustainability initiatives are: Amazon Bedrock – a fully managed service that provides access to high-performing FMs from leading AI companies through a single API, enabling you to choose the right model for your sustainability use cases.