APIs, Benchmark and Data - Customer Contact Central

Benchmarking Amazon Nova and GPT-4o models with FloTorch

AWS Machine Learning

MARCH 11, 2025

Using its enterprise software, FloTorch conducted an extensive comparison between Amazon Nova models and OpenAI’s GPT-4o models with the Comprehensive Retrieval Augmented Generation (CRAG) benchmark dataset. FloTorch used these queries and their ground truth answers to create a subset benchmark dataset.

Benchmark

Benchmark APIs Enterprise Scripts

Generate training data and cost-effectively train categorical models with Amazon Bedrock

AWS Machine Learning

MARCH 27, 2025

In this post, we explore how you can use Amazon Bedrock to generate high-quality categorical ground truth data, which is crucial for training machine learning (ML) models in a cost-sensitive environment. For the multiclass classification problem to label support case data, synthetic data generation can quickly result in overfitting.

Education

Education Engineering APIs Enterprise

Fine-tune LLMs with synthetic data for context-based Q&A using Amazon Bedrock

AWS Machine Learning

FEBRUARY 12, 2025

Amazon Bedrock is a fully managed service that makes FMs from leading AI startups and Amazon available through an API, so you can choose from a wide range of FMs to find the model that is best suited for your use case. One consistent pain point of fine-tuning is the lack of data to effectively customize these models.

APIs

APIs Management Benchmark Scripts

GraphStorm 0.3: Scalable, multi-task learning on graphs with user-friendly APIs

AWS Machine Learning

AUGUST 2, 2024

With GraphStorm, you can build solutions that directly take into account the structure of relationships or interactions between billions of entities, which are inherently embedded in most real-world data, including fraud detection scenarios, recommendations, community detection, and search/retrieval problems. Specifically, GraphStorm 0.3

APIs

APIs Benchmark Construction Enterprise

Anthropic Claude 3.5 Sonnet ranks number 1 for business and finance in S&P AI Benchmarks by Kensho

AWS Machine Learning

JULY 9, 2024

Anthropic Claude 3.5 Sonnet currently ranks at the top of S&P AI Benchmarks by Kensho, which assesses large language models (LLMs) for finance and business. For example, there could be leakage of benchmark datasets’ questions and answers into training data. Kensho is the AI Innovation Hub for S&P Global.

Finance

Finance Benchmark industry standards Accountability

Pixtral-12B-2409 is now available on Amazon Bedrock Marketplace

AWS Machine Learning

MARCH 3, 2025

Overview of Pixtral 12B: Pixtral 12B, Mistral’s inaugural VLM, delivers robust performance across a range of benchmarks, surpassing other open models and rivaling larger counterparts, according to Mistral’s evaluation. Performance metrics and benchmarks: Pixtral 12B is trained to understand both natural images and documents, achieving 52.5%

Benchmark

Benchmark APIs Enterprise Construction

LLM-as-a-judge on Amazon Bedrock Model Evaluation

AWS Machine Learning

FEBRUARY 12, 2025

Amazon Bedrock, a fully managed service offering high-performing foundation models from leading AI companies through a single API, has recently introduced two significant evaluation capabilities: LLM-as-a-judge under Amazon Bedrock Model Evaluation and RAG evaluation for Amazon Bedrock Knowledge Bases.

Metrics

Metrics Engineering APIs Benchmark

Optimizing AI responsiveness: A practical guide to Amazon Bedrock latency-optimized inference

AWS Machine Learning

JANUARY 28, 2025

Consider benchmarking your user experience to find the best latency for your use case, considering that most humans can’t read faster than 225 words per minute and therefore extremely fast responses can hinder user experience. This variation stems from data travel time across networks and geographic distances.

Benchmark

Benchmark APIs Engineering Metrics

From RAG to fabric: Lessons learned from building real-world RAGs at GenAIIC – Part 2

AWS Machine Learning

NOVEMBER 15, 2024

This post focuses on doing RAG on heterogeneous data formats. We first introduce routers and how they can help manage diverse data sources. We then give tips on how to handle tabular data and conclude with multimodal RAG, focusing specifically on solutions that handle both text and image data.

APIs

APIs Engineering Chatbots Construction

Benchmark and optimize endpoint deployment in Amazon SageMaker JumpStart

AWS Machine Learning

JANUARY 29, 2024

This post explores these relationships via a comprehensive benchmarking of LLMs available in Amazon SageMaker JumpStart, including Llama 2, Falcon, and Mistral variants. We provide theoretical principles on how accelerator specifications impact LLM benchmarking. Additionally, models are fully sharded on the supported instance.

Benchmark

Benchmark APIs Enterprise Accountability

Evaluate RAG responses with Amazon Bedrock, LlamaIndex and RAGAS

AWS Machine Learning

MARCH 6, 2025

In the rapidly evolving landscape of artificial intelligence, Retrieval Augmented Generation (RAG) has emerged as a game-changer, revolutionizing how Foundation Models (FMs) interact with organization-specific data. It provides tools for chaining LLM operations, managing context, and integrating external data sources.

Metrics

Metrics Enterprise APIs Engineering

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning

NOVEMBER 19, 2024

The technical sessions covering generative AI are divided into six areas: First, we’ll spotlight Amazon Q, the generative AI-powered assistant transforming software development and enterprise data utilization. We’ll cover Amazon Bedrock Agents, capable of running complex tasks using your company’s systems and data.

APIs

APIs Enterprise Best practices Government

Intelligent healthcare forms analysis with Amazon Bedrock

AWS Machine Learning

AUGUST 13, 2024

Generative artificial intelligence (AI) provides an opportunity for improvements in healthcare by combining and analyzing structured and unstructured data across previously disconnected silos. Figure 1: Architecture – Standard Form – Data Extraction & Storage.

Healthcare

Healthcare APIs Consulting

Mistral-Small-24B-Instruct-2501 is now available on SageMaker Jumpstart and Amazon Bedrock Marketplace

AWS Machine Learning

FEBRUARY 24, 2025

Performance metrics and benchmarks: According to Mistral, the instruction-tuned version of the model achieves over 81% accuracy on Massive Multitask Language Understanding (MMLU) with 150 tokens per second latency, making it currently the most efficient model in its category. It doesn’t support Converse APIs or other Amazon Bedrock tooling.

APIs

APIs Enterprise Benchmark Feedback

How Mend.io unlocked hidden patterns in CVE data with Anthropic Claude on Amazon Bedrock

AWS Machine Learning

JULY 18, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

APIs

APIs Technology Analytics Benchmark

Get started with Amazon Titan Text Embeddings V2: A new state-of-the-art embeddings model on Amazon Bedrock

AWS Machine Learning

MAY 2, 2024

They are commonly used in knowledge bases to represent textual data as dense vectors, enabling efficient similarity search and retrieval. A common way to select an embedding model (or any model) is to look at public benchmarks; an accepted benchmark for measuring embedding quality is the MTEB leaderboard.

Benchmark

Benchmark Metrics Enterprise APIs

Enable data sharing through federated learning: A policy approach for chief digital officers

AWS Machine Learning

MARCH 15, 2024

This is a guest blog post written by Nitin Kumar, a Lead Data Scientist at T and T Consulting Services, Inc. Medical data restrictions: You can use machine learning (ML) to assist doctors and researchers in diagnosis tasks, thereby speeding up the process. This isolated legacy data has the potential for massive impact if cumulated.

Healthcare

Healthcare Government Best practices Engineering

Reduce conversational AI response time through inference at the edge with AWS Local Zones

AWS Machine Learning

MARCH 3, 2025

First is the network latency, which is the round-trip time for data transmission between the device and the cloud. They enable applications requiring very low latency or local data processing using familiar APIs and tool sets. TTFT consists of two components. model=meta-llama/Llama-3.2-3B

APIs

APIs Benchmark Metrics Healthcare

Train gigantic models with near-linear scaling using sharded data parallelism on Amazon SageMaker

AWS Machine Learning

OCTOBER 31, 2022

Data scientists and machine learning engineers are constantly looking for the best way to optimize their training compute, yet are struggling with the communication overhead that can increase along with the overall cluster size. speed up compared to PyTorch’s Fully Sharded Data Parallel (FSDP). on 256 GPUs.

Scripts

Scripts Benchmark APIs Engineering

Build a secure enterprise application with Generative AI and RAG using Amazon SageMaker JumpStart

AWS Machine Learning

SEPTEMBER 6, 2023

It’s powered by large language models (LLMs) that are pre-trained on vast amounts of data and commonly referred to as foundation models (FMs). These SageMaker endpoints are consumed in the Amplify React application through Amazon API Gateway and AWS Lambda functions. This dataset is a large corpus of legal and administrative data.

Enterprise

Enterprise APIs Real estate Construction

Optimize pet profiles for Purina’s Petfinder application using Amazon Rekognition Custom Labels and AWS Step Functions

AWS Machine Learning

OCTOBER 18, 2023

The solution focuses on the fundamental principles of developing an AI/ML application workflow of data preparation, model training, model evaluation, and model monitoring. Additionally, it often requires thousands or tens of thousands of hand-labeled images to provide the model with enough data to accurately make decisions.

APIs

APIs Metrics Consulting

Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 2: Interactive User Experiences in SageMaker Studio

AWS Machine Learning

NOVEMBER 30, 2023

Amazon SageMaker is a fully managed service that enables developers and data scientists to quickly and easily build, train, and deploy machine learning (ML) models at scale. SageMaker makes it easy to deploy models into production directly through API calls to the service. SageMaker provides a variety of options to deploy models.

Benchmark

Benchmark APIs Scripts Engineering

Maximizing ROI with CPQ: 10 Best Practices for Sales Success

Cincom

FEBRUARY 14, 2025

Integrate CPQ Seamlessly with CRM, ERP, and Contract Management Systems Ensure bidirectional data synchronization between CPQ and CRM so that your sales reps can access the latest customer data and pricing configurations. Use APIs and middleware to bridge gaps between CPQ and existing enterprise systems, ensuring smooth data flow.

Best practices

Best practices Sales CRM Finance

Common Challenges in Automated API Testing: Overcoming Obstacles with Expert Solutions

CSM Magazine

DECEMBER 27, 2023

Automated API testing stands as a cornerstone in the modern software development cycle, ensuring that applications perform consistently and accurately across diverse systems and technologies. Continuous learning and adaptation are essential, as the landscape of API technology is ever-evolving.

APIs

APIs Benchmark Best practices Technology

Learn how Amazon Ads created a generative AI-powered image generation capability using Amazon SageMaker

AWS Machine Learning

MAY 15, 2024

Acting as a model hub, JumpStart provided a large selection of foundation models and the team quickly ran their benchmarks on candidate models. Regarding the inference, customers using Amazon Ads now have a new API to receive these generated images. The Amazon API Gateway receives the PUT request (step 1).

Advertising

Advertising APIs Engineering Benchmark

The executive’s guide to generative AI for sustainability

AWS Machine Learning

APRIL 22, 2024

These include the ability to analyze massive amounts of data, identify patterns, summarize documents, perform translations, correct errors, or answer questions. These examples include speeding up market trend analysis, ensuring accurate risk management and compliance, and facilitating data collection or report generation.

Best practices

Best practices Benchmark Transportation Engineering

Exciting new developments from Spearline

Spearline

FEBRUARY 3, 2020

With such a rise in popularity of mobile usage around the world, we are delighted to announce that from February 2020, our customers will be able to test the sending of an SMS message to a destination specified by them, via the Spearline API. Access real-time reporting and analytics via Spearline API polling.

Telecommunications

Telecommunications APIs Benchmark Enterprise

Build a RAG-based QnA application using Llama3 models from SageMaker JumpStart

AWS Machine Learning

SEPTEMBER 12, 2024

Organizations generate vast amounts of data that is proprietary to them, and it’s critical to get insights out of the data for better business outcomes. Generative AI and foundation models (FMs) play an important role in creating applications using an organization’s data that improve customer experiences and employee productivity.

APIs

APIs Benchmark Enterprise Construction

A review of purpose-built accelerators for financial services

AWS Machine Learning

SEPTEMBER 11, 2024

Data contains information, and information can be used to predict future behaviors, from the buying habits of customers to securities returns. Businesses are seeking a competitive advantage by being able to use the data they hold, apply it to their unique understanding of their business domain, and then generate actionable insights from it.

Benchmark

Benchmark Banking Analytics Big data

Build a multilingual automatic translation pipeline with Amazon Translate Active Custom Translation

AWS Machine Learning

JUNE 15, 2023

We demonstrate how to use the AWS Management Console and Amazon Translate public API to deliver automatic machine batch translation, and analyze the translations between two language pairs: English and Chinese, and English and Spanish. This results in translations that better match the style and content of the parallel data.

APIs

APIs Benchmark Best practices Engineering

Evaluation of generative AI techniques for clinical report summarization

AWS Machine Learning

MAY 13, 2024

This is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading artificial intelligence (AI) companies like AI21 Labs, Anthropic, Cohere, Meta, Stability AI, and Amazon through a single API. We used the MIMIC CXR dataset, which can be accessed through a data use agreement.

Healthcare

Healthcare Engineering APIs Benchmark

Optimize your machine learning deployments with auto scaling on Amazon SageMaker

AWS Machine Learning

FEBRUARY 8, 2023

Building ML models involves preparing the data for training, extracting features, and then training and fine-tuning the model using the features. Next, the model has to be put to work so that it can generate inference (or predictions) from new data, which can then be used in the application. large two-core machine.

Benchmark

Benchmark Metrics APIs Engineering

AI21 Labs Jamba-Instruct model is now available in Amazon Bedrock

AWS Machine Learning

JUNE 25, 2024

For more information about Jamba-Instruct, including relevant benchmarks, refer to Built for the Enterprise: Introducing AI21’s Jamba-Instruct Model. Programmatic access: You can also access Jamba-Instruct through an API, using Amazon Bedrock and AWS SDK for Python (Boto3).

APIs

APIs Benchmark Enterprise Technology

eSentire delivers private and secure generative AI interactions to customers with Amazon SageMaker

AWS Machine Learning

JUNE 21, 2024

eSentire is an industry-leading provider of Managed Detection & Response (MDR) services protecting users, data, and applications of over 2,000 organizations globally across more than 35 industries. This helps customers quickly and seamlessly explore their security data and accelerate internal investigations.

Engineering

Engineering Construction APIs Benchmark

Run PyTorch Lightning and native PyTorch DDP on Amazon SageMaker Training, featuring Amazon Search

AWS Machine Learning

AUGUST 18, 2022

So much data, so little time. Machine learning (ML) experts, data scientists, engineers and enthusiasts have encountered this problem the world over. SageMaker model training now has support for native PyTorch Distributed Data Parallel with NCCL backend, allowing developers to migrate onto SageMaker easier than ever before.

Scripts

Scripts APIs Benchmark Engineering

Build well-architected IDP solutions with a custom lens – Part 4: Performance efficiency

AWS Machine Learning

NOVEMBER 22, 2023

Consider revisiting the logging design of the telemetry data and adding infrastructure as code (IaC), such as document processing pipelines, to the solution. Rather than requiring your data science and IT teams to build and maintain AI models, you can use pre-trained AI services that can automate tasks for you.

APIs

APIs Metrics Benchmark Enterprise

Build RAG applications using Jina Embeddings v2 on Amazon SageMaker JumpStart

AWS Machine Learning

JUNE 6, 2024

RAG is the process of optimizing the output of a large language model (LLM) so it references an authoritative knowledge base outside of its training data sources before generating a response. With SageMaker JumpStart, the model is deployed in an AWS secure environment and under your VPC controls, helping provide data security.

Benchmark

Benchmark Enterprise Construction APIs

Gemma is now available in Amazon SageMaker JumpStart

AWS Machine Learning

MARCH 13, 2024

Because the models are hosted and deployed on AWS, your data, whether used for evaluating the model or using it at scale, is never shared with third parties. Choose the model card to view details about the model such as the license, data used to train, and how to use the model. This looks pretty good!

Benchmark

Benchmark Scripts APIs Feedback

Snowflake Arctic models are now available in Amazon SageMaker JumpStart

AWS Machine Learning

AUGUST 22, 2024

Snowflake Arctic is a family of enterprise-grade large language models (LLMs) built by Snowflake to cater to the needs of enterprise users, exhibiting exceptional capabilities (as shown in the following benchmarks) in SQL querying, coding, and accurately following instructions. Snowflake Arctic models are available under an Apache 2.0

Enterprise

Enterprise APIs Benchmark Scripts

Improved ML model deployment using Amazon SageMaker Inference Recommender

AWS Machine Learning

APRIL 20, 2023

The majority of data is non-fraudulent (284,315 samples), with only 492 samples corresponding to fraudulent examples. In the data, Class is the target classification variable (fraudulent vs. non-fraudulent) in the first column, followed by other variables. The class column corresponds to whether or not a transaction is fraudulent.

APIs

APIs Metrics Benchmark Engineering

Scalable intelligent document processing using Amazon Bedrock

AWS Machine Learning

JUNE 12, 2024

In today’s data-driven business landscape, the ability to efficiently extract and process information from a wide range of documents is crucial for informed decision-making and maintaining a competitive edge. Confidence scores and human review: Maintaining data accuracy and quality is paramount in any document processing solution.

APIs

APIs Accountability Benchmark Government

Testing times: testingRTC is the smart, synchronized, real-world scenario WebRTC testing solution for the times we live in.

Spearline

JULY 21, 2022

And testingRTC offers multiple ways to export these metrics, from direct collection from webhooks, to downloading results in CSV format using the REST API. And all of this data can be broken down further by probe (browser). You can drill down into any of the users for more detailed information as well as check additional channel data.

Scripts

Scripts APIs Metrics Analytics

Product News – May 2023

Lumoa

JULY 6, 2023

Summarize thousands of feedback items with just one click. Use the power of AI to save time and stress. Safe and secure – none of your data will be stored anywhere outside of Lumoa. If you also want access to the new GPT functionality, and to be on the waitlist for cutting-edge features, contact your CS manager or help@lumoa.me

APIs

APIs industry standards Surveys Benchmark

How Forethought saves over 66% in costs for generative AI models using Amazon SageMaker

AWS Machine Learning

JUNE 13, 2023

This hyper-personalization is achieved through fine-tuning embedding models and classifiers on customer data, ensuring accurate information retrieval results and domain knowledge that caters to each client’s unique needs. In addition, deployments are now as simple as calling Boto3 SageMaker APIs and attaching the proper auto scaling policies.

APIs

APIs Benchmark Engineering Management
