APIs, Engineering and Scripts - Customer Contact Central

Benchmarking Amazon Nova and GPT-4o models with FloTorch

AWS Machine Learning

MARCH 11, 2025

Amazon Bedrock APIs make it straightforward to use Amazon Titan Text Embeddings V2 for embedding data. The implementation used the universal gateway provided by the FloTorch enterprise version to enable consistent API calls using the same function and to track token count and latency metrics uniformly. get("message", {}).get("content")

Benchmark

Benchmark APIs Enterprise Scripts

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

AWS Machine Learning

NOVEMBER 13, 2024

The solution also uses Amazon Cognito user pools and identity pools for managing authentication and authorization of users, Amazon API Gateway REST APIs, AWS Lambda functions, and an Amazon Simple Storage Service (Amazon S3) bucket. To launch the solution in a different Region, change the aws_region parameter accordingly.

APIs

APIs Scripts Accountability Entertainment

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

AWS Machine Learning

NOVEMBER 14, 2024

Customers can use the SageMaker Studio UI or APIs to specify the SageMaker Model Registry model to be shared and grant access to specific AWS accounts or to everyone in the organization. We will start by using the SageMaker Studio UI and then by using APIs.

Government

Government Management APIs Accountability

Few-shot prompt engineering and fine-tuning for LLMs in Amazon Bedrock

AWS Machine Learning

AUGUST 2, 2024

Traditionally, earnings call scripts have followed similar templates, making it a repeatable task to generate them from scratch each time. On the other hand, generative artificial intelligence (AI) models can learn these templates and produce coherent scripts when fed with quarterly financial data.

Engineering

Engineering Scripts Metrics Advertising

Enterprise-grade natural language to SQL generation using LLMs: Balancing accuracy, latency, and scale

AWS Machine Learning

APRIL 24, 2025

The top-level definitions of these abstractions are included as part of the prompt context for query generation, and the full definitions are provided to the SQL execution engine, along with the generated query. Depending on the use case, this can be a static or dynamically generated script. A domain-specific user prompt.

Enterprise

Enterprise Construction Scripts APIs

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

AWS Machine Learning

NOVEMBER 15, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

APIs

APIs Government Best practices Metrics

Secure a generative AI assistant with OWASP Top 10 mitigation

AWS Machine Learning

JANUARY 24, 2025

These steps might involve both the use of an LLM and external data sources and APIs. Agent plugin controller This component is responsible for the API integration to external data sources and APIs. The LLM agent is an orchestrator of a set of steps that might be necessary to complete the desired request.

APIs

APIs Scripts Best practices Management

Generate training data and cost-effectively train categorical models with Amazon Bedrock

AWS Machine Learning

MARCH 27, 2025

This requirement translates into time and effort investment of trained personnel, who could be support engineers or other technical staff, to review tens of thousands of support cases to arrive at an even distribution of 3,000 per category. Sonnet prediction accuracy through prompt engineering. client = boto3.client("bedrock-runtime",

Education

Education Engineering APIs Enterprise

Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock

AWS Machine Learning

APRIL 18, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Stability AI, and Amazon with a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.

Scripts

Scripts Best practices Engineering Accountability

Create an end-to-end serverless digital assistant for semantic search with Amazon Bedrock

AWS Machine Learning

JULY 2, 2024

Amazon Bedrock is a fully managed service that makes a wide range of foundation models (FMs) available though an API without having to manage any infrastructure. An Amazon OpenSearch Serverless vector engine to store enterprise data as vectors to perform semantic search. The request is sent by the web application to the API.

APIs

APIs Scripts Enterprise Engineering

Integrate dynamic web content in your generative AI application using a web search API and Amazon Bedrock Agents

AWS Machine Learning

SEPTEMBER 20, 2024

Amazon Bedrock agents use LLMs to break down tasks, interact dynamically with users, run actions through API calls, and augment knowledge using Amazon Bedrock Knowledge Bases. In this post, we demonstrate how to use Amazon Bedrock Agents with a web search API to integrate dynamic web content in your generative AI application.

APIs

APIs Chatbots Construction Engineering

Bring legacy machine learning code into Amazon SageMaker using AWS Step Functions

AWS Machine Learning

MARCH 15, 2023

The best practice for migration is to refactor these legacy codes using the Amazon SageMaker API or the SageMaker Python SDK. We demonstrate how two different personas, a data scientist and an MLOps engineer, can collaborate to lift and shift hundreds of legacy models. SageMaker runs the legacy script inside a processing container.

Scripts

Scripts APIs Engineering Construction

Derive generative AI powered insights from Alation Cloud Services using Amazon Q Business Custom Connector

AWS Machine Learning

FEBRUARY 25, 2025

We recommend running similar scripts only on your own data sources after consulting with the team who manages them, or be sure to follow the terms of service for the sources that youre trying to fetch data from. A simple architectural representation of the steps involved is shown in the following figure. secrets_manager_client = boto3.client('secretsmanager')

Enterprise

Enterprise Engineering APIs Accountability

Build an image search engine with Amazon Kendra and Amazon Rekognition

AWS Machine Learning

MAY 5, 2023

To address the problems associated with complex searches, this post describes in detail how you can achieve a search engine that is capable of searching for complex images by integrating Amazon Kendra and Amazon Rekognition. A Python script is used to aid in the process of uploading the datasets and generating the manifest file.

Engineering

Engineering APIs Scripts Enterprise

Secure Amazon SageMaker Studio presigned URLs Part 3: Multi-account private API access to Studio

AWS Machine Learning

APRIL 11, 2023

In the post Secure Amazon SageMaker Studio presigned URLs Part 2: Private API with JWT authentication , we demonstrated how to build a private API to generate Amazon SageMaker Studio presigned URLs that are only accessible by an authenticated end-user within the corporate network from a single account.

APIs

APIs Accountability Scripts Enterprise

Using Agents for Amazon Bedrock to interactively generate infrastructure as code

AWS Machine Learning

JULY 11, 2024

Agents for Amazon Bedrock automates the prompt engineering and orchestration of user-requested tasks. This solution uses Retrieval Augmented Generation (RAG) to ensure the generated scripts adhere to organizational needs and industry standards. A GitHub account with a repository to store the generated Terraform scripts.

Scripts

Scripts APIs Engineering industry standards

Amazon SageMaker model parallel library now accelerates PyTorch FSDP workloads by up to 20%

AWS Machine Learning

DECEMBER 22, 2023

In particular, we cover the SMP library’s new simplified user experience that builds on open source PyTorch Fully Sharded Data Parallel (FSDP) APIs, expanded tensor parallel functionality that enables training models with hundreds of billions of parameters, and performance optimizations that reduce model training time and cost by up to 20%.

Scripts

Scripts APIs Engineering Finance

Fine-tune and deploy a summarizer model using the Hugging Face Amazon SageMaker containers bringing your own script

AWS Machine Learning

JULY 29, 2022

The SageMaker Python SDK provides open-source APIs and containers to train and deploy models on SageMaker, using several different ML and deep learning frameworks. Build your training script for the Hugging Face SageMaker estimator. script to use with Script Mode and pass hyperparameters for training. to(device).

Scripts

Scripts APIs Big data Engineering

Import a fine-tuned Meta Llama 3 model for SQL query generation on Amazon Bedrock

AWS Machine Learning

AUGUST 1, 2024

Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading artificial intelligence (AI) companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API. The scripts for fine-tuning and evaluation are available on the GitHub repository.

APIs

APIs Scripts Real estate Accountability

How LotteON built a personalized recommendation system using Amazon SageMaker and MLOps

AWS Machine Learning

MAY 16, 2024

The main AWS services used are SageMaker, Amazon EMR , AWS CodeBuild , Amazon Simple Storage Service (Amazon S3), Amazon EventBridge , AWS Lambda , and Amazon API Gateway. Real-time recommendation inference The inference phase consists of the following steps: The client application makes an inference request to the API gateway.

Personalization

Personalization APIs Scripts Technical Support

Run ML inference on unplanned and spiky traffic using Amazon SageMaker multi-model endpoints

AWS Machine Learning

FEBRUARY 19, 2024

Inference API – The server exposes an API that allows client applications to send input data and receive predictions from the deployed models. Integration with backend engines – Model servers have integrations with backend frameworks like DeepSpeed and FasterTransformer to partition large models and run highly optimized inference.

APIs

APIs Engineering Scripts Metrics

Get smarter search results with the Amazon Kendra Intelligent Ranking and OpenSearch plugin

AWS Machine Learning

JANUARY 9, 2023

using open source or commercial-off-the-shelf search engines, then you’re probably familiar with the inherent accuracy challenges involved in getting relevant search results. You need your search engine to be smarter so it can rank documents based on matching the meaning or semantics of the content to the intention of the user’s query.

Scripts

Scripts APIs Engineering Transportation

Harness large language models in fake news detection

AWS Machine Learning

NOVEMBER 14, 2023

The solution also uses Amazon Bedrock , a fully managed service that makes foundation models (FMs) from Amazon and third-party model providers accessible through the AWS Management Console and APIs. First, we discuss those two prompt engineering techniques, then we show their implementation using LangChain and Amazon Bedrock.

APIs

APIs Engineering Scripts Education

Fine-tune LLMs with synthetic data for context-based Q&A using Amazon Bedrock

AWS Machine Learning

FEBRUARY 12, 2025

Amazon Bedrock is a fully managed service that makes FMs from leading AI startups and Amazon available through an API, so you can choose from a wide range of FMs to find the model that is best suited for your use case. Solution overview The solution comprises two main steps: Generate synthetic data using the Amazon Bedrock InvokeModel API.

APIs

APIs Management Benchmark Scripts

Modernizing data science lifecycle management with AWS and Wipro

AWS Machine Learning

JANUARY 5, 2024

Wipro further accelerated their ML model journey by implementing Wipro’s code accelerators and snippets to expedite feature engineering, model training, model deployment, and pipeline creation. Wipro has used the input filter and join functionality of SageMaker batch transformation API.

Management

Management APIs Engineering Government

Use Amazon Titan models for image generation, editing, and searching

AWS Machine Learning

FEBRUARY 19, 2024

An asynchronous API and Amazon OpenSearch Service connector make it easy to integrate the model into your neural search applications. Before you can write scripts that use the Amazon Bedrock API, you need to install the appropriate version of the AWS SDK in your environment. The vectors power speedy, accurate search experiences.

Scripts

Scripts APIs Entertainment Engineering

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

AWS Machine Learning

APRIL 29, 2024

This often means the method of using a third-party LLM API won’t do for security, control, and scale reasons. It provides an approachable, robust Python API for the full infrastructure stack of ML/AI, from data and compute to workflows and observability. The following figure illustrates this workflow.

APIs

APIs Engineering Scripts Management

Unlock the potential of generative AI in industrial operations

AWS Machine Learning

MARCH 19, 2024

Workers gain productivity through AI-generated insights, engineers can proactively detect anomalies, supply chain managers optimize inventories, and plant leadership makes informed, data-driven decisions. The user can use the Amazon Recognition DetectText API to extract text data from these images. setup.sh. (a

Construction

Construction Scripts APIs Enterprise

Speed ML development using SageMaker Feature Store and Apache Iceberg offline store compaction

AWS Machine Learning

DECEMBER 21, 2022

Amazon SageMaker Feature Store is a purpose-built feature management solution that helps data scientists and ML engineers securely store, discover, and share curated data used in training and prediction workflows. In this example, we ingest records using the FeatureGroup.ingest() API, which ingests records from a Pandas DataFrame.

Scripts

Scripts Engineering APIs Big data

Deploy a Slack gateway for Amazon Bedrock

AWS Machine Learning

JUNE 19, 2024

The Slack application sends the event to Amazon API Gateway , which is used in the event subscription. API Gateway forwards the event to an AWS Lambda function. About the Authors Rushabh Lokhande is a Senior Data & ML Engineer with AWS Professional Services Analytics Practice.

APIs

APIs Engineering Accountability Big data

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

AWS Machine Learning

MAY 30, 2024

AWS Prototyping successfully delivered a scalable prototype, which solved CBRE’s business problem with a high accuracy rate (over 95%) and supported reuse of embeddings for similar NLQs, and an API gateway for integration into CBRE’s dashboards. The following diagram illustrates the web interface and API management layer.

Real estate

Real estate APIs Metrics Construction

Run PyTorch Lightning and native PyTorch DDP on Amazon SageMaker Training, featuring Amazon Search

AWS Machine Learning

AUGUST 18, 2022

Machine learning (ML) experts, data scientists, engineers and enthusiasts have encountered this problem the world over. In some ways similar to what Keras did for TensorFlow, or even arguably Hugging Face, PyTorch Lightning provides a high-level API with abstractions for much of the lower-level functionality of PyTorch itself.

Scripts

Scripts APIs Benchmark Engineering

Easily build semantic image search using Amazon Titan

AWS Machine Learning

NOVEMBER 30, 2023

The function then searches the OpenSearch Service image index for images matching the celebrity name and the k-nearest neighbors for the vector using cosine similarity using Exact k-NN with scoring script. Go to the CloudFormation console, choose the stack that you deployed through the deploy script mentioned previously, and delete the stack.

Scripts

Scripts APIs Entertainment Accountability

Enhancing LLM Capabilities with NeMo Guardrails on Amazon SageMaker JumpStart

AWS Machine Learning

FEBRUARY 5, 2025

Lets delve into a basic Colang script to see how it works: define user express greeting "hello" "hi" "what's up?" define flow greeting user express greeting bot express greeting bot ask how are you In this script, we see the three fundamental types of blocks in Colang: User Message Blocks (define user ): These define possible user inputs.

Chatbots

Chatbots Construction Best practices APIs

How to extend the functionality of AWS Trainium with custom operators

AWS Machine Learning

APRIL 27, 2023

Trainium support for custom operators Trainium (and AWS Inferentia2) supports CustomOps in software through the Neuron SDK and accelerates them in hardware using the GPSIMD engine (General Purpose Single Instruction Multiple Data engine). The scalar and vector engines are highly parallelized and optimized for floating-point operations.

APIs

APIs Engineering Scripts Benchmark

Optimal pricing for maximum profit using Amazon SageMaker

AWS Machine Learning

AUGUST 4, 2022

This is a guest post by Viktor Enrico Jeney, Senior Machine Learning Engineer at Adspert. The repricing ML model is a Scikit-Learn Random Forest implementation in SageMaker Script Mode, which is trained using data available in the S3 bucket (the analytics layer). This may be different to the partitioning used on the stage layer.

Scripts

Scripts Advertising Engineering Analytics

Simplify access to internal information using Retrieval Augmented Generation and LangChain Agents

AWS Machine Learning

SEPTEMBER 14, 2023

Amazon API Gateway hosts a REST API with various endpoints to handle user requests that are authenticated using Amazon Cognito. Finally, the response is sent back to the user via a HTTPs request through the Amazon API Gateway REST API integration response. The web application front-end is hosted on AWS Amplify.

APIs

APIs Healthcare Scripts Engineering

Accelerate Mixtral 8x7B pre-training with expert parallelism on Amazon SageMaker

AWS Machine Learning

MAY 23, 2024

The SMP library uses NVIDIA Megatron to implement expert parallelism and support training MoE models, and runs on top of PyTorch Fully Sharded Data Parallel (FSDP) APIs. With SageMaker training jobs, you can launch and manage clusters of high-performance instances with simple API calls. In this example, we use SageMaker training jobs.

APIs

APIs Engineering Scripts Construction

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

AWS Machine Learning

APRIL 19, 2023

Our data scientists train the model in Python using tools like PyTorch and save the model as PyTorch scripts. Ideally, we instead want to load the model PyTorch scripts, extract the features from model input, and run model inference entirely in Java. They use the DJL PyTorch engine to initialize the model predictor.

Engineering

Engineering APIs Scripts Education

How Earth.com and Provectus implemented their MLOps Infrastructure with Amazon SageMaker

AWS Machine Learning

JUNE 27, 2023

This post explains how Provectus and Earth.com were able to enhance the AI-powered image recognition capabilities of EarthSnap, reduce engineering heavy lifting, and minimize administrative costs by implementing end-to-end ML pipelines, delivered as part of a managed MLOps platform and managed AI services.

Engineering

Engineering APIs Scripts Entertainment

Build a contextual chatbot application using Knowledge Bases for Amazon Bedrock

AWS Machine Learning

FEBRUARY 19, 2024

The integration of retrieval and generation also requires additional engineering effort and computational resources. For text generation, Amazon Bedrock provides the RetrieveAndGenerate API to create embeddings of user queries, and retrieves relevant chunks from the vector database to generate accurate responses.

Chatbots

Chatbots APIs Engineering Enterprise

Find answers accurately and quickly using Amazon Q Business with the SharePoint Online connector

AWS Machine Learning

JULY 25, 2024

Any additional mappings need to be set in the user store using the user store APIs. Overview of solution This post presents the steps to create a certificate and private key, configure Azure AD (either using the Azure AD console or a PowerShell script), and configure Amazon Q Business. Using the provided PowerShell script.

Scripts

Scripts APIs Engineering Enterprise

Understanding and predicting urban heat islands at Gramener using Amazon SageMaker geospatial capabilities

AWS Machine Learning

APRIL 5, 2024

Gramener’s GeoBox solution empowers users to effortlessly tap into and analyze public geospatial data through its powerful API, enabling seamless integration into existing workflows. With the SearchRasterDataCollection API, SageMaker provides a purpose-built functionality to facilitate the retrieval of satellite imagery.

APIs

APIs Engineering Analytics Healthcare

From RAG to fabric: Lessons learned from building real-world RAGs at GenAIIC – Part 1

AWS Machine Learning

OCTOBER 24, 2024

The retrieve_and_generate API does both the retrieval and a call to an FM (Amazon Titan or Anthropic’s Claude family of models on Amazon Bedrock ), for a fully managed solution. The retriever isn’t at fault, the problem is with FM generation (evaluated by a human or LLM): Try prompt engineering to mitigate hallucinations.

Chatbots

Chatbots Metrics Scripts APIs

Benchmarking Amazon Nova and GPT-4o models with FloTorch

Transcribe, translate, and summarize live streams in your browser with AWS AI and generative AI services

Trending Sources

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

Few-shot prompt engineering and fine-tuning for LLMs in Amazon Bedrock

Enterprise-grade natural language to SQL generation using LLMs: Balancing accuracy, latency, and scale

Considerations for addressing the core dimensions of responsible AI for Amazon Bedrock applications

Secure a generative AI assistant with OWASP Top 10 mitigation

Generate training data and cost-effectively train categorical models with Amazon Bedrock

Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock

Create an end-to-end serverless digital assistant for semantic search with Amazon Bedrock

Integrate dynamic web content in your generative AI application using a web search API and Amazon Bedrock Agents

Bring legacy machine learning code into Amazon SageMaker using AWS Step Functions

Derive generative AI powered insights from Alation Cloud Services using Amazon Q Business Custom Connector

Build an image search engine with Amazon Kendra and Amazon Rekognition

Secure Amazon SageMaker Studio presigned URLs Part 3: Multi-account private API access to Studio

Using Agents for Amazon Bedrock to interactively generate infrastructure as code

Amazon SageMaker model parallel library now accelerates PyTorch FSDP workloads by up to 20%

Fine-tune and deploy a summarizer model using the Hugging Face Amazon SageMaker containers bringing your own script

Import a fine-tuned Meta Llama 3 model for SQL query generation on Amazon Bedrock

How LotteON built a personalized recommendation system using Amazon SageMaker and MLOps

Run ML inference on unplanned and spiky traffic using Amazon SageMaker multi-model endpoints

Get smarter search results with the Amazon Kendra Intelligent Ranking and OpenSearch plugin

Harness large language models in fake news detection

Fine-tune LLMs with synthetic data for context-based Q&A using Amazon Bedrock

Modernizing data science lifecycle management with AWS and Wipro

Use Amazon Titan models for image generation, editing, and searching

Develop and train large models cost-efficiently with Metaflow and AWS Trainium

Unlock the potential of generative AI in industrial operations

­­Speed ML development using SageMaker Feature Store and Apache Iceberg offline store compaction

Deploy a Slack gateway for Amazon Bedrock

CBRE and AWS perform natural language queries of structured data using Amazon Bedrock

Run PyTorch Lightning and native PyTorch DDP on Amazon SageMaker Training, featuring Amazon Search

Easily build semantic image search using Amazon Titan

Enhancing LLM Capabilities with NeMo Guardrails on Amazon SageMaker JumpStart

How to extend the functionality of AWS Trainium with custom operators

Optimal pricing for maximum profit using Amazon SageMaker

Simplify access to internal information using Retrieval Augmented Generation and LangChain Agents

Accelerate Mixtral 8x7B pre-training with expert parallelism on Amazon SageMaker

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

How Earth.com and Provectus implemented their MLOps Infrastructure with Amazon SageMaker

Build a contextual chatbot application using Knowledge Bases for Amazon Bedrock

Find answers accurately and quickly using Amazon Q Business with the SharePoint Online connector

Understanding and predicting urban heat islands at Gramener using Amazon SageMaker geospatial capabilities

From RAG to fabric: Lessons learned from building real-world RAGs at GenAIIC – Part 1

Stay Connected

Speed ML development using SageMaker Feature Store and Apache Iceberg offline store compaction