Customers can use the SageMaker Studio UI or APIs to specify the SageMaker Model Registry model to be shared and grant access to specific AWS accounts or to everyone in the organization. We start with the SageMaker Studio UI and then show how to do the same using the APIs.
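As an illustrative sketch of the API route (not the post's exact code), a resource policy on the model package group can grant another account access; the group name, account IDs, actions, and ARNs below are placeholders:

```python
import json

import boto3

sm = boto3.client("sagemaker")

# Hypothetical policy granting a peer account read access to a model package group.
policy = {
    "Version": "2012-10-17",
    "Statement": [{
        "Effect": "Allow",
        "Principal": {"AWS": "arn:aws:iam::444455556666:root"},  # placeholder account
        "Action": ["sagemaker:DescribeModelPackage", "sagemaker:ListModelPackages"],
        "Resource": "arn:aws:sagemaker:us-east-1:111122223333:model-package-group/my-models",
    }],
}

sm.put_model_package_group_policy(
    ModelPackageGroupName="my-models",
    ResourcePolicy=json.dumps(policy),
)
```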
Amazon SageMaker Studio provides a fully managed solution for data scientists to interactively build, train, and deploy machine learning (ML) models. Amazon SageMaker notebook jobs allow data scientists to run their notebooks on demand or on a schedule with a few clicks in SageMaker Studio.
In this post, we continue to build on the previous solution to demonstrate how to build a private API Gateway via Amazon API Gateway as a proxy interface to generate and access Amazon SageMaker presigned URLs. The user invokes the createStudioPresignedUrl API on API Gateway along with a token in the header.
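A minimal sketch of the client side of that call, assuming the API is deployed at a placeholder endpoint and expects the token in an Authorization header (the header name and response field are assumptions):

```python
import requests

# Placeholder endpoint for the private API Gateway deployment.
API_URL = "https://abc123.execute-api.us-east-1.amazonaws.com/dev/createStudioPresignedUrl"

response = requests.post(
    API_URL,
    headers={"Authorization": "<token>"},  # token validated by the API's authorizer
    timeout=10,
)
response.raise_for_status()
presigned_url = response.json()["presignedUrl"]  # assumed response field name
```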
During these live events, F1 IT engineers must triage critical issues across its services, such as network degradation to one of its APIs. This impacts downstream services that consume data from the API, including products such as F1 TV, which offer live and on-demand coverage of every race as well as real-time telemetry.
While these models are trained on vast amounts of generic data, they often lack the organization-specific context and up-to-date information needed for accurate responses in business settings. The function checks the semantic cache (Amazon Bedrock Knowledge Bases) using the Retrieve API.
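For reference, a Retrieve call against a knowledge base looks roughly like the following; the knowledge base ID and query text are placeholders:

```python
import boto3

bedrock_agent = boto3.client("bedrock-agent-runtime")

response = bedrock_agent.retrieve(
    knowledgeBaseId="KB12345678",  # placeholder knowledge base ID
    retrievalQuery={"text": "What is our refund policy?"},
    retrievalConfiguration={"vectorSearchConfiguration": {"numberOfResults": 3}},
)
for result in response["retrievalResults"]:
    # Each result carries the matched chunk and a relevance score.
    print(result["score"], result["content"]["text"])
```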
They provide access to external data and APIs or enable specific actions and computation. To improve accuracy, we tested model fine-tuning, training the model on common queries and context (such as database schemas and their definitions). Before joining RDC, he served as a Lead Data Scientist at KPMG, advising clients globally.
Harnessing the power of big data has become increasingly critical for businesses looking to gain a competitive edge. However, managing the complex infrastructure required for big data workloads has traditionally been a significant challenge, often requiring specialized expertise.
Training ML algorithms for pose estimation requires a lot of expertise and custom training data. Therefore, we present two options: one that doesn’t require any ML expertise and uses Amazon Rekognition, and another that uses Amazon SageMaker to train and deploy a custom ML model.
It’s a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like Anthropic, Cohere, Meta, Mistral AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.
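The single-API point can be illustrated with the Converse API, where the same call shape works across providers and only the model ID changes; the model ID below is one example:

```python
import boto3

bedrock = boto3.client("bedrock-runtime")

response = bedrock.converse(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",  # swap in any supported FM
    messages=[{"role": "user", "content": [{"text": "Summarize Amazon Bedrock in one sentence."}]}],
    inferenceConfig={"maxTokens": 200},
)
print(response["output"]["message"]["content"][0]["text"])
```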
An example directed acyclic graph (DAG) might automate data ingestion, processing, model training, and deployment tasks, ensuring that each step runs in the correct order and at the right time. Additionally, Feast promotes feature reuse, so the time spent on data preparation is greatly reduced.
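As an illustration only (assuming Apache Airflow 2.4+ as the orchestrator, which the excerpt does not specify), a minimal DAG wiring those four stages in order, with stub task bodies:

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

def ingest(): ...
def process(): ...
def train(): ...
def deploy(): ...

with DAG("ml_workflow", start_date=datetime(2024, 1, 1), schedule="@daily", catchup=False):
    tasks = [
        PythonOperator(task_id=name, python_callable=fn)
        for name, fn in [("ingest", ingest), ("process", process), ("train", train), ("deploy", deploy)]
    ]
    # Chain the tasks so each step runs only after the previous one succeeds.
    for upstream, downstream in zip(tasks, tasks[1:]):
        upstream >> downstream
```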
This platform removes the common barriers to adopting best-in-class computer vision technologies, as organizations can train their own customized computer vision models through the VI Studio and deploy practical automation using these models across TechSee’s suite of visual service products.
JumpStart provides one-click fine-tuning and deployment of a wide variety of pre-trained models across popular ML tasks, as well as a selection of end-to-end solutions that solve common business problems. In this post, we provide a step-by-step walkthrough on how to deploy pre-trained stable diffusion models for generating images from text.
JumpStart provides one-click fine-tuning and deployment of a wide variety of pre-trained models across popular ML tasks, as well as a selection of end-to-end solutions that solve common business problems. In this post, we provide a step-by-step walkthrough on how to deploy pre-trained text generation models.
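A sketch of such a deployment with the SageMaker Python SDK's JumpStart classes; the model ID is one example, and the payload schema varies by model:

```python
from sagemaker.jumpstart.model import JumpStartModel

# Example JumpStart model ID; browse available IDs in the SageMaker SDK docs.
model = JumpStartModel(model_id="huggingface-text2text-flan-t5-large")
predictor = model.deploy()  # provisions a real-time endpoint with model defaults

# Payload shape is an assumption; consult the model's example payload.
print(predictor.predict({"inputs": "Translate to German: Hello, world."}))
predictor.delete_endpoint()  # clean up to stop incurring charges
```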
NLP SQL enables business users to analyze data and get answers by typing or speaking questions in natural language, such as “Show total sales for each product last month” or “Which products generated more revenue?” Fine-tuning directly trains the model on the end task but requires many text-to-SQL examples.
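The prompting alternative is to pass the schema as context in each request; a hypothetical prompt builder, where the schema and wording are purely illustrative:

```python
SCHEMA = "CREATE TABLE sales (product_id INT, product_name TEXT, amount DECIMAL, sold_at DATE);"

def build_text_to_sql_prompt(question: str) -> str:
    # Supplying the schema in context avoids collecting many text-to-SQL training pairs.
    return (
        "Given the schema below, write a SQL query that answers the question.\n"
        f"Schema:\n{SCHEMA}\n"
        f"Question: {question}\nSQL:"
    )

print(build_text_to_sql_prompt("Show total sales for each product last month"))
```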
An approach to product stewardship with generative AI: Large language models (LLMs) are trained with vast amounts of information crawled from the internet, capturing considerable knowledge from multiple domains. However, their knowledge is static and tied to the data used during the pre-training phase.
The Retrieve and RetrieveAndGenerate APIs allow your applications to directly query the index using a unified and standard syntax without having to learn separate APIs for each different vector database, reducing the need to write custom index queries against your vector store.
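For comparison with Retrieve, RetrieveAndGenerate also drafts an answer from the retrieved passages; the IDs and ARN below are placeholders:

```python
import boto3

bedrock_agent = boto3.client("bedrock-agent-runtime")

response = bedrock_agent.retrieve_and_generate(
    input={"text": "How do I rotate my API keys?"},
    retrieveAndGenerateConfiguration={
        "type": "KNOWLEDGE_BASE",
        "knowledgeBaseConfiguration": {
            "knowledgeBaseId": "KB12345678",  # placeholder
            "modelArn": "arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-sonnet-20240229-v1:0",
        },
    },
)
print(response["output"]["text"])
```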
Generative AI models have the potential to revolutionize enterprise operations, but businesses must carefully consider how to harness their power while overcoming challenges such as safeguarding data and ensuring the quality of AI-generated content. Amazon SageMaker enables enterprises to build, train, and deploy machine learning (ML) models.
Amazon Comprehend is a fully managed and continuously trained natural language processing (NLP) service that can extract insights about the content of a document or text. We measured accuracy by training on 800 data points and testing on 300 data points. API Gateway passes the request to Lambda.
This requires more accessible ML training that reaches a larger number of people with diverse backgrounds. You can get started with RL quickly with hands-on tutorials that guide you through the basics of training RL models and testing them in an exciting, autonomous car racing experience.
In a previous post, we discussed MLflow and how it can run on AWS and be integrated with SageMaker—in particular, when tracking training jobs as experiments and deploying a model registered in MLflow to the SageMaker managed infrastructure. This post shows how to use MLflow as a centralized repository in a multi-account setup.
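Pointing training code at a centralized tracking server takes only a few lines of MLflow; the tracking URI and experiment name are placeholders for your shared deployment:

```python
import mlflow

mlflow.set_tracking_uri("http://mlflow.example.internal:5000")  # placeholder URI
mlflow.set_experiment("sagemaker-training")

with mlflow.start_run():
    mlflow.log_param("learning_rate", 0.01)
    mlflow.log_metric("val_accuracy", 0.93)
```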
Although this example shows how to perform this for inference operations, you can extend the solution to training and other ML steps. Endpoints are deployed with a couple of clicks or a few lines of code using SageMaker, which simplifies the process for developers and ML experts to build and train ML and deep learning models in the cloud.
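The "few lines of code" version, sketched with the SageMaker Python SDK; the container image, artifact path, and role are placeholders:

```python
from sagemaker.model import Model

model = Model(
    image_uri="<ecr-inference-image-uri>",     # placeholder container image
    model_data="s3://my-bucket/model.tar.gz",  # placeholder model artifact
    role="<sagemaker-execution-role-arn>",     # placeholder IAM role
)
predictor = model.deploy(initial_instance_count=1, instance_type="ml.m5.xlarge")
```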
The phases we discuss in this post use the following key services: Amazon Comprehend Medical is a HIPAA-eligible natural language processing (NLP) service that uses machine learning (ML) models that have been pre-trained to understand and extract health data from medical text, such as prescriptions, procedures, or diagnoses.
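A minimal Comprehend Medical call extracting entities from clinical text, with a made-up example sentence:

```python
import boto3

comprehend_medical = boto3.client("comprehendmedical")

text = "Patient was prescribed 500 mg amoxicillin twice daily for 10 days."
entities = comprehend_medical.detect_entities_v2(Text=text)["Entities"]
for entity in entities:
    print(entity["Category"], entity["Type"], entity["Text"])
```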
JumpStart provides one-click fine-tuning and deployment of a wide variety of pre-trained models across popular ML tasks, as well as a selection of end-to-end solutions that solve common business problems. Fine-tune pre-trained models – JumpStart allows you to fine-tune pre-trained models with no need to write your own training algorithm.
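The no-training-script path looks roughly like this with the SDK's JumpStartEstimator; the model ID, S3 path, and channel name are assumptions:

```python
from sagemaker.jumpstart.estimator import JumpStartEstimator

estimator = JumpStartEstimator(model_id="huggingface-text2text-flan-t5-large")  # example ID
estimator.fit({"training": "s3://my-bucket/fine-tuning-data/"})  # placeholder dataset
predictor = estimator.deploy()
```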
You simply provide images with the appropriate labels, train the model, and deploy it, without having to build and fine-tune the model yourself. In this post, we show how to label, train, and build a computer vision model to detect rooftops and solar panels from satellite images. Use Amazon Rekognition to train the model with custom labels.
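Once a custom-labels model version is trained and running, inference is a single call; the project version ARN and image location below are placeholders:

```python
import boto3

rekognition = boto3.client("rekognition")

response = rekognition.detect_custom_labels(
    ProjectVersionArn="arn:aws:rekognition:us-east-1:111122223333:project/rooftops/version/1",
    Image={"S3Object": {"Bucket": "my-bucket", "Name": "satellite/tile-042.png"}},
    MinConfidence=70,
)
for label in response["CustomLabels"]:
    print(label["Name"], label["Confidence"])
```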
In the era of big data and AI, companies are continually seeking ways to use these technologies to gain a competitive edge. At the core of these cutting-edge solutions lies a foundation model (FM), a highly advanced machine learning model that is pre-trained on vast amounts of data.
As feature data grows in size and complexity, data scientists need to be able to efficiently query these feature stores to extract datasets for experimentation, model training, and batch scoring. The offline store is primarily used for batch predictions and model training.
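Extracting a dataset from the offline store typically goes through Athena; a sketch with the SageMaker SDK, where the feature group name and S3 output location are placeholders:

```python
import sagemaker
from sagemaker.feature_store.feature_group import FeatureGroup

session = sagemaker.Session()
feature_group = FeatureGroup(name="customer-features", sagemaker_session=session)

query = feature_group.athena_query()  # the offline store is queryable as an Athena table
query.run(
    query_string=f'SELECT * FROM "{query.table_name}" LIMIT 100',
    output_location="s3://my-bucket/athena-results/",  # placeholder
)
query.wait()
df = query.as_dataframe()
```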
You can now use cross-account support for Amazon SageMaker Pipelines to share pipeline entities across AWS accounts and access shared pipelines directly through Amazon SageMaker API calls. The data scientist is now able to describe and monitor the test pipeline run status using SageMaker API calls from the dev account.
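Monitoring a shared pipeline from the dev account might look like the following, referencing the shared pipeline by its ARN (all values are placeholders):

```python
import boto3

sm = boto3.client("sagemaker")

# For cross-account access, reference the shared pipeline by its ARN.
pipeline_arn = "arn:aws:sagemaker:us-east-1:444455556666:pipeline/test-pipeline"

for summary in sm.list_pipeline_executions(PipelineName=pipeline_arn)["PipelineExecutionSummaries"]:
    execution = sm.describe_pipeline_execution(PipelineExecutionArn=summary["PipelineExecutionArn"])
    print(summary["PipelineExecutionArn"], execution["PipelineExecutionStatus"])
```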
We also detail the steps that data scientists can take to configure the data flow, analyze the data quality, and add data transformations. Finally, we show how to export the data flow and train a model using SageMaker Autopilot.
The SageMaker Canvas UI lets you seamlessly integrate data sources from the cloud or on-premises, merge datasets effortlessly, train precise models, and make predictions with emerging data—all without coding. Solution overview: Users persist their transactional time series data in MongoDB Atlas. Note that we have two folders.
It stores the history of ML features in the offline store (Amazon S3) and also provides APIs to an online store to allow low-latency reads of the most recent features. SageMaker lets you train and upload ML models and host them by creating and configuring SageMaker endpoints.
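Low-latency reads from the online store go through the feature store runtime; the feature group name and record identifier below are placeholders:

```python
import boto3

featurestore_runtime = boto3.client("sagemaker-featurestore-runtime")

record = featurestore_runtime.get_record(
    FeatureGroupName="customer-features",  # placeholder feature group
    RecordIdentifierValueAsString="customer-123",
)
# Flatten the record into a name -> value mapping for downstream inference code.
features = {f["FeatureName"]: f["ValueAsString"] for f in record["Record"]}
```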
Amazon SageMaker Studio is a web-based integrated development environment (IDE) for machine learning (ML) that lets you build, train, debug, deploy, and monitor your ML models. When the AD user is assigned to an AD group, an IAM Identity Center API ( CreateGroupMembership ) is invoked, and SSO group membership is created.
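That membership call, issued directly against the Identity Store API, looks like this (the identity store, group, and user IDs are placeholders):

```python
import boto3

identitystore = boto3.client("identitystore")

identitystore.create_group_membership(
    IdentityStoreId="d-1234567890",  # placeholder identity store ID
    GroupId="<group-id>",
    MemberId={"UserId": "<user-id>"},
)
```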
In addition, features such as predictor retraining can reduce training time and cost by up to 50%. By separating popular items from unpopular ones and training separate predictors, we found that the predictors can better fit the different statistical distributions in the dataset and enhance model accuracy.
Pre-trained models and fully managed NLP services have democratized access to and adoption of NLP. Amazon Comprehend is a fully managed service that can perform NLP tasks like custom entity recognition, topic modeling, sentiment analysis, and more to extract insights from data without requiring any prior ML experience.
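For instance, entity and sentiment extraction are each a single call, shown here with a made-up input sentence:

```python
import boto3

comprehend = boto3.client("comprehend")

text = "Amazon Comprehend makes it easy to analyze customer feedback from Seattle."

entities = comprehend.detect_entities(Text=text, LanguageCode="en")["Entities"]
sentiment = comprehend.detect_sentiment(Text=text, LanguageCode="en")["Sentiment"]
print(sentiment, [(e["Type"], e["Text"]) for e in entities])
```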
As enterprise businesses embrace machine learning (ML) across their organizations, manual workflows for building, training, and deploying ML models tend to become bottlenecks to innovation. Initial phase: During this phase, the data scientists are able to experiment and build, train, and deploy models on AWS using SageMaker services.
As shown in the preceding figure, the ML paradigm is learning (training) followed by inference. In this example figure, features are extracted from raw historical data, which are then fed into a neural network (NN). Second, there was a massive increase in the volume of labeled data available for training.
Feature Store lets you define groups of features, use batch ingestion and streaming ingestion, retrieve the latest feature values with single-digit millisecond latency for highly accurate online predictions, and extract point-in-time correct datasets for training. Model training and deployment – This aspect of our solution is straightforward.
Then we train, build, test, and deploy the model using SageMaker Canvas, without writing any code. Solution overview: SageMaker Canvas brings together a broad set of capabilities to help data professionals prepare, build, train, and deploy ML models without writing any code. For Training method, select Auto.
Yaoqi Zhang is a Senior Big Data Engineer at Mission Cloud. Adrian Martin is a Big Data/Machine Learning Lead Engineer at Mission Cloud. Her current areas of interest include federated learning, distributed training, and generative AI. He has extensive experience in English/Spanish interpretation and translation.
However, these models require massive amounts of clean, structured training data to reach their full potential. Most real-world data exists in unstructured formats like PDFs, which require preprocessing before they can be used effectively. According to IDC, unstructured data accounts for over 80% of all business data today.
This emergent ability in LLMs has compelled software developers to use LLMs as an automation and UX enhancement tool that transforms natural language to a domain-specific language (DSL): system instructions, API requests, code artifacts, and more.
SageMaker Python SDK is used to create or update SageMaker pipelines for training, training with hyperparameter optimization (HPO), and batch inference. SageMaker Pipelines serves as the orchestrator for ML model training and inference workflows.
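A sketch of the create-or-update flow; note that training_step, hpo_step, and batch_inference_step stand in for fully constructed pipeline steps (TrainingStep, TuningStep, TransformStep), which are assumed to be defined elsewhere:

```python
from sagemaker.workflow.pipeline import Pipeline

# training_step, hpo_step, and batch_inference_step are assumed to be
# already-constructed SageMaker pipeline step objects.
pipeline = Pipeline(name="ml-workflow", steps=[training_step, hpo_step, batch_inference_step])

pipeline.upsert(role_arn="<execution-role-arn>")  # create or update the pipeline definition
execution = pipeline.start()
```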
Model cards enable you to standardize how models are documented, giving you visibility into the lifecycle of a model across design, build, training, and evaluation. Auto-populate model cards for SageMaker-trained models. Upload and share model and data evaluation results.
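Creating a card programmatically is one API call; the card name and minimal content below are illustrative (the full content schema also covers intended uses, training details, and evaluation results):

```python
import json

import boto3

sm = boto3.client("sagemaker")

# Minimal card content; the full schema supports much richer documentation.
content = {"model_overview": {"model_description": "Churn classifier trained in SageMaker."}}

sm.create_model_card(
    ModelCardName="churn-classifier-card",  # placeholder name
    Content=json.dumps(content),
    ModelCardStatus="Draft",
)
```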
Capacity: We can think about capacity in two contexts: inference and training model data pipelines. Because the interface between agents and tools is less formally defined than an API contract, you should monitor these traces not only for performance but also to capture new error scenarios.
Amazon SageMaker Studio is a web-based integrated development environment (IDE) for machine learning (ML) that lets you build, train, debug, deploy, and monitor your ML models. The solution also uses SAML attribute mapping to populate the SAML assertion with specific access-relevant data, such as user ID and user team.