Remove Biotechnology Remove Management Remove Scripts
article thumbnail

Pre-training genomic language models using AWS HealthOmics and Amazon SageMaker

AWS Machine Learning

Here, we use AWS HealthOmics storage as a convenient and cost-effective omic data store and Amazon Sagemaker as a fully managed machine learning (ML) service to train and deploy the model. All of this is delivered by HealthOmics, removing the burden of managing compression, tiering, metadata, and file organization from customers.

article thumbnail

Efficiently fine-tune the ESM-2 protein language model with Amazon SageMaker

AWS Machine Learning

In the following sections, we go through the steps to prepare your training data, create a training script, and run a SageMaker training job. save_to_disk(test_s3_uri) Create a training script SageMaker script mode allows you to run your custom training code in optimized machine learning (ML) framework containers managed by AWS.

Scripts 131
article thumbnail

Accelerate protein structure prediction with the ESMFold language model on Amazon SageMaker

AWS Machine Learning

This post provides an example Jupyter notebook and related scripts in the following GitHub repository. script to load the model, run the prediction, and format the output. This script includes much of the same code we used in our notebook. Yanjun Qi is a Senior Applied Science Manager at the AWS Machine Learning Solution Lab.