Your Web News in One Place

Help Webnuz

Referal links:

Sign up for GreenGeeks web hosting
October 2, 2022 11:33 am GMT

SageMaker Lifecycle Configurations for custom environments

Preface

Amazon SageMaker is great for us data scientists and machine learning engineers for exploring data and building models. The range of preinstalled packages is great and enough for normal scenarios. But for having new and specific versions of packages "pip" or "conda" managers can be used.

What this blog will give you?

A template to bend and flex according to your requirements and get lifecycle configurations done fast.

The Problem

The sole issue is every time I turn the notebook instance off, the libraries installed manually are lost. This becomes a routine every time you start the instance. I would very much like to have my environment and libraries setup as I start instances.

Preferences

  1. I like to get my new environment started with a fresh python installation using conda. Here's a one-liner for that:

conda create -n <env_name> python=<python version>

conda create -n dev_env python=3.9

  1. Then we can activate the environment as:

conda activate <env_name>

conda activate dev_env

  1. After getting into the environment I like to install packages using the pip manager.

pip install <package name>

pip install pyarrow

Let's put this in a lifecycle configuration shell script, shall we!

In the following script we will take care of following tasks:

  1. Creation of environment with desired version of python. In this case python 3.9
  2. Installation of required libraries with pip manager. In this case pandas, pyarrow, and TensorFlow.
  3. Make newly created environment accessible for notebooks.Please read the comments in the script.
#!/bin/bashset -e# use ec2-user for operationssudo -u ec2-user -i <<'EOF'# environment creationconda create -n dev_env python=3.9 -ysource activate dev_env# library installationpip install pandaspip install pyarrowpip install tensorflow==2.9# make new environment accessibleconda install ipykernel -ysource deactivate

Script placement

1. Locate lifecycle configurations in console
Lifecycle configuration in SageMaker console

2. Create lifecycle configuration to run on notebook start
Lifecycle configuration creation

3. Add lifecycle configuration to notebook instance configurations
SageMaker notebook instance lifecycle configuration

Results

Kernel Selection!
See the kernel listed in the dropdown
Kernel dropdown check

Couple of checks on terminal
And we have what we need!

Terminal environment check

There we have it , let me know if it helped anyone. If anyone knows a better method please comment and let me know! This is only one example of endless applications of lifecycle configurations. You can access them here: SageMaker Lifecycle Configuration scripts


Original Link: https://dev.to/aws-builders/sagemaker-lifecycle-configurations-for-custom-environments-51k6

Share this article:    Share on Facebook
View Full Article

Dev To

An online community for sharing and discovering great ideas, having debates, and making friends

More About this Source Visit Dev To