This repository is part of the MLOps course at USFQ. To work with ML pipelines and their deployment to production, we will use BentoML, an open-source ML model serving framework that helps developers package, deploy, and serve ML models in production easily and efficiently.
First of all, the student must create a BentoML account at the following URL: https://www.bentoml.com/
Create a Python environment using virtualenv:

```shell
virtualenv -p python3.10 myvenv310
```

Create a Python environment using conda:
```shell
conda create --name myvenv310 python=3.10
```

To install the Python libraries:
```shell
pip install bentoml mlflow scikit-learn
```

Log in to BentoML from your local computer:
```shell
bentoml cloud login
```

To verify the model is saved to the Model Store:
```shell
bentoml models list
```

To start the MLflow tracking server:
```shell
mlflow server --host 127.0.0.1 --port 8080
```

Build the Docker image:
```shell
docker build -t mlflow-bento:latest .
```

Run the container interactively with a local volume mounted (to persist data):
```shell
docker run -it --name mlflow_bento \
  -p 5000:5000 -p 3000:3000 -p 8888:8888 \
  -v $(pwd):/app \
  mlflow-bento:latest
```

This drops you into the container's shell (/bin/bash), where you can run:
```shell
python my_script.py
mlflow ui --host 0.0.0.0
bentoml serve service:svc --host 0.0.0.0
```

To serve the model using the BentoML CLI:
```shell
bentoml serve 04_bentoml_service.py:IrisClassifier --port=3001
```

Make requests with curl:
```shell
curl -X 'POST' 'http://localhost:3000/predict' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{"input_data": [[5.9, 3.0, 5.1, 1.8]]}'
```

For the advanced service on port 3002, query the versioned and combined endpoints:

```shell
curl -X 'POST' 'http://localhost:3002/v1/predict' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{"input_data": [[5.9, 3.0, 5.1, 1.8]]}'
```

```shell
curl -X 'POST' 'http://localhost:3002/v2/predict' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{"input_data": [[5.9, 3.0, 5.1, 1.8]]}'
```

```shell
curl -X 'POST' 'http://localhost:3002/predict_combined/predict' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{"input_data": [[5.9, 3.0, 5.1, 1.8]]}'
```

To inspect the OpenAPI documentation and see the required schema for your service:
```shell
curl localhost:3000/docs.json
```

To run a server that batches requests:
```shell
bentoml serve 07_bentoml_service_advanced.py:IrisClassifier --port=3002
```

To run two endpoints and an ensemble prediction:
```shell
bentoml serve 09_bentoml_service_multiple.py:IrisClassifier --port=3003
```

Containerization: build an OCI-compliant image of your ML service for deployment on any container platform:
```shell
bentoml build
```

BentoML provides multiple options for production deployment. Next steps:
- Deploy to BentoCloud:

  ```shell
  bentoml deploy iris_classifier:nd46dyf6kkbzr5oe -n ${DEPLOYMENT_NAME}
  ```

- Update an existing deployment on BentoCloud:

  ```shell
  bentoml deployment update --bento iris_classifier:mmd2rarxb6fexe65 ${DEPLOYMENT_NAME}
  ```

- Containerize your Bento with `bentoml containerize`:

  ```shell
  bentoml containerize iris_classifier:mmd2rarxb6fexe65
  ```

- Push to BentoCloud with `bentoml push`:

  ```shell
  bentoml push iris_classifier:mmd2rarxb6fexe65
  ```
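
The curl requests shown earlier can also be made from Python. Below is a minimal sketch using only the standard library; the endpoint path, port, and `input_data` payload shape mirror the `/predict` examples in this README, but adjust them to whichever service you are running:

```python
import json
from urllib import request


def build_payload(rows):
    """Encode the rows as the JSON body the /predict endpoint expects."""
    return json.dumps({"input_data": rows}).encode("utf-8")


def predict(rows, url="http://localhost:3000/predict"):
    """POST a prediction request to a running BentoML service and return the parsed JSON response."""
    req = request.Request(
        url,
        data=build_payload(rows),
        headers={"accept": "application/json", "Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Requires a service already running on port 3000, e.g.:
# predict([[5.9, 3.0, 5.1, 1.8]])
```

This is equivalent to the curl commands above and can be handy inside notebooks or test scripts in the container.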