Deployment#

As the standard distribution format in the BentoML ecosystem, Bentos can be deployed in a variety of ways. In essence, all of these deployment strategies rely on containerizing the Bento under the hood.

This page explains different Bento deployment strategies.

BentoCloud#

BentoCloud is a fully-managed platform designed for building and operating AI applications. It provides comprehensive solutions to the deployment, scalability, and collaboration challenges in the AI application delivery lifecycle. Because BentoCloud manages the underlying infrastructure for you, you can focus solely on developing AI applications. BentoCloud is currently available for early access with two plans: Starter and Enterprise. See the BentoCloud documentation to learn more.

To deploy a Bento on BentoCloud:

  1. Create an API token with Developer Operations Access on BentoCloud.

  2. Log in to BentoCloud with the token.

  3. Push the Bento to BentoCloud using bentoml push.

  4. Deploy the Bento via the BentoCloud console. Alternatively, create a Deployment configuration file in JSON and use the BentoML CLI (bentoml deployment create --file <file_name>.json) to deploy it.
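Put together, the CLI portion of these steps might look like the following sketch. The token, Bento tag, and configuration file name (iris_classifier:latest, deployment.json) are placeholders for illustration; the schema of the configuration file is covered in Deploy Bentos:

    # Log in with the API token created on BentoCloud
    bentoml cloud login --api-token <your_api_token>

    # Push a local Bento to BentoCloud
    bentoml push iris_classifier:latest

    # Deploy using a JSON configuration file (schema described in Deploy Bentos)
    bentoml deployment create --file deployment.json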

For details, see Quickstart and Deploy Bentos.

Docker#

When a Bento is built, BentoML automatically creates a Dockerfile within the Bento. This allows you to containerize the Bento as a Docker image, which is useful for testing out the Bento’s environment and dependency configurations locally.

To containerize a Bento:

  1. Make sure you have installed Docker.

  2. Run bentoml containerize BENTO_TAG to start the containerization process. You can use bentoml list to view available Bentos locally.

    Note

    If you are using a Mac computer with Apple silicon, you can pass --opt platform=linux/amd64 to avoid potential compatibility issues with some Python libraries.

    bentoml containerize --opt platform=linux/amd64 BENTO_TAG
    
  3. View the built Docker image by running docker images.

  4. Start a container from the image by running docker run -p 3000:3000 IMAGE_TAG. Note that 3000 is the default port of the Bento server.
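As an end-to-end illustration, assuming a local Bento tagged iris_classifier:latest (a hypothetical tag), the flow above might look like this:

    # Build a Docker image from the Bento
    bentoml containerize iris_classifier:latest

    # Confirm the image was created
    docker images

    # Start a container and expose the default Bento server port
    docker run --rm -p 3000:3000 iris_classifier:latest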

See also

Starting from version 1.0.11, BentoML supports multiple container engines in addition to Docker. See Containerization with different container engines for more details on the containerization process.
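For example, a different engine can be selected with the --backend option. This sketch assumes Podman is installed and reuses the hypothetical Bento tag from above:

    # Build the image with Podman instead of Docker
    bentoml containerize --backend podman iris_classifier:latest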

With the image ready, you can deploy it to any Docker-compatible environment, including but not limited to the following:

  • Container orchestration systems: Kubernetes, Docker Swarm, Red Hat OpenShift, Apache Mesos, and Nomad.

  • Managed container services: Amazon ECS, Azure Container Instances, and Google Cloud Run.

  • Yatai: Yatai is the open-source Kubernetes deployment operator for BentoML. DevOps teams can seamlessly integrate BentoML into their GitOps workflow to deploy and scale ML services on Kubernetes. Yatai contains a subset of scalability features offered by BentoCloud.
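As a minimal sketch of the Kubernetes path, you might push the image to a registry and create a Deployment with kubectl. The registry host (registry.example.com) and resource names here are placeholders:

    # Tag and push the Bento image to a container registry
    docker tag iris_classifier:latest registry.example.com/iris_classifier:latest
    docker push registry.example.com/iris_classifier:latest

    # Create a Kubernetes Deployment and expose the Bento server port
    kubectl create deployment iris-classifier --image=registry.example.com/iris_classifier:latest
    kubectl expose deployment iris-classifier --port=3000 --target-port=3000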

Important

When using the above strategies for production deployment, we recommend that you consider the following factors for better performance, scalability, observability, resource utilization, and cost efficiency.