Unified AI Application Framework#

github_stars pypi_status actions_status documentation_status join_slack


BentoML is a framework for building reliable, scalable and cost-efficient AI applications. It comes with everything you need for model serving, application packaging, and production deployment.

Start your BentoML journey#

The BentoML documentation provides detailed guidance on the project with hands-on tutorials and examples. If you are a first-time user of BentoML, we recommend that you read the following documents in order:

  1. What is BentoML?

  2. Ecosystem

  3. Install BentoML

  4. Deploy a Transformer model with BentoML

  5. Deploy a large language model with OpenLLM and BentoML

Learn BentoML#

Gain a basic understanding of the BentoML open-source framework, its workflow, and the BentoML ecosystem.

Hands-on tutorials that help you quickly get started with BentoML by deploying AI applications with common machine learning (ML) models.

A step-by-step tour of BentoML’s components and introduce you to its philosophy. After reading, you will see what drives BentoML’s design, and know what Bentos and Runners stand for.

Best practices and example usages by the ML framework used for building your model.

Example projects demonstrating BentoML usage in a variety of different scenarios.

Dive into BentoML’s advanced features, internals, and architecture, including GPU support, inference graph, monitoring, and performance optimization.

Learn how BentoML works together with other tools and products in the Data/ML ecosystem.

Fully managed platform for deploying and scaling BentoML in the cloud.

Join us in our Slack community where thousands of AI application developers are contributing to the project and helping each other.

Stay informed#

The BentoML team uses the following channels to announce important updates like major product releases and share tutorials, case studies, as well as community news.

To receive release notifications, star and watch the BentoML project on GitHub. For release notes and detailed changelogs, see the Releases page.