CatBoost#

CatBoost is a machine learning algorithm that uses gradient boosting on decision trees. It is available as an open source library. To learn more about CatBoost, visit their documentation.

BentoML provides native support for CatBoost, and this guide provides an overview of how to use BentoML with CatBoost.

Saving a trained CatBoost model#

In this example, we will train a new model using UCI’s breast cancer dataset.

import bentoml

import catboost as cbt

from sklearn.datasets import load_breast_cancer

cancer = load_breast_cancer()

X = cancer.data
y = cancer.target

model = cbt.CatBoostClassifier(
    iterations=2,
    depth=2,
    learning_rate=1,
    loss_function="Logloss",
    verbose=False,
)

# train the model
model.fit(X, y)

Use save_model to save the model instance to BentoML model store:

bento_model = bentoml.catboost.save_model("catboost_cancer_clf", model)

To verify that the saved learner can be loaded properly:

model = bentoml.catboost.load_model("catboost_cancer_clf:latest")

model.predict(cbt.Pool([[1.308e+01, 1.571e+01, 8.563e+01, 5.200e+02, 1.075e-01, 1.270e-01,
    4.568e-02, 3.110e-02, 1.967e-01, 6.811e-02, 1.852e-01, 7.477e-01,
    1.383e+00, 1.467e+01, 4.097e-03, 1.898e-02, 1.698e-02, 6.490e-03,
    1.678e-02, 2.425e-03, 1.450e+01, 2.049e+01, 9.609e+01, 6.305e+02,
    1.312e-01, 2.776e-01, 1.890e-01, 7.283e-02, 3.184e-01, 8.183e-02]]))

Building a Service using CatBoost#

Using Runners#

Using GPU#

CatBoost Runners will automatically use task_type=GPU if a GPU is detected.

This behavior can be disabled using the BentoML configuration file:

access:

runners:
   # resources can be configured at the top level
   resources:
      nvidia.com/gpu: 0
   # or per runner
   my_runner_name:
      resources:
          nvidia.com/gpu: 0

CatBoost#

Saving a trained CatBoost model#

Building a Service using CatBoost#

Using Runners#

Using GPU#

Adaptive batching#