MLOps Academy · Lesson

Define a Service and Its API

Expose runners through a typed service endpoint.

What a Service Is

In BentoML, a Service is the unit you deploy. It wraps your model and exposes one or more API endpoints that clients can call. 🛎️
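
To make "endpoints that clients can call" concrete, here is a minimal client-side sketch using only Python's standard library. It assumes a Service running locally on port 3000 with an endpoint named `classify` that accepts a JSON array of feature rows. The URL, the endpoint name, the payload shape, and the helper names `build_request` and `call_endpoint` are illustrative assumptions, not something this lesson defines.

```python
import json
import urllib.request

# Assumed for illustration: a locally served Service with an endpoint named "classify".
SERVICE_URL = "http://localhost:3000/classify"


def build_request(rows, url=SERVICE_URL):
    """Build a POST request carrying feature rows as a JSON array."""
    body = json.dumps(rows).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def call_endpoint(rows, url=SERVICE_URL, timeout=10.0):
    """Send the rows to the endpoint and return the decoded JSON response."""
    with urllib.request.urlopen(build_request(rows, url), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# One iris sample: sepal length, sepal width, petal length, petal width (cm).
req = build_request([[5.1, 3.5, 1.4, 0.2]])
```

Building the request does not touch the network. Calling `call_endpoint([[5.1, 3.5, 1.4, 0.2]])` would send it, which only works once a Service is actually running at that address.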

Runners Run the Model

A runner is BentoML's optimized wrapper around a saved model. It runs inference in its own worker process, separate from the API server, so the API stays responsive while predictions are computed.

import bentoml
runner = bentoml.sklearn.get("iris_clf:latest").to_runner()  # load the saved model from the Bento store as a runner
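
A Service can then register the runner and expose it through an endpoint whose input and output types are declared up front. The sketch below assumes the runner-based BentoML 1.x API (`bentoml.Service`, `@svc.api`, and the `bentoml.io` descriptors). The service name `iris_classifier`, the endpoint name `classify`, and the helpers `run_inference` and `create_service` are illustrative choices, not names fixed by this lesson. The BentoML imports sit inside the helper only so that the endpoint logic can be exercised without BentoML installed.

```python
def run_inference(runner, features):
    """Endpoint logic: delegate the prediction to the runner's worker."""
    return runner.predict.run(features)


def create_service(model_tag="iris_clf:latest"):
    """Wire a runner into a Service with one typed endpoint (assumes BentoML 1.x)."""
    import bentoml
    from bentoml.io import NumpyNdarray

    runner = bentoml.sklearn.get(model_tag).to_runner()
    # Registering the runner lets BentoML start and manage its worker.
    svc = bentoml.Service("iris_classifier", runners=[runner])

    # The descriptors declare what the endpoint accepts and returns:
    # here, NumPy arrays in and NumPy arrays out.
    @svc.api(input=NumpyNdarray(), output=NumpyNdarray())
    def classify(features):
        return run_inference(runner, features)

    return svc


# In a real service.py, the Service object would live at module level:
# svc = create_service()
```

Keeping `run_inference` separate from the wiring is a design choice for this sketch: it can be checked with a stub runner, while `create_service` needs BentoML and a saved `iris_clf` model in the Bento store.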
