Define a Service and Its API
Expose runners through a typed service endpoint.
What a Service Is
In BentoML, a Service is the unit you deploy. It wraps your model and exposes one or more API endpoints that clients can call. 🛎️
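To make "endpoints clients can call" concrete, here is a minimal sketch of how a client might build a request to such an endpoint. It assumes the service is running locally on port 3000 (BentoML's default) and exposes an endpoint named `classify` that accepts a JSON array of feature rows. The endpoint name, the helper `build_classify_request`, and the sample features are hypothetical, chosen only for illustration.

```python
import json
import urllib.request


def build_classify_request(features, base_url="http://localhost:3000"):
    """Build a POST request for a hypothetical `classify` endpoint.

    `features` is a list of feature rows, sent as a JSON array.
    """
    body = json.dumps(features).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/classify",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# One iris-like sample with four features (illustrative values).
request = build_classify_request([[5.1, 3.5, 1.4, 0.2]])

# Sending the request needs a running service, so it is left commented out:
# with urllib.request.urlopen(request) as response:
#     prediction = json.loads(response.read())
```

The request is only constructed here, not sent, so the snippet runs without a server. Uncomment the last lines once a service is being served.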
Runners Run the Model
A runner is BentoML's optimized wrapper around a saved model. It runs inference in its own worker, separate from the API server, so the API stays responsive.
runner = bentoml.sklearn.get("iris_clf:latest").to_runner()
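With a runner in hand, a Service with a typed endpoint might be wired up as in the sketch below. It assumes the runner-style API of BentoML 1.x (`bentoml.Service`, `svc.api`, and `bentoml.io.NumpyNdarray`) and that a model tagged `iris_clf` already exists in the local store. The service name `iris_classifier`, the endpoint name `classify`, and the wrapper function `build_service` are hypothetical. The imports sit inside the function only so that the snippet can be loaded without BentoML installed.

```python
def build_service(model_tag="iris_clf:latest"):
    """Create a Service exposing one typed endpoint (illustrative sketch)."""
    # Imported lazily; assumes the BentoML 1.x runner-style API.
    import bentoml
    from bentoml.io import NumpyNdarray

    # Load the saved model from the local store and wrap it in a runner.
    runner = bentoml.sklearn.get(model_tag).to_runner()

    # The Service is the deployable unit; it must be told about its runners.
    svc = bentoml.Service("iris_classifier", runners=[runner])

    # The IO descriptors declare the input and output types of the endpoint.
    @svc.api(input=NumpyNdarray(), output=NumpyNdarray())
    def classify(features):
        # Delegate inference to the runner's worker.
        return runner.predict.run(features)

    return svc
```

In a real `service.py` you would more likely define `svc` at module level, for example `svc = build_service()`, so that a command such as `bentoml serve service:svc` can find it.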