The InferenceService Resource
Declare a model deployment in a single manifest.
Meet KServe
KServe turns a trained model into a production endpoint on Kubernetes. You describe what you want, and it handles serving, scaling, and routing for you. KServe is your serving layer. 🚀
One Resource to Rule Them
Instead of writing raw Deployments and Services by hand, you create a single custom resource. The InferenceService is the one object that defines your whole serving setup.
All lessons in this course
- The InferenceService Resource
- Scale to Zero and Back Up
- Write a Custom Predictor
- KServe vs Seldon Core