Define a Pipeline with DVC Stages
Wire steps together in dvc.yaml with dependencies.
Meet dvc.yaml
DVC describes your whole pipeline in one file called dvc.yaml. It lists each stage, the command it runs, and what it depends on. 📄
Add a Stage with dvc stage add
You rarely hand-write the file. The dvc stage add command builds a stage entry for you from flags you pass on the command line.
dvc stage add -n prep \
-d data/raw.parquet -o data/prepared \
python prep.pyAll lessons in this course
- Stages: Ingest, Prep, Train, Eval
- Define a Pipeline with DVC Stages
- Cache and Skip Unchanged Steps
- Parameterize Runs with params.yaml