MLOps Academy · Lesson

Initialize DVC and Track a Dataset

Run dvc add and commit the pointer, not the data.

Install DVC First

DVC is a Python package, so a single pip command gets you started. After this you have the dvc command available in your terminal.

pip install dvc

Start Inside a Git Repo

DVC works alongside Git, not instead of it. So you run it inside an existing Git repository where your code already lives.

git init
dvc init

All lessons in this course

  1. Why Git Alone Cannot Version Data
  2. Initialize DVC and Track a Dataset
  3. Push Data to Remote Storage
  4. Roll Back to an Earlier Dataset
← Back to MLOps Academy