Professional AI Project Directory Structure
data/, notebooks/, src/, models/, tests/ layout, cookiecutter-data-science pattern.
Why Structure Matters
A pile of notebooks named final.ipynb, final2.ipynb, and really_final.ipynb is how AI projects die. A consistent directory structure makes projects readable, reproducible, and collaboration-friendly.
This lesson covers the widely-used Cookiecutter Data Science layout.
The data Folder
Separate data by processing stage so raw inputs are never overwritten:
data/raw/— original, immutable source datadata/processed/— cleaned, model-ready datadata/interim/— intermediate transformations
Treat data/raw as read-only — you should always be able to regenerate everything else from it.
All lessons in this course
- Professional AI Project Directory Structure
- Git for AI Projects
- Reproducibility: Seeds, Configs, and Environments
- Jupyter Notebooks Best Practices