AI Agents · Lesson

Data Collection: Trajectories and Trace Replay

Replay successful agent traces to build a training set of (state, action) pairs.

Data Is the Product

Fine-tuning is 10% modelling, 90% data. The biggest quality improvements come from better datasets, not bigger models.

What an Agent Trajectory Looks Like

A trajectory captures one complete agent run:

trajectory = {
    'task': 'Refactor the auth module',
    'messages': [
        {'role': 'system', 'content': '...'},
        {'role': 'user', 'content': 'Task...'},
        {'role': 'assistant', 'content': 'Plan...', 'tool_calls': [...]},
        {'role': 'tool', 'content': '...'},
        ...
    ],
    'outcome': 'success'
}

All lessons in this course

When Fine-Tuning Beats Prompting
Data Collection: Trajectories and Trace Replay
LoRA and QLoRA for Cost-Efficient Tuning
Evaluating Tuned Models vs Base

← Back to AI Agents