Data Collection: Trajectories and Trace Replay
Replay successful agent traces to build a training set of (state, action) pairs.
Data Is the Product
Fine-tuning is 10% modelling, 90% data. The biggest quality improvements come from better datasets, not bigger models.
What an Agent Trajectory Looks Like
A trajectory captures one complete agent run:
trajectory = {
'task': 'Refactor the auth module',
'messages': [
{'role': 'system', 'content': '...'},
{'role': 'user', 'content': 'Task...'},
{'role': 'assistant', 'content': 'Plan...', 'tool_calls': [...]},
{'role': 'tool', 'content': '...'},
...
],
'outcome': 'success'
}All lessons in this course
- When Fine-Tuning Beats Prompting
- Data Collection: Trajectories and Trace Replay
- LoRA and QLoRA for Cost-Efficient Tuning
- Evaluating Tuned Models vs Base