Building a Reliable Form-Filling Agent
A practical case study: log into a portal, fill a multi-page form, and verify the submission.
A Realistic Project
A "fill this multi-page form" agent. We'll combine Playwright + LLM planning + retry + vision fallback.
Architecture
- LLM reads the field requirements from a JSON spec
- Playwright opens the form
- For each field, find selector + value, fill, validate
- If a field is hard to locate via DOM, fall back to vision
- Submit the form, verify success
All lessons in this course
- Browser Automation with Playwright
- Vision Models for Screen Understanding
- Computer-Use Patterns (Anthropic Computer-Use)
- Building a Reliable Form-Filling Agent