0Pricing
NLP Academy · Lesson

Picking the Right Pre-trained Model

BERT, RoBERTa, DistilBERT, and more.

A Whole Family

BERT sparked a family of related models. Each one tweaks the recipe to trade speed, size, or accuracy differently.

RoBERTa: Trained Harder

RoBERTa keeps BERT's design but trains longer on more data and drops one task. The payoff is higher accuracy.

All lessons in this course

  1. Why Context Changes Word Meaning
  2. Masked Language Modeling
  3. Embedding Sentences With BERT
  4. Picking the Right Pre-trained Model
← Back to NLP Academy