AI-Accelerated Product Development
Learning Transformers by Creating Transformers
Total time needed:
Learn how to create the original transformers from the Attention Is All You Need paper
Potential Use Cases
This shortlist is for educational uses, it will help you better understand how future transformer-based models like BERT and GPT2 work
Who is This For ?
Data Scientists learning NLP
Click on each of the following
to see details.
1. Attention Is All You Need
What is an introductory overview of Attention Is All You Need in video format?
2. Transformer — Attention is all you need
What is an accessible overview of Attention Is All You Need in article form?
3. The Illustrated Transformer
What is a more through explanation of the theory behind Transformers?
4. TRANSFORMERS FROM SCRATCH
How do I start to implement and understand the code for transformers?
5. Attention is all you need: Discovering the Transformer paper
What is another example implementation of the Attention Is All You Need paper?
6. Pytorch Transformers from Scratch (Attention is all you need)
What is one more implementation of the Attention Is All You Need paper?
7. The Annotated Transformer
What is a more thorough explanation of the theory and implementation of transformers?