This project visualizes the inner workings of very small transformers (fewer than 100 parameters so far).
The embedding dimension is 2 so the internal state can be drawn as a 2D vector, and the vocabulary is small enough that all possible inputs can be drawn simultaneously.
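For a sense of scale, a sub-100-parameter budget is plausible with an embedding dimension of 2. The sizes below (vocabulary of 5, single attention head without biases, MLP hidden width of 4, untied unembedding) are illustrative assumptions, not the project's actual configuration:

```python
# Parameter count of a hypothetical minimal transformer block (assumed sizes).
V, D, H = 5, 2, 4   # vocab size, embedding dim, MLP hidden width

embed   = V * D                      # token embedding table
attn    = 4 * D * D                  # W_Q, W_K, W_V, W_O, no biases
mlp     = (D * H + H) + (H * D + D)  # two linear layers with biases
unembed = D * V                      # output projection (untied)

total = embed + attn + mlp + unembed
print(total)  # → 58, comfortably under 100
```

Even with an untied unembedding and MLP biases, the whole model fits in 58 parameters here; most of the budget goes to the MLP.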
Videos are created with the manim library (https://www.manim.community/).
I'm currently studying how completely random data is memorized, particularly in the MLP layers.
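The memorization setup can be sketched as follows: assign a purely random binary label to each token and train a tiny embedding-plus-MLP model until it fits the noise. Everything here (sizes, the binary-label task, plain SGD, the attention block being omitted) is my assumption for illustration, not the project's actual code:

```python
import math
import random

random.seed(0)
V, D, H = 8, 2, 16  # vocab size, embedding dim, MLP hidden width (all assumed)

# One completely random binary label per token: pure noise to memorize.
labels = [random.randint(0, 1) for _ in range(V)]

# Parameters: 2D token embeddings and a one-hidden-layer ReLU MLP with a scalar logit.
E  = [[random.gauss(0, 0.5) for _ in range(D)] for _ in range(V)]
W1 = [[random.gauss(0, 0.5) for _ in range(D)] for _ in range(H)]
b1 = [0.0] * H
W2 = [random.gauss(0, 0.5) for _ in range(H)]
b2 = 0.0

def forward(tok):
    x = list(E[tok])  # snapshot so in-place updates don't alias
    h = [max(0.0, sum(W1[j][k] * x[k] for k in range(D)) + b1[j]) for j in range(H)]
    logit = sum(W2[j] * h[j] for j in range(H)) + b2
    return x, h, logit

def mean_loss():
    total = 0.0
    for t in range(V):
        p = 1 / (1 + math.exp(-forward(t)[2]))
        p = min(max(p, 1e-12), 1 - 1e-12)  # clamp away from log(0)
        total += -math.log(p if labels[t] else 1 - p)
    return total / V

lr = 0.1
loss_before = mean_loss()
for step in range(3000):
    tok = random.randrange(V)
    x, h, logit = forward(tok)
    p = 1 / (1 + math.exp(-logit))
    g = p - labels[tok]  # d(binary cross-entropy)/d(logit)
    for j in range(H):
        gh = g * W2[j] if h[j] > 0 else 0.0  # gradient at hidden pre-activation
        W2[j] -= lr * g * h[j]
        b1[j] -= lr * gh
        for k in range(D):
            E[tok][k] -= lr * gh * W1[j][k]  # uses pre-update W1
            W1[j][k]  -= lr * gh * x[k]
    b2 -= lr * g
loss_after = mean_loss()

print(loss_before, "->", loss_after)
```

Since the labels carry no structure, any drop in loss here is memorization by definition; with trainable 2D embeddings, one can watch the tokens being pushed apart in the plane as the MLP carves out regions for each label.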