Deep learning

An actually boring introduction to knowledge neurons in transformers

The author tries to explain knowledge neurons to himself. Not much to see here...

Trying to understand how Andrej Karpathy’s minGPT works, step by step....

Figuring out how an image might be worth 16x16 words...

Stuff you need to understand before understanding how transformers work...

Why do I feel like I’ll have to come back to this post many times...

Teaching myself about the attention mechanism in vision models...

Building the building blocks for generative art...

Breaking down adversarial examples with caricatures...

Learning to reverse engineer CNNs and other stuff...

Google Summer of Code | Coding period week 9...