An actually boring introduction to knowledge neurons in transformers

The author tries to explain knowledge neurons to himself. Not much to see here...

January 16, 2022

minGPT for dummies

Trying to understand how Andrej Karpathy’s minGPT works, step by step....

October 9, 2021

An image is worth 16x16 words is worth a lot more words

Figuring out how an image might be worth 16x16 words...

August 10, 2021

Transformer toolbox

stuff you need to understand before understanding how transformers work...

August 10, 2021

My own little guide to einops

Why do feel like I’ll have to come back to this post many times...

August 9, 2021

What on earth is attention

Teaching myself about the attention mechanism in vision models...

July 30, 2021

Eden

Building the building blocks for generative art...

May 26, 2021

Visualizing adversarial examples with caricatures

Breaking down adversarial examples with carictaures...

May 26, 2021

A Few Months With Feature Visualization

Learning to reverse engineer CNNs and other stuff...

May 24, 2021

Hello DevoLearn

Google Summer of Code | Coding period week 9...

August 2, 2020