Language modeling and text generation with n-gram models and a decoder-only Transformer, implemented in Pytorch. following along @karpathy's work. more paper imolementations to come as I make my way to mechanistic interpretability.