nanoGPT

This repository contains implementations of two language models:

  1. Bigram Language Model (bigram.py) – A simple neural network-based bigram model.
  2. Transformer-based GPT Model (gpt.py) – A more advanced language model using self-attention.

Features

  • Implements a basic bigram language model with embeddings.
  • Implements a Transformer-based language model inspired by GPT.
  • Uses PyTorch for model training and inference.
  • Includes text generation capabilities.
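Both models are trained the same way: predict the next token at every position and minimize cross-entropy. The exact optimizer and hyperparameters used in `bigram.py` and `gpt.py` are not shown here, so the following is a minimal sketch of one such training step (AdamW and the shapes below are illustrative assumptions):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical setup: an embedding table stands in for either model,
# x holds input token indices, y holds the next-token targets.
vocab_size = 65
model = nn.Embedding(vocab_size, vocab_size)
opt = torch.optim.AdamW(model.parameters(), lr=1e-3)  # assumed optimizer

x = torch.randint(0, vocab_size, (4, 8))  # (batch, time)
y = torch.randint(0, vocab_size, (4, 8))  # targets: x shifted by one

logits = model(x)                                     # (B, T, vocab_size)
loss = F.cross_entropy(logits.view(-1, vocab_size), y.view(-1))
opt.zero_grad()
loss.backward()
opt.step()
```

One step like this, repeated over batches sampled from the corpus, is the whole training loop.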

Installation

Ensure you have Python installed along with the required dependencies:

pip install torch numpy

Usage

Running the Bigram Model

python bigram.py

This will train a simple bigram model and generate text.
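The core idea of a bigram model is that the logits for the next token depend only on the current token, so a single `(vocab_size, vocab_size)` embedding table is the entire model. A minimal sketch (the actual class and variable names in `bigram.py` may differ):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BigramLM(nn.Module):
    """Each row of the embedding table holds the next-token logits
    for one current token; no context beyond the last token is used."""
    def __init__(self, vocab_size):
        super().__init__()
        self.table = nn.Embedding(vocab_size, vocab_size)

    def forward(self, idx):
        return self.table(idx)  # (B, T, vocab_size) logits

    @torch.no_grad()
    def generate(self, idx, max_new_tokens):
        for _ in range(max_new_tokens):
            logits = self(idx)[:, -1, :]       # only the last position matters
            probs = F.softmax(logits, dim=-1)
            nxt = torch.multinomial(probs, 1)  # sample one next token
            idx = torch.cat([idx, nxt], dim=1)
        return idx
```

Generation simply appends one sampled token at a time, feeding the growing sequence back in.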

Running the GPT Model

python gpt.py

This will train a Transformer-based model and generate text with it.
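The piece that separates the GPT model from the bigram model is causal self-attention: each position builds its representation from itself and all earlier positions, with future positions masked out. A sketch of a single attention head, assuming the standard scaled dot-product formulation (the layer names in `gpt.py` may differ):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalSelfAttentionHead(nn.Module):
    """One head of masked self-attention: a lower-triangular mask
    stops each position from attending to later positions."""
    def __init__(self, n_embd, head_size, block_size):
        super().__init__()
        self.key = nn.Linear(n_embd, head_size, bias=False)
        self.query = nn.Linear(n_embd, head_size, bias=False)
        self.value = nn.Linear(n_embd, head_size, bias=False)
        self.register_buffer(
            "mask", torch.tril(torch.ones(block_size, block_size))
        )

    def forward(self, x):
        B, T, C = x.shape
        k, q, v = self.key(x), self.query(x), self.value(x)
        att = q @ k.transpose(-2, -1) * k.shape[-1] ** -0.5  # scaled scores
        att = att.masked_fill(self.mask[:T, :T] == 0, float("-inf"))
        att = F.softmax(att, dim=-1)                          # attention weights
        return att @ v  # (B, T, head_size)
```

A full GPT block stacks several such heads, then a feed-forward layer, with residual connections and layer norm around each.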

File Structure

  • bigram.py: Implements the Bigram Language Model.
  • gpt.py: Implements a Transformer-based GPT-style model.

About

A minimal, efficient implementation of GPT (Generative Pre-trained Transformer) using PyTorch, optimized for small-scale training and fine-tuning on custom datasets.
