This repository contains the implementation of the concepts discussed in the blog post "Understanding Parameter Calculation in Transformer-Based Models: Simplified". The post provides a detailed explanation of the Transformer architecture and the intricacies involved in counting its parameters.
The blog post breaks down the Transformer architecture into three main components:
- Embedding: Converts an input image into a sequence of embedded patches.
- Attention: The attention layer and its trainable parameters.
- Transformer: Combines the components above into the full Transformer architecture and explains its parameter count.
Each component is explained with a focus on the transformations and the trainable parameters involved.
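As a sketch of the kind of parameter counting the post walks through, the helper below tallies the trainable parameters of a standard Transformer encoder layer (attention projections, feed-forward MLP, and layer norms). The dimensions used here are ViT-Base-style values chosen for illustration and are an assumption, not necessarily the configuration used in this repository:

```python
def attention_params(d_model: int) -> int:
    # Q, K, V, and output projections: each a d_model x d_model weight plus a bias
    return 4 * (d_model * d_model + d_model)

def mlp_params(d_model: int, d_ff: int) -> int:
    # two linear layers, d_model -> d_ff -> d_model, each with a bias
    return (d_model * d_ff + d_ff) + (d_ff * d_model + d_model)

def layer_norm_params(d_model: int) -> int:
    # one scale vector and one shift vector
    return 2 * d_model

def encoder_layer_params(d_model: int, d_ff: int) -> int:
    # one attention block + one MLP block + two layer norms
    return attention_params(d_model) + mlp_params(d_model, d_ff) + 2 * layer_norm_params(d_model)

# Illustrative ViT-Base-like settings (assumed)
d_model, d_ff, n_layers = 768, 3072, 12
print(encoder_layer_params(d_model, d_ff))              # 7087872 per layer
print(n_layers * encoder_layer_params(d_model, d_ff))   # 85054464 for the 12-layer stack
```

With these dimensions each encoder layer holds about 7.09M parameters, so the 12-layer stack alone accounts for roughly 85M, which is the bulk of a ViT-Base-sized model before the embedding and head are added.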
The code in this repository is structured to reflect the modular nature of the Transformer architecture. It includes a breakdown of all the components of a Transformer, allowing the total parameter count to be calculated.

To run the code, install the packages listed in "requirements.txt" and then run app.py.
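The setup steps above might look like the following in a shell, assuming the commands are run from the repository root where "requirements.txt" and app.py live:

```shell
# Install the dependencies listed in requirements.txt
pip install -r requirements.txt

# Launch the app
python app.py
```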