jaymody / simpleGPT

Simple implementation of a GPT (training and inference) in PyTorch.

☆12

Alternatives and similar repositories for simpleGPT

Users who are interested in simpleGPT are comparing it to the repositories listed below.

EleutherAI / tokengrams
Efficiently computing & storing token n-grams from large corpora
☆24 Updated 9 months ago
srush / drop7
☆18 Updated last year
jaymody / picoBERT
Like picoGPT but for BERT.
☆50 Updated 2 years ago
simonw / llm-groq
☆11 Updated 5 months ago
EleutherAI / best-download
URL downloader supporting checkpointing and continuous checksumming.
☆19 Updated last year
Avmb / inverse_scaling_prize_code_identifier_swap
Submission to the inverse scaling prize
☆23 Updated last year
iantbutler01 / ditty
A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.
☆16 Updated 8 months ago
mlabonne / chessllm
☆38 Updated last year
BobMcDear / flaim
Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.
☆20 Updated last month
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14 Updated 2 years ago
mcminis1 / mr-graph
a graph definition and execution library for python
☆16 Updated 2 years ago
facebookresearch / coocmap
code for paper "Accessing higher dimensions for unsupervised word translation"
☆21 Updated 2 years ago
raphaelsty / textokb
Extract knowledge from raw text
☆13 Updated 3 years ago
Zyphra / zcookbook
Training hybrid models for dummies.
☆25 Updated 6 months ago
rahuldshetty / starcoder.js
Web browser version of StarCoder.cpp
☆45 Updated last year
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32 Updated last year
ngoyal2707 / Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆18 Updated 2 years ago
explodinggradients / Funtuner
Supervised instruction finetuning for LLM with HF trainer and Deepspeed
☆35 Updated 2 years ago
huggingface / ethics-scripts
☆14 Updated 2 years ago
facebookresearch / lss_eval
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31 Updated last year
daveshap / GibberishDetector
Detecting gibberish as a type of sentiment analysis with GPT2
☆24 Updated 4 years ago
Alignment-Lab-AI / datagen
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆30 Updated 9 months ago
modal-labs / ci-on-modal
A sample pattern for running CI tests on Modal
☆18 Updated 3 months ago
andrewgcodes / vec2vec
☆15 Updated 2 years ago
ncsulsj / Robust_Summarization
☆9 Updated last year
geronimi73 / mamba
☆31 Updated last year
allenai / cached_path
A file utility for accessing both local and remote files through a unified interface.
☆43 Updated 2 months ago
huggingface / hffs
**ARCHIVED** Filesystem interface to 🤗 Hub
☆58 Updated 2 years ago
fattorib / Little-GPT
GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!
☆20 Updated 2 years ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18 Updated 5 months ago