jaymody / simpleGPTLinks
Simple implementation of a GPT (training and inference) in PyTorch.
☆12Updated last year
Alternatives and similar repositories for simpleGPT
Users that are interested in simpleGPT are comparing it to the libraries listed below
Sorting:
- Like picoGPT but for BERT.☆50Updated 2 years ago
- ☆11Updated 7 months ago
- "PyTorch in Rust"☆16Updated last year
- Efficiently computing & storing token n-grams from large corpora☆26Updated 11 months ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- ☆39Updated last year
- ☆18Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Updated 2 years ago
- Test prompts for GPT-J-6B and the resulting AI-generated texts☆53Updated 4 years ago
- Python tools☆12Updated last year
- Tools for encoding Magic: The Gathering cards into a form suitable for AI text generation☆19Updated 4 years ago
- Use sync mode Playwright interactively, inside a Jupyter notebook☆15Updated 5 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Turn any collection of files into a dataset☆45Updated 2 years ago
- a writeup on some experiments on a sequence model for chess games☆31Updated 4 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆46Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- ☆20Updated 11 months ago
- GPT Takes the Bar Exam☆142Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆13Updated last year
- JAX implementations of RWKV☆19Updated last year
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3☆18Updated 4 years ago
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆59Updated 3 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Summarize the top 30 most popular arXiv papers on Reddit, Hacker News and Hugging Face in the last 30 days. And post them to Slack, Twitt…☆23Updated 2 months ago
- Simple high-throughput inference library☆127Updated 4 months ago
- The Next Generation Multi-Modality Superintelligence☆70Updated last year
- ☆42Updated 2 weeks ago