jaymody / simpleGPT
Simple implementation of a GPT (training and inference) in PyTorch.
☆12 Updated last year
Alternatives and similar repositories for simpleGPT
Users who are interested in simpleGPT are comparing it to the libraries listed below.
- ☆11 Updated 6 months ago
- Like picoGPT but for BERT. ☆50 Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming. ☆19 Updated last year
- Trying to deconstruct RWKV in understandable terms ☆14 Updated 2 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆58 Updated 2 years ago
- "PyTorch in Rust" ☆16 Updated last year
- ☆18 Updated last year
- ☆41 Updated 3 months ago
- JAX implementations of RWKV ☆19 Updated last year
- Efficiently computing & storing token n-grams from large corpora ☆26 Updated 10 months ago
- Training hybrid models for dummies. ☆25 Updated 6 months ago
- The code that runs my blog: https://blog.gpt4.org/ ☆9 Updated 3 years ago
- Low-Rank Adaptation of Large Language Models clean implementation ☆8 Updated 2 years ago
- Test prompts for GPT-J-6B and the resulting AI-generated texts ☆53 Updated 4 years ago
- Learning Unum's efficient data-processing tools one cool project at a time ☆12 Updated 2 years ago
- Turn any collection of files into a dataset ☆45 Updated 2 years ago
- MozoLM: A language model (LM) serving library ☆45 Updated this week
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset. ☆32 Updated 2 years ago
- Web browser version of StarCoder.cpp ☆45 Updated 2 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 8 months ago
- Simple high-throughput inference library ☆125 Updated 2 months ago
- ☆39 Updated 2 years ago
- RWKV model implementation ☆38 Updated 2 years ago
- ☆38 Updated last year
- High-performance tokenized language data-loader for Python C++ extension ☆13 Updated last year
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence ☆59 Updated 3 years ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax. ☆20 Updated 2 months ago
- Interpretability analysis of language model outlier and attempts to distill the model ☆13 Updated 2 years ago