jaymody / simpleGPTLinks
Simple implementation of a GPT (training and inference) in PyTorch.
☆13Updated last year
Alternatives and similar repositories for simpleGPT
Users that are interested in simpleGPT are comparing it to the libraries listed below
Sorting:
- JAX implementations of RWKV☆19Updated 2 years ago
- ☆11Updated 9 months ago
- ☆40Updated last year
- Like picoGPT but for BERT.☆51Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora☆26Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- ☆20Updated last year
- MozoLM: A language model (LM) serving library☆45Updated last month
- Test prompts for GPT-J-6B and the resulting AI-generated texts☆53Updated 4 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆31Updated 2 years ago
- High-performance tokenized language data-loader for Python C++ extension☆13Updated last year
- Web browser version of StarCoder.cpp☆44Updated 2 years ago
- Fast inference of Instruct tuned LLaMa on your personal devices.☆23Updated 2 years ago
- ☆18Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- Simple high-throughput inference library☆149Updated 5 months ago
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!☆21Updated 3 years ago
- trying to make WebGPU a bit easier to use☆18Updated last year
- inference code for mixtral-8x7b-32kseqlen☆102Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 11 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- "PyTorch in Rust"☆17Updated last year
- ☆11Updated 2 years ago
- Rust bindings for CTranslate2☆14Updated 2 years ago
- Python bindings for ggml☆146Updated last year
- Experiments with BitNet inference on CPU☆54Updated last year
- The Next Generation Multi-Modality Superintelligence☆69Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆62Updated 2 years ago