jaymody / simpleGPT
Simple implementation of a GPT (training and inference) in PyTorch.
☆12 Updated last year
Alternatives and similar repositories for simpleGPT
Users who are interested in simpleGPT are comparing it to the repositories listed below.
- ☆11 Updated 7 months ago
- Trying to deconstruct RWKV in understandable terms ☆14 Updated 2 years ago
- ☆20 Updated 11 months ago
- ☆18 Updated last year
- Summarize the top 30 most popular arXiv papers on Reddit, Hacker News and Hugging Face in the last 30 days. And post them to Slack, Twitt… ☆23 Updated last month
- ☆16 Updated 2 years ago
- Like picoGPT but for BERT. ☆50 Updated 2 years ago
- "PyTorch in Rust" ☆16 Updated last year
- JAX implementations of RWKV ☆19 Updated last year
- Efficiently computing & storing token n-grams from large corpora ☆26 Updated 10 months ago
- ☆42 Updated 4 months ago
- Learning Unum's efficient data-processing tools one cool project at a time ☆12 Updated 2 years ago
- Experiments with BitNet inference on CPU ☆54 Updated last year
- MozoLM: A language model (LM) serving library ☆45 Updated 3 weeks ago
- High-performance tokenized language data-loader for Python C++ extension ☆13 Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebook ☆15 Updated 5 months ago
- Tools for encoding Magic: The Gathering cards into a form suitable for AI text generation ☆19 Updated 4 years ago
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence ☆59 Updated 3 years ago
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more! ☆20 Updated 2 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 9 months ago
- Training hybrid models for dummies. ☆25 Updated 7 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml ☆85 Updated last year
- Real-time visualisation ☆18 Updated this week
- trying to make WebGPU a bit easier to use ☆17 Updated last year
- ☆39 Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models. ☆31 Updated 7 months ago
- ☆13 Updated last year
- ☆11 Updated last year
- Simple high-throughput inference library ☆127 Updated 3 months ago
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /… ☆40 Updated 2 years ago