Various transformers for FSDP research
☆38 · Nov 11, 2022 · Updated 3 years ago
Alternatives and similar repositories for transformer_central
Users who are interested in transformer_central are comparing it to the libraries listed below.
- ☆20 · Nov 23, 2022 · Updated 3 years ago
- A chat implementation for FastHTML ☆11 · Sep 14, 2025 · Updated 5 months ago
- ☆18 · Apr 3, 2023 · Updated 2 years ago
- Build modern UIs in Jupyter with Python ☆12 · Dec 28, 2022 · Updated 3 years ago
- ☆63 · Sep 23, 2024 · Updated last year
- A tracing JIT compiler for PyTorch ☆13 · Dec 11, 2021 · Updated 4 years ago
- CUDA and Triton implementations of Flash Attention with SoftmaxN. ☆73 · May 26, 2024 · Updated last year
- Library for extremely fast HTML generation from Python ☆27 · Oct 24, 2024 · Updated last year
- Deep Learning CNN using FastAI for the Stanford MRNet Knee MRI diagnosis challenge ☆16 · May 18, 2019 · Updated 6 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆32 · Sep 22, 2024 · Updated last year
- ☆21 · Mar 3, 2025 · Updated 11 months ago
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages ☆23 · Feb 13, 2023 · Updated 3 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆73 · Feb 17, 2026 · Updated last week
- ☆235 · Jun 11, 2024 · Updated last year
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes ☆42 · Aug 7, 2025 · Updated 6 months ago
- A set of scripts and notebooks on LLM finetuning and dataset creation ☆116 · Sep 27, 2024 · Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 · Apr 17, 2024 · Updated last year
- Classification of Single cells by Transfer Learning ☆10 · Oct 11, 2025 · Updated 4 months ago
- Attention in SRAM on Tenstorrent Grayskull ☆40 · Jul 18, 2024 · Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆91 · Feb 27, 2024 · Updated 2 years ago
- ☆261 · Jul 11, 2024 · Updated last year
- Leverage your LangChain trace data for fine tuning ☆46 · Aug 2, 2024 · Updated last year
- A smart web crawler built in Rust that uses Claude AI to select the most relevant URLs from website sitemaps based on crawling objectives… ☆19 · Jul 9, 2025 · Updated 7 months ago
- WIP: Python client for Liftbridge. ☆10 · Jul 5, 2020 · Updated 5 years ago
- ☆11 · Jul 17, 2023 · Updated 2 years ago
- ☆10 · Dec 17, 2019 · Updated 6 years ago
- ☆10 · Aug 15, 2022 · Updated 3 years ago
- Slimebound character mod for Slay the Spire ☆14 · Jun 30, 2020 · Updated 5 years ago
- ☆93 · Jul 5, 2024 · Updated last year
- ☆11 · Jun 4, 2021 · Updated 4 years ago
- QLoRA with Enhanced Multi GPU Support ☆38 · Aug 8, 2023 · Updated 2 years ago
- ☆44 · Aug 21, 2023 · Updated 2 years ago
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash… ☆280 · Nov 24, 2025 · Updated 3 months ago
- ☆10 · Jan 30, 2026 · Updated last month
- OpenAI ROS ☆12 · Mar 7, 2019 · Updated 6 years ago
- Computing with sed: a compiler from python to sed ☆11 · May 24, 2019 · Updated 6 years ago
- ☆11 · Apr 14, 2022 · Updated 3 years ago
- 🤖Artificial intelligence classify a food 🍎 nutritional table by a simple photo. Don't eat 🍔🍕🌮... ☆10 · May 7, 2020 · Updated 5 years ago
- Collection of python scripts to demonstrate asynchronous programming in python ☆11 · May 22, 2022 · Updated 3 years ago