kyegomez / Python-Package-Template
An easy, reliable, fluid template for Python packages, complete with docs, testing suites, READMEs, GitHub workflows, linting, and much more
☆167 Updated 2 months ago
Alternatives and similar repositories for Python-Package-Template:
Users who are interested in Python-Package-Template are comparing it to the repositories listed below:
- LoRA and DoRA from Scratch Implementations ☆199 Updated last year
- An extension of the nanoGPT repository for training small MOE models. ☆109 Updated 3 weeks ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆54 Updated 11 months ago
- ☆173 Updated 3 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆210 Updated last week
- minimal GRPO implementation from scratch ☆65 Updated 2 weeks ago
- ☆182 Updated this week
- Implementation of Infini-Transformer in Pytorch ☆110 Updated 2 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024) ☆150 Updated 3 months ago
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI ☆276 Updated last week
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model" ☆163 Updated 2 months ago
- Train, tune, and infer Bamba model ☆87 Updated 2 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models ☆90 Updated 6 months ago
- Working implementation of DeepSeek MLA ☆39 Updated 2 months ago
- ☆261 Updated last month
- Awesome list of papers that extend Mamba to various applications. ☆132 Updated 3 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new… ☆120 Updated 8 months ago
- [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models ☆209 Updated 3 weeks ago
- ☆81 Updated last year
- PyTorch building blocks for the OLMo ecosystem ☆177 Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆103 Updated 4 months ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆108 Updated 3 months ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch ☆161 Updated 3 months ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public ☆80 Updated last month
- The official implementation of Tensor ProducT ATTenTion Transformer (T6) ☆345 Updated last month
- ☆47 Updated 7 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting… ☆155 Updated this week
- Explorations into the recently proposed Taylor Series Linear Attention ☆95 Updated 7 months ago
- Code repository for Black Mamba ☆243 Updated last year
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO) ☆91 Updated this week