kyegomez / Python-Package-Template
An easy, reliable, fluid template for Python packages, complete with docs, testing suites, READMEs, GitHub workflows, linting, and much more
☆165 Updated last month
Alternatives and similar repositories for Python-Package-Template:
Users who are interested in Python-Package-Template are comparing it to the repositories listed below.
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆53 Updated 11 months ago
- ☆172 Updated 3 months ago
- Implementation of Infini-Transformer in Pytorch ☆109 Updated 2 months ago
- LoRA and DoRA from Scratch Implementations ☆198 Updated last year
- Working implementation of DeepSeek MLA ☆38 Updated 2 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024) ☆149 Updated 2 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model" ☆160 Updated last month
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆208 Updated 3 weeks ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ☆277 Updated 2 weeks ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed". ☆162 Updated 3 months ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch ☆159 Updated 2 months ago
- ☆181 Updated this week
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI ☆274 Updated 4 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆305 Updated 3 months ago
- Implementation of the Llama architecture with RLHF + Q-learning ☆163 Updated last month
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models" ☆84 Updated this week
- Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind ☆121 Updated 6 months ago
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention… ☆287 Updated 10 months ago
- Collection of autoregressive model implementation