whyNLP / tinyllama
A side project that follows all the acceleration tricks in tinyllama, with minimal modifications to the Hugging Face transformers code.
☆13 · Sep 2, 2024 · Updated last year
Alternatives and similar repositories for tinyllama
Users who are interested in tinyllama are comparing it to the repositories listed below.
- ☆16 · Dec 19, 2024 · Updated last year
- ☆19 · Dec 4, 2025 · Updated 2 months ago
- RADLADS training code ☆36 · May 7, 2025 · Updated 9 months ago
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD ☆33 · Jul 1, 2025 · Updated 7 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 · May 25, 2024 · Updated last year
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling ☆40 · Dec 2, 2023 · Updated 2 years ago
- A python algorithm to change the pitch of the voice in real time ☆13 · Dec 13, 2020 · Updated 5 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 · Jul 19, 2024 · Updated last year
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance… ☆157 · Apr 7, 2025 · Updated 10 months ago
- ☆16 · Jul 23, 2023 · Updated 2 years ago
- (READ ONLY MIRROR) The ProB Model Checker and Animator Plugin for Rodin ☆19 · Jan 24, 2026 · Updated 2 weeks ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi… ☆23 · Oct 1, 2025 · Updated 4 months ago
- CVPR 2023: PAniC-3D, Vtubers dataset downloader ☆13 · Apr 22, 2023 · Updated 2 years ago
- ☆11 · May 11, 2023 · Updated 2 years ago
- ☆11 · May 29, 2025 · Updated 8 months ago
- yaml & bake ☆19 · Jan 21, 2020 · Updated 6 years ago
- Shogi analysis in Python ☆11 · Nov 28, 2015 · Updated 10 years ago
- ☆11 · Oct 24, 2017 · Updated 8 years ago
- ☆13 · Nov 27, 2025 · Updated 2 months ago
- ☆13 · Jan 23, 2026 · Updated 3 weeks ago
- An implementation of a general multi-layer neural network (MLP) in F#. Evaluated using data sampled from complex functions plus white noi… ☆11 · Jun 4, 2018 · Updated 7 years ago
- Proxify Molotov.tv DRM to share content publicly ☆10 · Jun 24, 2020 · Updated 5 years ago
- ☆12 · Aug 30, 2025 · Updated 5 months ago
- ☆13 · Mar 25, 2025 · Updated 10 months ago
- Optimized primitives for collective multi-GPU communication ☆10 · May 8, 2024 · Updated last year
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 · May 8, 2024 · Updated last year
- ICLR 2023: Learning to Extrapolate: A Transductive Approach ☆11 · Aug 15, 2023 · Updated 2 years ago
- Download, parse, and filter data from Literotica. Data-ready for The-Pile. ☆11 · Sep 18, 2020 · Updated 5 years ago
- NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool. ☆16 · Apr 7, 2025 · Updated 10 months ago
- Discord bot for twitter/twitcasting/twitch tracking... ☆13 · Nov 8, 2025 · Updated 3 months ago
- "mmult" example using SDSoC for PYNQ board ☆11 · Feb 23, 2017 · Updated 8 years ago
- Implementation of Hyena Hierarchy in JAX ☆10 · Apr 30, 2023 · Updated 2 years ago
- Kernel objects for scaling and format conversion within VapourSynth ☆12 · Nov 5, 2025 · Updated 3 months ago
- Akka Cluster for Value-at-Risk calculation ☆14 · May 2, 2014 · Updated 11 years ago
- the indexer and search engine for irchiver, see https://irchiver.com for license and other information ☆14 · Dec 2, 2021 · Updated 4 years ago
- Wealth Defined/Backed Currency System ☆11 · May 26, 2021 · Updated 4 years ago
- A Ruby gem to interface for the PCRE2 library ☆10 · Aug 15, 2020 · Updated 5 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset ☆12 · Sep 29, 2025 · Updated 4 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models ☆17 · Nov 4, 2025 · Updated 3 months ago