Triang-jyed-driung / rwkv7miniView external linksLinks
RWKV-7 mini
☆12Mar 29, 2025Updated 10 months ago
Alternatives and similar repositories for rwkv7mini
Users that are interested in rwkv7mini are comparing it to the libraries listed below
Sorting:
- ☆11Oct 11, 2023Updated 2 years ago
- ☆13May 11, 2025Updated 9 months ago
- GoldFinch and other hybrid transformer components☆12Dec 9, 2025Updated 2 months ago
- ☆41Apr 30, 2025Updated 9 months ago
- Calculate the probability of a paper being accepted by EMNLP2023 based on score distribution of ACL2023.☆14Sep 7, 2023Updated 2 years ago
- ☆24Dec 11, 2024Updated last year
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 8 months ago
- GoldFinch and other hybrid transformer components☆45Jul 20, 2024Updated last year
- RWKV models and examples powered by candle.☆24Jan 19, 2026Updated 3 weeks ago
- ☆27Jul 28, 2025Updated 6 months ago
- Github Repository for the HOI4 ULTRA Project.☆11Updated this week
- ☆26Nov 27, 2024Updated last year
- RADLADS training code☆37May 7, 2025Updated 9 months ago
- ☆177Apr 23, 2025Updated 9 months ago
- ☆67Mar 21, 2025Updated 10 months ago
- Experiments on the impact of depth in transformers and SSMs.☆40Oct 23, 2025Updated 3 months ago
- PyTorch implementation of RWKV blocks☆32Jul 22, 2025Updated 6 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Oct 21, 2025Updated 3 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆33Aug 14, 2024Updated last year
- Fast modular code to create and train cutting edge LLMs☆68May 16, 2024Updated last year
- State tuning tunes the state☆35Feb 12, 2025Updated last year
- A chat solution to mechanical 3D modelling with SOLIDWORKS -- 用于AI辅助机械3D建模的SolidWorks插件☆12Mar 14, 2024Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Gemini API: Rotate keys, break limits.☆16Oct 17, 2025Updated 4 months ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Aug 22, 2025Updated 5 months ago
- ☆54Dec 17, 2025Updated 2 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆94Oct 8, 2025Updated 4 months ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆54Jan 12, 2026Updated last month
- RWKV-7: Surpassing GPT☆104Nov 17, 2024Updated last year
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆42Mar 11, 2025Updated 11 months ago
- wasm bindings for huggingface tokenizers library☆34Jun 30, 2022Updated 3 years ago
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Apr 9, 2023Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- ☆16Jul 23, 2023Updated 2 years ago
- test images with not appropriate labels in MNIST dataset☆10Mar 3, 2018Updated 7 years ago
- An agent of personal activity monitoring system for Windows desktop.☆12Sep 19, 2018Updated 7 years ago
- GPU-accelerated RIME implementations. An offshoot of the BIRO projects, and one of the foothills of Mt Exaflop.☆10Dec 10, 2025Updated 2 months ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 3 months ago