BBuf / RWKV-World-HF-Tokenizer
☆33 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for RWKV-World-HF-Tokenizer
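RWKV-World-HF-Tokenizer packages the RWKV "World" tokenizer so it can be loaded through Hugging Face's `AutoTokenizer` interface. A minimal sketch of that loading pattern, assuming a published checkpoint that bundles the custom tokenizer code (the model id below is illustrative, not taken from the repo):

```python
# Minimal sketch (not from the repo): load an RWKV "World" tokenizer via the
# standard Hugging Face AutoTokenizer interface.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(
    "RWKV/rwkv-4-world-169m",  # illustrative/assumed model id; substitute your checkpoint
    trust_remote_code=True,    # the World tokenizer ships as custom tokenizer code
)

ids = tokenizer("Hello, RWKV world!")["input_ids"]
print(ids)
print(tokenizer.decode(ids))
```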
- FuseAI Project ☆76 · Updated 2 months ago
- A fast RWKV Tokenizer written in Rust ☆36 · Updated 2 months ago
- ☆26 · Updated 4 months ago
- Reinforcement Learning Toolkit for RWKV. Distillation, SFT, RLHF (DPO, ORPO), infinite context training, aligning. Let's boost the model's int… ☆18 · Updated this week
- A repository for research on medium-sized language models. ☆74 · Updated 5 months ago
- Evaluating LLMs with Dynamic Data ☆68 · Updated this week
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention" ☆91 · Updated last month
- DPO, but faster 🚀 ☆20 · Updated last week
- ☆33 · Updated 5 months ago
- A toolkit that enhances PyTorch with specialized functions for low-bit quantized neural networks. ☆28 · Updated 4 months ago
- RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond! ☆133 · Updated 2 months ago
- ☆52 · Updated 5 months ago
- QuIP quantization ☆46 · Updated 7 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO… ☆51 · Updated this week
- ☆62 · Updated last month
- Reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction" ☆21 · Updated 5 months ago
- Copies the MLP of Llama 3 eight times as 8 experts, creates a router with random initialization, and adds a load-balancing loss to construct an 8x8b Mo… ☆25 · Updated 4 months ago
- ☆34 · Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 · Updated 8 months ago
- ☆44 · Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM ☆42 · Updated 6 months ago
- Demonstration that finetuning a RoPE model on longer sequences than it was pre-trained on extends the model's context limit ☆63 · Updated last year
- ☆17 · Updated 7 months ago
- My fork of Allen AI's OLMo for educational purposes. ☆28 · Updated 6 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (Official Code) ☆133 · Updated last month
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models” ☆116 · Updated 4 months ago
- Contextual Position Encoding, but with some custom CUDA kernels (https://arxiv.org/abs/2405.18719) ☆19 · Updated 5 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆129 · Updated last month
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context ☆16 · Updated 2 months ago
- A pipeline-parallel training script for LLMs. ☆83 · Updated 3 weeks ago