Jellyfish042 / uncheatable_evalLinks
Evaluating LLMs with Dynamic Data
☆92Updated 2 weeks ago
Alternatives and similar repositories for uncheatable_eval
Users that are interested in uncheatable_eval are comparing it to the libraries listed below
Sorting:
- ☆124Updated last week
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆147Updated 9 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆202Updated last year
- RWKV-7: Surpassing GPT☆88Updated 6 months ago
- RWKV, in easy to read code☆72Updated 2 months ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…☆149Updated last month
- Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…☆42Updated 2 weeks ago
- Multipack distributed sampler for fast padding-free training of LLMs☆188Updated 9 months ago
- Fast modular code to create and train cutting edge LLMs☆66Updated last year
- ☆140Updated 6 months ago
- ☆50Updated 7 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆143Updated 8 months ago
- A toolkit for scaling law research ⚖☆49Updated 4 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆78Updated last year
- Experiments on speculative sampling with Llama models☆126Updated last year
- A large-scale RWKV v6, v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to de…☆35Updated last week
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆54Updated 9 months ago
- This is the official repository for Inheritune.☆111Updated 3 months ago
- This project is established for real-time training of the RWKV model.☆49Updated last year
- A repository for research on medium sized language models.☆76Updated last year
- Unofficial implementation of AlpaGasus☆91Updated last year
- ☆34Updated 10 months ago
- Language models scale reliably with over-training and on downstream tasks☆97Updated last year
- ☆34Updated 11 months ago
- ☆114Updated 3 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆161Updated 11 months ago
- A pipeline for LLM knowledge distillation☆104Updated 2 months ago
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models☆75Updated 7 months ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆186Updated 2 months ago
- DPO, but faster 🚀☆42Updated 6 months ago