liyucheng09 / llm-compressive
Longitudinal Evaluation of LLMs via Data Compression
☆30 · Updated 7 months ago
Alternatives and similar repositories for llm-compressive:
Users who are interested in llm-compressive are comparing it to the libraries listed below.
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main) ☆85 · Updated 3 months ago
- An Experiment on Dynamic NTK Scaling RoPE ☆62 · Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models ☆75 · Updated 10 months ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection ☆38 · Updated 2 months ago
- 🔥 A minimal training framework for scaling FLA models ☆24 · Updated this week
- Code for paper "Patch-Level Training for Large Language Models" ☆75 · Updated 2 months ago
- ☆33 · Updated 9 months ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales ☆31 · Updated last year
- code for Scaling Laws of RoPE-based Extrapolation ☆71 · Updated last year
- ☆93 · Updated 3 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆76 · Updated last year
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance… ☆146 · Updated last month
- Odysseus: Playground of LLM Sequence Parallelism ☆64 · Updated 7 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs. ☆58 · Updated 2 months ago
- ☆105 · Updated last year
- ☆69 · Updated this week
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆42 · Updated 6 months ago
- A collection of instruction data and scripts for machine translation. ☆20 · Updated last year
- Repository for CPU Kernel Generation for LLM Inference ☆25 · Updated last year
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models” ☆119 · Updated this week
- ☆22 · Updated last year
- Repository of LV-Eval Benchmark ☆56 · Updated 4 months ago
- Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind ☆85 · Updated 10 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models ☆38 · Updated 10 months ago
- Low-bit optimizers for PyTorch ☆125 · Updated last year
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)" ☆147 · Updated last month
- Distributed IO-aware Attention algorithm ☆18 · Updated 4 months ago
- An easy-to-use package for implementing SmoothQuant for LLMs ☆89 · Updated 8 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆71 · Updated 7 months ago
- Unofficial implementation of AlpaGasus ☆90 · Updated last year