lm-sys / lm-sys.github.io
☆49Updated last week
Related projects: ⓘ
- ☆26Updated last year
- [ICLR 2024] Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding☆138Updated 6 months ago
- Evaluation and analysis code for LLM360☆75Updated 3 months ago
- Experiments on speculative sampling with Llama models☆114Updated last year
- Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆55Updated this week
- RepoQA: Evaluating Long-Context Code Understanding☆96Updated this week
- ☆77Updated this week
- Simple implementation of Speculative Sampling in NumPy for GPT-2.☆87Updated last year
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆58Updated this week
- A list of LLM benchmark frameworks.☆57Updated 7 months ago
- The data processing pipeline for the Koala chatbot language model☆115Updated last year
- ☆170Updated last month
- ☆61Updated 3 weeks ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆99Updated last month
- ☆83Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆96Updated 10 months ago
- ☆110Updated 4 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆264Updated 9 months ago
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆68Updated 2 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆104Updated 3 months ago
- Expert Specialized Fine-Tuning☆129Updated last month
- EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).☆44Updated 3 months ago
- ☆83Updated 3 weeks ago
- Public Inflection Benchmarks☆69Updated 6 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated 8 months ago
- Evaluating LLMs with Dynamic Data☆66Updated 2 weeks ago
- ☆21Updated 9 months ago
- ☆32Updated last week
- Benchmark suite for LLMs from Fireworks.ai☆51Updated this week
- 🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.☆64Updated last month