Linear95 / SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
☆96Updated this week
Related projects ⓘ
Alternatives and complementary repositories for SPAG
- ☆113Updated 3 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆127Updated last month
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆200Updated 5 months ago
- ☆89Updated 4 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆194Updated last week
- ☆102Updated last month
- Self-Alignment with Principle-Following Reward Models☆148Updated 8 months ago
- Experiments on speculative sampling with Llama models☆117Updated last year
- ☆116Updated 5 months ago
- A simple unified framework for evaluating LLMs☆138Updated this week
- Reformatted Alignment☆112Updated last month
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆73Updated 9 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆114Updated this week
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆156Updated 3 months ago
- ☆111Updated last month
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆153Updated 3 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆124Updated 2 weeks ago
- PASTA: Post-hoc Attention Steering for LLMs☆107Updated 2 months ago
- ☆158Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆48Updated 7 months ago
- This is the official repository for Inheritune.☆105Updated last month
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆91Updated 4 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆129Updated last month
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs☆42Updated 4 months ago
- ☆98Updated 5 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆79Updated this week
- ☆49Updated 6 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆67Updated 5 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents☆138Updated 3 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆105Updated 7 months ago