Dahoas / QDSyntheticData
☆11Updated last month
Related projects: ⓘ
- ☆20Updated last week
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆46Updated 5 months ago
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State☆14Updated 8 months ago
- Harmonic Datasets☆26Updated 2 months ago
- A framework for few-shot evaluation of autoregressive language models.☆23Updated 8 months ago
- ☆25Updated last month
- ☆101Updated 2 months ago
- CodeUltraFeedback: aligning large language models to coding preferences☆62Updated 2 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆44Updated 8 months ago
- ☆22Updated 2 months ago
- ☆44Updated 11 months ago
- ☆47Updated 3 months ago
- Neural theorem proving tutorial, version II☆28Updated 4 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆15Updated this week
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆15Updated 3 weeks ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆62Updated last year
- Can Language Models Solve Olympiad Programming?☆92Updated last month
- ☆74Updated this week
- ☆73Updated last year
- ☆80Updated 9 months ago
- ☆48Updated 3 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆45Updated 3 months ago
- This is the official repository for all the code of TheoremLlama☆26Updated 2 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆78Updated last week
- Directional Preference Alignment☆44Updated 3 months ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆34Updated last year
- ☆26Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆40Updated 8 months ago
- ☆23Updated 4 months ago
- [EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning☆41Updated 3 months ago