huiwy / reflection-on-treesLinks
☆14Updated last year
Alternatives and similar repositories for reflection-on-trees
Users that are interested in reflection-on-trees are comparing it to the libraries listed below
Sorting:
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆54Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆26Updated last year
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆30Updated last year
- Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09…☆20Updated 6 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆22Updated 10 months ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated last year
- ☆18Updated last year
- ☆14Updated last year
- ☆98Updated last year
- Evaluate the Quality of Critique☆35Updated last year
- Complexity Based Prompting for Multi-Step Reasoning☆17Updated 2 years ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆40Updated 2 years ago
- Official implementation of ICML 2025 paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https:…☆25Updated last month
- Benchmarking Benchmark Leakage in Large Language Models☆52Updated last year
- Revisiting Mid-training in the Era of RL Scaling☆62Updated 2 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆12Updated last year
- ☆51Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆75Updated last year
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆25Updated 10 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 9 months ago
- Self-Supervised Alignment with Mutual Information☆19Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆62Updated 11 months ago
- ☆30Updated 6 months ago
- ☆48Updated last month
- [ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks☆51Updated last year
- Analyzing LLM Alignment via Token distribution shift☆16Updated last year
- Learning adapter weights from task descriptions☆19Updated last year
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆29Updated last year
- ☆41Updated last year
- ☆28Updated last year