oashua / MathAgent
Code repo for MathAgent
☆13Updated last year
Alternatives and similar repositories for MathAgent:
Users that are interested in MathAgent are comparing it to the libraries listed below
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 4 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆34Updated last year
- This is the official repository for all the code of TheoremLlama☆37Updated 4 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Efficient Scaling laws and collaborative pretraining.☆14Updated 2 weeks ago
- Minimum Description Length probing for neural network representations☆18Updated 2 weeks ago
- ☆23Updated 5 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated 11 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆18Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆31Updated last year
- ☆20Updated 8 months ago
- ☆15Updated 6 months ago
- ☆59Updated this week
- ☆31Updated 4 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated 8 months ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆11Updated 5 months ago
- ☆17Updated 4 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆51Updated 10 months ago
- NeurIPS 2024 tutorial on LLM Inference☆39Updated 2 months ago
- A repository for research on medium sized language models.☆76Updated 8 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆21Updated 2 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆72Updated 3 weeks ago
- Repository for Skill Set Optimization☆12Updated 6 months ago
- Aioli: A unified optimization framework for language model data mixing☆20Updated 3 weeks ago
- ☆22Updated last month
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆27Updated 9 months ago