oashua / MathAgent
Code repo for MathAgent
☆13Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for MathAgent
- This is the official repository for all the code of TheoremLlama☆30Updated 3 weeks ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆34Updated 8 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆30Updated 9 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆30Updated last month
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆28Updated 8 months ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated 7 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆25Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆44Updated 9 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆48Updated 7 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆54Updated 4 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆30Updated 3 months ago
- Code/data for MARG (multi-agent review generation)☆30Updated 5 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆34Updated 2 weeks ago
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆51Updated 3 weeks ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆30Updated 2 months ago
- ☆17Updated 8 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated this week
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆38Updated last month
- ☆31Updated 7 months ago
- Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.☆15Updated last week
- Scalable Meta-Evaluation of LLMs as Evaluators☆41Updated 8 months ago
- Official implementation of paper "General Preference Modeling with Preference Representations for Aligning Language Models" (https://arxi…☆17Updated 2 weeks ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆33Updated last year
- Evaluation of neuro-symbolic engines☆33Updated 3 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated 8 months ago
- Evaluate the Quality of Critique☆35Updated 5 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆67Updated last month
- ☆57Updated last month
- ☆21Updated last month
- ☆15Updated 3 months ago