[ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: https://huggingface.co/papers/2402.07625)
☆90 · Nov 23, 2025 · Updated 3 months ago
Alternatives and similar repositories for AutoMathText
Users who are interested in AutoMathText are comparing it to the libraries listed below.
- ☆30 · Dec 27, 2024 · Updated last year
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs. ☆464 · Apr 18, 2024 · Updated last year
- ☆169 · May 2, 2024 · Updated last year
- ☆71 · Oct 16, 2024 · Updated last year
- ☆27 · Jul 16, 2025 · Updated 8 months ago
- The code and data for the paper JiuZhang3.0 ☆49 · May 26, 2024 · Updated last year
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆273 · Apr 26, 2024 · Updated last year
- ☆19 · Apr 5, 2025 · Updated 11 months ago
- ☆14 · Mar 11, 2024 · Updated 2 years ago
- ☆109 · Jul 15, 2025 · Updated 8 months ago
- The official repository of the Omni-MATH benchmark. ☆93 · Dec 22, 2024 · Updated last year
- ☆64 · Apr 9, 2024 · Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆317 · Dec 20, 2023 · Updated 2 years ago
- Large language models designed for formal theorem proving through tool-integrated reasoning. ☆33 · Aug 13, 2025 · Updated 7 months ago
- ☆567 · Nov 20, 2024 · Updated last year
- ☆36 · Jan 10, 2025 · Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆267 · Jul 8, 2025 · Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Oct 27, 2024 · Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆201 · Dec 8, 2025 · Updated 3 months ago
- Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models> ☆77 · Jan 8, 2026 · Updated 2 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆270 · Sep 12, 2024 · Updated last year
- ☆15 · Jan 27, 2025 · Updated last year
- ☆16 · Mar 6, 2025 · Updated last year
- [NeurIPS D&B 2024] Generative AI for Math: MathPile ☆420 · Apr 4, 2025 · Updated 11 months ago
- ☆12 · Jan 2, 2024 · Updated 2 years ago
- MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models ☆454 · Feb 1, 2024 · Updated 2 years ago
- Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe… ☆14 · Aug 13, 2024 · Updated last year
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024] ☆147 · Sep 20, 2024 · Updated last year
- ☆44 · Sep 19, 2024 · Updated last year
- Awesome Triton Resources ☆39 · Apr 27, 2025 · Updated 10 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆28 · Mar 9, 2026 · Updated last week
- This is the official implementation for MA-LoT. ☆19 · Aug 4, 2025 · Updated 7 months ago
- A project to improve skills of large language models ☆882 · Updated this week
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM… ☆68 · Oct 27, 2024 · Updated last year
- RewardBench: the first evaluation tool for reward models. ☆704 · Feb 16, 2026 · Updated last month
- Language models scale reliably with over-training and on downstream tasks ☆100 · Apr 2, 2024 · Updated last year
- AI for Mathematics Paper List ☆17 · Jan 14, 2025 · Updated last year
- Diverse Demonstrations Improve In-context Compositional Generalization ☆12 · Jul 7, 2023 · Updated 2 years ago
- ☆342 · Jun 5, 2025 · Updated 9 months ago