[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"
☆43May 22, 2025Updated 9 months ago
Alternatives and similar repositories for PolyMath
Users that are interested in PolyMath are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- Dive-into-LLMs Tutorial for Beginners☆12May 14, 2024Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆26Dec 23, 2024Updated last year
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆35Sep 26, 2024Updated last year
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Mar 9, 2025Updated last year
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq☆29Mar 3, 2023Updated 3 years ago
- [ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models☆42Jun 4, 2024Updated last year
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 3 years ago
- A simple and readable neural machine translation system☆24Mar 6, 2022Updated 4 years ago
- Evaluation utilities based on SymPy.☆21Dec 12, 2024Updated last year
- ☆20Apr 16, 2025Updated 11 months ago
- Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation☆26Sep 30, 2022Updated 3 years ago
- ☆45Jun 7, 2021Updated 4 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆82Jun 12, 2023Updated 2 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆43Oct 31, 2025Updated 4 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆287Sep 25, 2025Updated 5 months ago
- Feeling confused about super alignment? Here is a reading list☆44Jan 9, 2024Updated 2 years ago
- ☆16Oct 11, 2025Updated 5 months ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 4 months ago
- [WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian☆23May 4, 2023Updated 2 years ago
- ☆23Nov 15, 2022Updated 3 years ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- Code for "Multi-Domain Neural Machine Translation with Word-Level Domain Context Discrimination"(EMNLP2019)☆31Nov 9, 2018Updated 7 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆23Jun 28, 2024Updated last year
- [ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation☆31Oct 6, 2023Updated 2 years ago
- Modern development with Python in 2024☆12Mar 9, 2026Updated last week
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆88Feb 15, 2025Updated last year
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆187May 20, 2025Updated 9 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆95Dec 19, 2024Updated last year
- ☆31Nov 9, 2024Updated last year
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated 10 months ago
- Source code for SIGIR 2022 paper.☆16Apr 25, 2022Updated 3 years ago
- [TACL 2024] MAPS enables LLMs🤖 to mimic the human😁 translation process.☆144Jun 7, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Convert MathML to Latex for OneNote to Markdown☆12Jul 27, 2022Updated 3 years ago
- Code for "Towards Robust k-Nearest-Neighbor Machine Translation" (EMNLP 2022)☆12Oct 18, 2022Updated 3 years ago
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago