[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"
☆40May 22, 2025Updated 9 months ago
Alternatives and similar repositories for PolyMath
Users that are interested in PolyMath are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆26Dec 23, 2024Updated last year
- ☆13May 21, 2024Updated last year
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq☆29Mar 3, 2023Updated 2 years ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Mar 9, 2025Updated 11 months ago
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 3 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- ☆20Apr 16, 2025Updated 10 months ago
- [ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models☆41Jun 4, 2024Updated last year
- Evaluation utilities based on SymPy.☆21Dec 12, 2024Updated last year
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 3 months ago
- A simple and readable neural machine translation system☆24Mar 6, 2022Updated 3 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆82Jun 12, 2023Updated 2 years ago
- ☆45Jun 7, 2021Updated 4 years ago
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Jan 21, 2024Updated 2 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆23Jun 28, 2024Updated last year
- ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry☆47Jan 5, 2026Updated last month
- ☆23Nov 15, 2022Updated 3 years ago
- Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation☆26Sep 30, 2022Updated 3 years ago
- ☆31Nov 9, 2024Updated last year
- Rubik ESP32 esp-idf Device driver library.☆12Jul 3, 2021Updated 4 years ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆282Sep 25, 2025Updated 5 months ago
- Verilog code for a low power RFID chip that will communicate with I2C sensors.☆13Apr 18, 2014Updated 11 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆36Aug 29, 2025Updated 5 months ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.☆136Dec 12, 2023Updated 2 years ago
- ☆33Oct 31, 2024Updated last year
- A list of conferences and journals relevant to machine translation☆33Mar 17, 2022Updated 3 years ago
- [ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation☆31Oct 6, 2023Updated 2 years ago
- JudgeLRM: Large Reasoning Models as a Judge☆41Jan 29, 2026Updated 3 weeks ago
- image retrieval using metric learning☆10Nov 22, 2022Updated 3 years ago
- LC6500DMD python control☆11Nov 15, 2016Updated 9 years ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆88Feb 15, 2025Updated last year
- ☆72Jun 10, 2025Updated 8 months ago
- Scaling Preference Data Curation via Human-AI Synergy☆141Jul 3, 2025Updated 7 months ago
- EBAZ4205 Board FPGA project☆14Oct 20, 2023Updated 2 years ago
- Light Cube using PYNQ☆10Aug 4, 2018Updated 7 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ffmpeg strftime with milliseconds support☆10May 28, 2017Updated 8 years ago