[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"
☆28May 28, 2024Updated last year
Alternatives and similar repositories for OOD-Math-Reasoning
Users that are interested in OOD-Math-Reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"☆43May 22, 2025Updated 11 months ago
- Dive-into-LLMs Tutorial for Beginners☆23May 14, 2024Updated 2 years ago
- ☆13May 21, 2024Updated last year
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆22Jun 28, 2024Updated last year
- [ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation☆31Oct 6, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fast spectral clustering, described in the NeurIPS'23 paper "Fast and Simple Spectral Clustering in Theory and Practice"☆17Jun 19, 2025Updated 11 months ago
- [ICML2024]Adaptive decoding balances the diversity and coherence of open-ended text generation.☆19Jun 2, 2024Updated last year
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆103Jan 11, 2026Updated 4 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence☆17May 19, 2025Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆31Mar 5, 2024Updated 2 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- [TACL 2024] MAPS enables LLMs🤖 to mimic the human😁 translation process.☆145Jun 7, 2024Updated last year
- [AAAI 2025] Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification☆20Apr 17, 2025Updated last year
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A mobile GUI search engine using a vision-language model☆14May 5, 2025Updated last year
- [ICLR24] code for LSN☆10Oct 28, 2024Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆189May 20, 2025Updated last year
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆26Mar 4, 2025Updated last year
- This the implementation of LeCo☆32Jan 20, 2025Updated last year
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.☆135Dec 12, 2023Updated 2 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆43Oct 31, 2025Updated 6 months ago
- [ICASSP 2022] Official PyTorch Implementation for "Attention Probe: Vision Transformer Distillation in the Wild" (ICASSP 2022)☆11Jan 23, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code for "Fusion Label Enhancement for Multi-Label Learning" in IJCAI-ECAI 2022.☆10Apr 4, 2023Updated 3 years ago
- ☆30Jun 19, 2023Updated 2 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 8 months ago
- ☆23Oct 11, 2024Updated last year
- ☆20Nov 3, 2024Updated last year
- Bayesian Deep-Learning Structured Illumination Microscopy Enables Reliable Super-Resolution Imaging with Uncertainty Quantification☆19Apr 25, 2025Updated last year
- Clean Code concepts adapted for Python☆38Oct 6, 2022Updated 3 years ago
- ☆10Dec 21, 2022Updated 3 years ago
- ☆26Oct 24, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Jan 14, 2026Updated 4 months ago
- Official Repository for ICML 2024 Paper "OT-CLIP: Understanding and Generalizing CLIP via Optimal Transport"☆23Dec 4, 2025Updated 5 months ago
- ☆15Dec 12, 2023Updated 2 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- ☆28Aug 27, 2025Updated 8 months ago
- Cross-Attention Guided Loss-Based Deep Dual-Branch Fusion Network for Liver Tumor Classification☆15Sep 26, 2024Updated last year
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16May 4, 2024Updated 2 years ago