The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration" (ICML 2026)
☆24Feb 4, 2026Updated 3 months ago
Alternatives and similar repositories for DARS
Users that are interested in DARS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆41Apr 13, 2026Updated last month
- ☆17Jul 12, 2025Updated 10 months ago
- Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471☆21Jun 19, 2024Updated last year
- [ICLR 2026] JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence☆80May 9, 2026Updated 2 weeks ago
- An extension of micro mouse on WEBOTS using the flood filled algorithm, A star, Dijkstra’s and Breadth first search algorithm for moving …☆26Jun 22, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official Repository of "Learning what reinforcement learning can't"☆84Dec 30, 2025Updated 4 months ago
- Python Framework for sparse neural networks☆19Apr 28, 2017Updated 9 years ago
- Metaskill: A Meta-Skill for Autonomous AI Agent Team Generation☆40Feb 23, 2026Updated 3 months ago
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization☆13Aug 3, 2024Updated last year
- [ICML 2026][Ultra Powerful Few-Step Diffusion RL] TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward☆97May 17, 2026Updated last week
- [AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities☆15Apr 26, 2024Updated 2 years ago
- ☆19Aug 4, 2025Updated 9 months ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- ACL21 Math Word Problem Solving with Explicit Numerical Values☆13Nov 10, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 7 months ago
- Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific a…☆19Dec 25, 2024Updated last year
- Code for the Click-Through Rate Prediction Kaggle challenge from Avazu☆11Feb 5, 2017Updated 9 years ago
- ☆72Oct 23, 2025Updated 7 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆33Mar 26, 2026Updated 2 months ago
- Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Langu…☆14Apr 7, 2025Updated last year
- Covert Keras models to Pytorch☆12Dec 22, 2018Updated 7 years ago
- ☆135May 13, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is the github to open source benchmark AdvancedIF, see LAMA L1387358RCRO☆34Nov 26, 2025Updated 6 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Jun 26, 2025Updated 11 months ago
- [NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario☆31Oct 5, 2025Updated 7 months ago
- Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe☆441May 12, 2026Updated 2 weeks ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated last year
- ☆11Sep 17, 2024Updated last year
- Implementation of some active contour model / Snake algorithms☆14Jan 4, 2018Updated 8 years ago
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Nov 25, 2023Updated 2 years ago
- ☆62Apr 16, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Jan 19, 2022Updated 4 years ago
- ☆17Apr 10, 2025Updated last year
- The OlymMATH dataset☆24Jun 1, 2025Updated 11 months ago
- ☆68Oct 27, 2025Updated 6 months ago
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆24Mar 16, 2025Updated last year
- AlignX-Family is an open-source research suite for advancing personalization in large language models-spanning data, code, models, and be…☆20Jan 12, 2026Updated 4 months ago
- This repository provides the dataset introduced by our WSSTG paper☆13Jul 21, 2019Updated 6 years ago