☆399Nov 7, 2025Updated 3 months ago
Alternatives and similar repositories for reasoning-with-sampling
Users that are interested in reasoning-with-sampling are comparing it to the libraries listed below
Sorting:
- Rethinking the Trust Region in LLM Reinforcement Learning☆39Feb 25, 2026Updated last week
- Official Repository of Native Parallel Reasoner☆102Feb 5, 2026Updated 3 weeks ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Nov 17, 2024Updated last year
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆109Jan 30, 2026Updated last month
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆66Jan 13, 2026Updated last month
- ☆21Mar 17, 2025Updated 11 months ago
- ☆15Apr 26, 2025Updated 10 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆47Aug 13, 2025Updated 6 months ago
- Run GEPA on your favorite non-python libraries.☆33Jan 22, 2026Updated last month
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆542Feb 15, 2026Updated 2 weeks ago
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks☆20Sep 18, 2025Updated 5 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated last year
- ☆22Sep 16, 2025Updated 5 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆75Jan 23, 2026Updated last month
- The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…☆44Jan 5, 2026Updated last month
- [Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation☆21Oct 1, 2025Updated 5 months ago
- [ICLR 2026] Quantile Advantage Estimation for Entropy-Safe Reasoning☆23Oct 14, 2025Updated 4 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆32Feb 6, 2026Updated 3 weeks ago
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆38Feb 19, 2026Updated last week
- A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on anal…☆34Nov 26, 2025Updated 3 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆116Dec 30, 2025Updated 2 months ago
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha…☆15May 18, 2024Updated last year
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 3 months ago
- ☆19Aug 4, 2025Updated 6 months ago
- ☆21Jul 3, 2025Updated 8 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆74Nov 4, 2025Updated 3 months ago
- AI eXplainable Inference & Search. Open Sourcing on-premise, ultra-fast latency intelligence to all.☆36Feb 28, 2025Updated last year
- [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluat…☆42Jun 11, 2025Updated 8 months ago
- Shaping capabilities with token-level pretraining data filtering☆83Jan 28, 2026Updated last month
- rl from zero pretrain, can it be done? yes.☆287Sep 28, 2025Updated 5 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- A JAVA package for real-time signal extraction in large multivariate time series☆15Mar 17, 2024Updated last year
- Prompts and evaluation data for LLMs on real world coding and writing tasks☆16Sep 13, 2025Updated 5 months ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆15Oct 27, 2023Updated 2 years ago
- ☆19Nov 30, 2024Updated last year
- ☆21Dec 5, 2024Updated last year