shenao-zhang / BARLView external linksLinks
Bayes-Adaptive RL for LLM Reasoning
☆45May 28, 2025Updated 8 months ago
Alternatives and similar repositories for BARL
Users that are interested in BARL are comparing it to the libraries listed below
Sorting:
- ☆23Sep 29, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- Resa: Transparent Reasoning Models via SAEs☆47Sep 23, 2025Updated 4 months ago
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 6 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 7 months ago
- The code for ”T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval“☆21Jul 30, 2025Updated 6 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆12Sep 22, 2025Updated 4 months ago
- ☆12Feb 26, 2025Updated 11 months ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆26Nov 7, 2025Updated 3 months ago
- pytorch implementation of "S3NET: GRAPH REPRESENTATIONAL NETWORK FOR SKETCH RECOGNITION"☆10Oct 6, 2020Updated 5 years ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated 11 months ago
- NeqLIPS: a powerful Olympiad-level inequality prover☆39Sep 7, 2025Updated 5 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 8 months ago
- ☆26Jun 5, 2025Updated 8 months ago
- CS194-196 Course Project☆14Feb 20, 2025Updated 11 months ago
- ☆15Sep 22, 2024Updated last year
- ☆76Jan 8, 2026Updated last month
- Ongoing research project for code&math LLMs☆27Jul 4, 2025Updated 7 months ago
- A simple RNN meta-learner☆10Dec 17, 2018Updated 7 years ago
- ☆35Mar 12, 2025Updated 11 months ago
- ☆70Jun 18, 2025Updated 7 months ago
- Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach. This repository includes the implementation of…☆16Jun 1, 2024Updated last year
- ☆33Jul 9, 2025Updated 7 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- ☆19Mar 25, 2025Updated 10 months ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 9 months ago
- Algorithms for approximate attention in LLMs☆21Apr 14, 2025Updated 10 months ago
- ☆16Jul 23, 2024Updated last year
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution☆24Nov 11, 2025Updated 3 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 8 months ago
- this is for fun, ain't it grand!☆21Sep 18, 2025Updated 4 months ago
- ☆123Feb 21, 2025Updated 11 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- ☆19Mar 10, 2025Updated 11 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆22Mar 18, 2025Updated 10 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 2 months ago
- Just a repository that will house some MLPs and their variants, so to avoid having to reimplement them again and again for different proj…☆45Jan 29, 2026Updated 2 weeks ago
- ☆25Nov 19, 2025Updated 2 months ago