gjq100 / Graph-Counselor
☆20 · Updated 2 months ago
Alternatives and similar repositories for Graph-Counselor
Users that are interested in Graph-Counselor are comparing it to the libraries listed below
- ☆24 · Updated 10 months ago
- ☆66 · Updated 4 months ago
- The original Shared Recurrent Memory Transformer implementation ☆30 · Updated last month
- ☆37 · Updated 2 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness ☆80 · Updated last month
- ☆13 · Updated 7 months ago
- ☆48 · Updated 10 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆99 · Updated 2 months ago
- Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning ☆35 · Updated last month
- The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆86 · Updated 3 weeks ago
- Lottery Ticket Adaptation ☆39 · Updated 8 months ago
- ☆47 · Updated 2 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆35 · Updated 10 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems" ☆10 · Updated 7 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆20 · Updated last week
- Verifiers for LLM Reinforcement Learning ☆69 · Updated 3 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆120 · Updated 9 months ago
- A repository for research on medium sized language models. ☆78 · Updated last year
- Resa: Transparent Reasoning Models via SAEs ☆41 · Updated 2 months ago
- Official Repository for Task-Circuit Quantization ☆22 · Updated 2 months ago
- CS194-196 Course Project ☆15 · Updated 5 months ago
- Code for Let LLMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604 ☆45 · Updated 2 weeks ago
- ☆16 · Updated last year
- Process Reward Models That Think ☆47 · Updated last month
- ☆25 · Updated last month
- ☆24 · Updated 6 months ago
- ☆20 · Updated 2 weeks ago
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models ☆28 · Updated last year
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆52 · Updated 7 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆36 · Updated last year