gjq100 / Graph-Counselor
☆20 Updated 3 months ago
Alternatives and similar repositories for Graph-Counselor
Users who are interested in Graph-Counselor are comparing it to the repositories listed below
- ☆23 Updated 11 months ago
- ☆40 Updated 3 months ago
- ☆67 Updated 5 months ago
- The original Shared Recurrent Memory Transformer implementation ☆30 Updated last month
- Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning ☆40 Updated 2 months ago
- Resa: Transparent Reasoning Models via SAEs ☆41 Updated 3 weeks ago
- [ACL 2025] Agentic Knowledgeable Self-awareness ☆81 Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆122 Updated 10 months ago
- ☆48 Updated 3 months ago
- Lottery Ticket Adaptation ☆39 Updated 9 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆104 Updated 2 months ago
- ☆49 Updated 11 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation. ☆49 Updated last year
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems" ☆10 Updated 8 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆94 Updated last week
- ☆26 Updated 2 months ago
- UQ: Assessing Language Models on Unsolved Questions ☆23 Updated last week
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆35 Updated 11 months ago
- accompanying material for sleep-time compute paper ☆107 Updated 4 months ago
- ☆78 Updated 11 months ago
- Verifiers for LLM Reinforcement Learning ☆71 Updated 4 months ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval ☆28 Updated last month
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ☆31 Updated this week
- Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents" ☆42 Updated 3 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆61 Updated last month
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM ☆60 Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆34 Updated last year
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models ☆28 Updated last year
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models ☆41 Updated 4 months ago
- ☆13 Updated 8 months ago