IAAR-Shanghai / SEAP
☆20 Updated last month
Alternatives and similar repositories for SEAP
Users who are interested in SEAP are comparing it to the repositories listed below
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆77 Updated 5 months ago
- ARM: Adaptive Reasoning Model ☆44 Updated last month
- ☆21 Updated 2 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models ☆44 Updated 7 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆19 Updated last month
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆53 Updated last year
- Code for Heima ☆50 Updated 2 months ago
- ☆136 Updated last month
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆67 Updated last month
- ☆32 Updated 3 months ago
- ☆65 Updated last week
- ☆48 Updated this week
- Large Language Models Can Self-Improve in Long-context Reasoning ☆71 Updated 7 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆105 Updated last month
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆74 Updated 9 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning" ☆67 Updated 2 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆36 Updated 5 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization ☆14 Updated 5 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations ☆121 Updated 3 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning" ☆60 Updated this week
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? ☆31 Updated 3 weeks ago
- Repo for "Z1: Efficient Test-time Scaling with Code" ☆63 Updated 3 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆74 Updated 3 months ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents ☆36 Updated 2 weeks ago
- ☆24 Updated 4 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆38 Updated 4 months ago
- ☆36 Updated 3 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆16 Updated 6 months ago
- ☆125 Updated last month
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆26 Updated 4 months ago