IAAR-Shanghai / SEAPLinks

☆20

Alternatives and similar repositories for SEAP

Users that are interested in SEAP are comparing it to the libraries listed below

Sorting:

horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆77Updated 5 months ago
TEAM-ARM / arm
ARM: Adaptive Reasoning Model
☆44Updated last month
aeroplanepaper / GRPO-LEAD
☆21Updated 2 months ago
Quehry / HelloBench
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
☆44Updated 7 months ago
rhyang2021 / ARIA
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆19Updated last month
dvlab-research / Q-LLM
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
☆53Updated last year
shawnricecake / Heima
Code for Heima
☆50Updated 2 months ago
THU-KEG / AdaptThink
☆136Updated last month
MingyuJ666 / Disentangling-Memory-and-Reasoning
[ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.
☆67Updated last month
NuoJohnChen / JudgeLRM
☆32Updated 3 months ago
yhy-2000 / VideoDeepResearch
☆65Updated last week
haon-chen / MoCa
☆48Updated this week
SihengLi99 / SEALONG
Large Language Models Can Self-Improve in Long-context Reasoning
☆71Updated 7 months ago
RUCAIBox / Virgo
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆105Updated last month
tianyi-lab / MoE-Embedding
Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆74Updated 9 months ago
NineAbyss / S2R
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
☆67Updated 2 months ago
beichenzbc / BoostStep
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆36Updated 5 months ago
LCM-Lab / LCM_Stack
Code for paper: Long cOntext aliGnment via efficient preference Optimization
☆14Updated 5 months ago
IAAR-Shanghai / xVerify
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
☆121Updated 3 months ago
MikeWangWZHL / PAPO
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
☆60Updated this week
GaryStack / MMR-V
Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?
☆31Updated 3 weeks ago
efficientscaling / Z1
Repo for "Z1: Efficient Test-time Scaling with Code"
☆63Updated 3 months ago
YangLing0818 / SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆74Updated 3 months ago
MozerWang / AMPO
[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents
☆36Updated 2 weeks ago
TemporaryLoRA / Block-Attention
☆24Updated 4 months ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆38Updated 4 months ago
UCSB-NLP-Chang / ThinkPrune
☆36Updated 3 months ago
jinzhuoran / RAG-RewardBench
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆16Updated 6 months ago
GeniusHTX / TALE
☆125Updated last month
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆26Updated 4 months ago