WangHanLinHenry/STeCa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WangHanLinHenry/STeCa)

WangHanLinHenry / STeCa

(ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"

☆29

Alternatives and similar repositories for STeCa

Users that are interested in STeCa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WangHanLinHenry / SPA-RL-Agent
View on GitHub
Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"
☆89Sep 13, 2025Updated 10 months ago
hemingkx / Whisper
View on GitHub
[ACL 2026] Enabling Efficient Reasoning in LLMs via Black-box Persuasive Prompting
☆22Jan 9, 2026Updated 6 months ago
loyiv / ITP
View on GitHub
Code of Paper: Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
☆16Mar 17, 2026Updated 4 months ago
Reason-Wang / NAT
View on GitHub
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆28Mar 14, 2024Updated 2 years ago
Singularity0104 / equilibrium-planner
View on GitHub
[ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling
☆13May 5, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
EIT-NLP / BLEUless_DocMT
View on GitHub
☆14Nov 19, 2024Updated last year
wangjs9 / Muffin
View on GitHub
Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)
☆17Jul 2, 2024Updated 2 years ago
cooperleong00 / ToxificationReversal
View on GitHub
Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)
☆18Oct 17, 2023Updated 2 years ago
UMass-Embodied-AGI / CHAIC
View on GitHub
[NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge…
☆25May 2, 2025Updated last year
ModalityDance / Awesome-Agent-as-a-Judge
View on GitHub
"A Survey on Agent-as-a-Judge"
☆138May 11, 2026Updated 2 months ago
EIT-NLP / Connector-Selection-for-MLLM
View on GitHub
[EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda…
☆17Dec 13, 2024Updated last year
yuyq18 / StepTool
View on GitHub
☆36May 24, 2025Updated last year
ByteDance-Seed / Agent-R
View on GitHub
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆174Oct 20, 2025Updated 9 months ago
EIT-NLP / AccuracyParadox-RLHF
View on GitHub
[EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…
☆13Nov 11, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Yui010206 / Adaptive-Visual-Imagination-Control
View on GitHub
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18Jun 2, 2026Updated last month
YiCheng98 / IntegrativeDecoding
View on GitHub
Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"
☆33Apr 12, 2025Updated last year
hihihihiwsf / KADE
View on GitHub
☆10Aug 16, 2022Updated 3 years ago
taolusi / SECURE
View on GitHub
ACL'2024-Main: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Languag…
☆12Sep 19, 2025Updated 10 months ago
iesl / CE2ERE
View on GitHub
Constrained learning using boxes for event-event relation extraction
☆12Aug 5, 2022Updated 3 years ago
siyuyuan / KPCE
View on GitHub
Code for our ACL 2023 paper: Causality-aware Concept Extraction based on Knowledge-guided Prompting
☆14Aug 19, 2023Updated 2 years ago
SijiaCui / play-urts
View on GitHub
☆15Oct 28, 2024Updated last year
lmsdss / IPO
View on GitHub
IPO: Interpretable Prompt Optimization for Vision-Language Models(NeurIPS 2024)
☆15Jun 12, 2026Updated last month
iwangjian / Coding-Tutor
View on GitHub
[ACL 2025 Findings] Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
☆90Jun 2, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
pettingllms-ai / PettingLLMs
View on GitHub
[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system; [arxiv] MetaAgent-X: End-to-End Reinforcement Learning Automatic Mult…
☆205May 15, 2026Updated 2 months ago
agromanou / CRAB
View on GitHub
A benchmark for assessing the strength of causal relationships between real-world events (EMNLP 2023).
☆15Nov 23, 2023Updated 2 years ago
hemingkx / TokenSkip
View on GitHub
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆224Nov 30, 2025Updated 7 months ago
iwangjian / TRIP
View on GitHub
[TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
☆14Oct 18, 2025Updated 9 months ago
OpenCausaLab / CELLO
View on GitHub
☆22Nov 5, 2024Updated last year
chaizwj / my_Blog
View on GitHub
自己搭建的个人博客，前端用的是Vue，后端用的是SpringBoot
☆11Aug 18, 2024Updated last year
Aaron617 / AgentGen
View on GitHub
[KDD 2025] AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation
☆34Nov 18, 2025Updated 8 months ago
cxzhou35 / notebook
View on GitHub
Zicx's Notebook.
☆11Nov 7, 2025Updated 8 months ago
TheAgentCompany / experiments
View on GitHub
Open sourced result for The Agent Company
☆23Jun 26, 2026Updated 3 weeks ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
iLearn-Lab / ICML24-RoboMP2
View on GitHub
[ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…
☆12Apr 4, 2026Updated 3 months ago
hemingkx / SWIFT
View on GitHub
[ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
☆70Feb 21, 2025Updated last year
wjhou / Radar
View on GitHub
[ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection
☆34Jul 23, 2025Updated 11 months ago
AgentForceTeamOfficial / UA2-Agent
View on GitHub
Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…
☆19Nov 12, 2024Updated last year
jiaming-zhou / Zero-WAM
View on GitHub
Zero-WAM, an in-context world model for zero-shot robotic task generalization
☆31Jul 8, 2026Updated last week
drdh / Synergy-RL
View on GitHub
This repository is the official implementation of Low-Rank Modular Reinforcement Learning via Muscle Synergy.
☆12Oct 27, 2022Updated 3 years ago
mehdie79 / RTM_latent_refinement
View on GitHub
☆22Jul 10, 2026Updated last week