[AAAI'26, Oral 🌟] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning"
☆43Jul 16, 2025Updated 8 months ago
Alternatives and similar repositories for CANOE
Users that are interested in CANOE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆21Jul 23, 2025Updated 8 months ago
- Code for "FaithLens: Detecting and Explaining Faithfulness Hallucination"☆101Jan 4, 2026Updated 2 months ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆134Nov 3, 2025Updated 4 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- [EMNLP'25, SAC Highlights Award 🏆] Code for "GATEAU: Selecting Influential Samples for Long Context Alignment"☆40Jun 4, 2025Updated 9 months ago
- [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni…☆39Apr 28, 2025Updated 10 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆123May 6, 2025Updated 10 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 5 months ago
- ☆25Dec 13, 2024Updated last year
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆21Feb 17, 2025Updated last year
- The Collapse of Patches☆58Dec 3, 2025Updated 3 months ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated 11 months ago
- ☆13Aug 26, 2024Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆51Sep 4, 2025Updated 6 months ago
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆16Aug 1, 2025Updated 7 months ago
- A full fledged mistral+wandb☆13Aug 16, 2024Updated last year
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- ☆13Feb 21, 2025Updated last year
- ☆14Oct 27, 2025Updated 4 months ago
- ☆11Dec 6, 2022Updated 3 years ago
- ☆22Nov 16, 2025Updated 4 months ago
- Papers and Related work to help learn ICL conveniently for everyone who interests.☆14Feb 28, 2024Updated 2 years ago
- TeleMem is a high-performance drop-in replacement for Mem0, featuring semantic deduplication, long-term dialogue memory, and multimodal v…☆456Jan 28, 2026Updated last month
- ☆14Oct 29, 2021Updated 4 years ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆38May 31, 2025Updated 9 months ago
- 该项目主要用来做 tcp 穿透内网(这是客户端)☆16Oct 23, 2019Updated 6 years ago
- [ICLR 2025] No Preference Left Behind: Group Distributional Preference Optimization☆15Apr 21, 2025Updated 11 months ago
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion☆137Oct 28, 2024Updated last year
- 一句话高效搞定所有待办 Handle all to-dos efficiently in a word☆35Dec 23, 2025Updated 3 months ago
- AAAI 2025: Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs☆18Nov 9, 2024Updated last year
- ☆36Jul 7, 2025Updated 8 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Jul 4, 2025Updated 8 months ago
- [AAAI 2024] The dataset used in our paper "Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling", AAAI 2024.☆22Feb 24, 2026Updated 3 weeks ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆42Mar 23, 2023Updated 3 years ago
- ☆16Jun 25, 2025Updated 8 months ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- ☆17May 31, 2023Updated 2 years ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆76May 25, 2025Updated 9 months ago