[AAAI'26, Oral] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning"
☆43Jul 16, 2025Updated 7 months ago
Alternatives and similar repositories for CANOE
Users that are interested in CANOE are comparing it to the libraries listed below
Sorting:
- [ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆21Jul 23, 2025Updated 7 months ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆130Nov 3, 2025Updated 4 months ago
- Code for "FaithLens: Detecting and Explaining Faithfulness Hallucination"☆100Jan 4, 2026Updated last month
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆16Aug 1, 2025Updated 7 months ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated 11 months ago
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆21Feb 17, 2025Updated last year
- ☆25Dec 13, 2024Updated last year
- [EMNLP'25, SAC Highlights Paper Award] Code for "GATEAU: Selecting Influential Samples for Long Context Alignment"☆40Jun 4, 2025Updated 8 months ago
- The Collapse of Patches☆58Dec 3, 2025Updated 3 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆50Sep 4, 2025Updated 5 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- Time Series Analysis and Its Applications, Ed 5☆20Dec 17, 2025Updated 2 months ago
- ☆36Jul 7, 2025Updated 7 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆42Mar 23, 2023Updated 2 years ago
- Homework for STAT 205A - Berkeley☆13Dec 9, 2014Updated 11 years ago
- A simple exam generator and grader written in Python with OpenCV☆14Jan 14, 2026Updated last month
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆25Jan 5, 2026Updated last month
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- ☆16Jun 25, 2025Updated 8 months ago
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"☆46Sep 21, 2023Updated 2 years ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆73May 25, 2025Updated 9 months ago
- WORKBank Database derived from large-scale audit of worker desire and technological capability of AI agents for work.☆22Jul 23, 2025Updated 7 months ago
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- mouse pet-ct image segmentation☆12Feb 19, 2023Updated 3 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- ☆14Dec 25, 2024Updated last year
- Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference☆10Jul 10, 2023Updated 2 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime☆13Feb 12, 2024Updated 2 years ago
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year
- ☆14Dec 18, 2024Updated last year
- HealthBench☆16Sep 15, 2025Updated 5 months ago
- ☆28Jan 5, 2026Updated last month
- Diffusing States and Matching Scores: A New Framework for Imitation Learning☆22Nov 16, 2024Updated last year
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- ☆12Jul 25, 2023Updated 2 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆25May 31, 2025Updated 9 months ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago