[AAAI'26, Oral 🌟] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning"
☆45Jul 16, 2025Updated 10 months ago
Alternatives and similar repositories for CANOE
Users that are interested in CANOE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆21Jul 23, 2025Updated 10 months ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆165Nov 3, 2025Updated 6 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- [CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation☆333Dec 15, 2025Updated 5 months ago
- [EMNLP'25, SAC Highlights Award 🏆] Code for "GATEAU: Selecting Influential Samples for Long Context Alignment"☆40Jun 4, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni…☆39Apr 28, 2025Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆126May 6, 2025Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 7 months ago
- The Collapse of Patches☆58Dec 3, 2025Updated 5 months ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated last year
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆23Feb 17, 2025Updated last year
- [COLING'22] Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER"☆45Sep 10, 2024Updated last year
- Code for "From Context to Skills: Can Language Models Learn from Context Skillfully? "☆221May 11, 2026Updated last week
- ☆13Aug 26, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆51Sep 4, 2025Updated 8 months ago
- Official implementation of our paper "Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Opera…☆11Sep 20, 2024Updated last year
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆20Aug 1, 2025Updated 9 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- ☆13Feb 21, 2025Updated last year
- ☆22Jan 5, 2024Updated 2 years ago
- An intelligent Application-layer DDoS defense system focused on explainability. This allows users to trace the specific cause of a reject…☆109May 5, 2026Updated 2 weeks ago
- ☆14Oct 27, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆22Nov 16, 2025Updated 6 months ago
- ☆11Dec 6, 2022Updated 3 years ago
- TeleMem is a high-performance drop-in replacement for Mem0, featuring semantic deduplication, long-term dialogue memory, and multimodal v…☆458May 8, 2026Updated 2 weeks ago
- ☆14Oct 29, 2021Updated 4 years ago
- 该项目主要用来做 tcp 穿透内网(这是客户端)☆16Oct 23, 2019Updated 6 years ago
- 一句话高效搞定所有待办 Handle all to-dos efficiently in a word☆35Dec 23, 2025Updated 5 months ago
- Bert文本分类,EMA+AD☆19May 19, 2020Updated 6 years ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆40May 31, 2025Updated 11 months ago
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion☆140Oct 28, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AAAI 2025: Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs☆18Nov 9, 2024Updated last year
- ☆36Jul 7, 2025Updated 10 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search [SIGIR 2026]☆63Jul 4, 2025Updated 10 months ago
- [AAAI 2024] The dataset used in our paper "Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling", AAAI 2024.☆22Feb 24, 2026Updated 2 months ago
- ☆16Jun 25, 2025Updated 10 months ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 4 years ago