S1s-Z / NOVALinks
[ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"
☆20Updated 5 months ago
Alternatives and similar repositories for NOVA
Users that are interested in NOVA are comparing it to the libraries listed below
Sorting:
- ☆104Updated 7 months ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆139Updated 9 months ago
- [NeurIPS 2025🔥]Main source code of SRPO framework.☆186Updated last month
- [AAAI'26, Oral] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learni…☆43Updated 5 months ago
- ☆52Updated 4 months ago
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆431Updated 3 months ago
- [TMLR'25] The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆53Updated 9 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆235Updated 7 months ago
- SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL☆197Updated 7 months ago
- Official repository of DARE: dLLM Alignment and Reinforcement Executor☆153Updated last week
- ☆127Updated 3 months ago
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul…☆101Updated 5 months ago
- [ICML2025] Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment☆138Updated 2 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆59Updated 10 months ago
- A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…☆221Updated last year
- Official Code for MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training☆36Updated 2 months ago
- The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.☆56Updated 6 months ago
- ☆205Updated 3 weeks ago
- Selective Prompt Anchoring☆96Updated last month
- ☆58Updated last month
- Marco Search Agent for Realistic and Challenging Agentic Search☆240Updated 2 months ago
- Code for "FaithLens: Detecting and Explaining Faithfulness Hallucination"☆96Updated last week
- [ICLR'2025 Spotlight] Official repository for "SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding"☆78Updated last month
- [ACL 25 main] Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model☆42Updated 2 months ago
- ☆198Updated 3 months ago
- [AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic al…☆123Updated last month
- Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions☆200Updated last month
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆296Updated 2 months ago
- [NeurIPS 2024] GuardT2I: Defending Text-to-Image Models from Adversarial Prompts☆55Updated 7 months ago
- [NeurIPS 2025] Hybrid Latent Reasoning via Reinforcement Learning☆170Updated 4 months ago