S1s-Z / NOVALinks
[ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"
☆20Updated 2 weeks ago
Alternatives and similar repositories for NOVA
Users that are interested in NOVA are comparing it to the libraries listed below
Sorting:
- ☆101Updated 3 weeks ago
- Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning"☆28Updated 2 weeks ago
- [ACL 25 main] Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model☆34Updated last month
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆120Updated last week
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆50Updated 2 months ago
- SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL☆185Updated last month
- ☆44Updated 2 months ago
- ☆163Updated last week
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆59Updated 3 months ago
- ☆320Updated 3 months ago
- Hybrid Latent Reasoning via Reinforcement Learning☆120Updated 3 weeks ago
- ☆14Updated 2 years ago
- [ICML2025] Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment☆102Updated last week
- ☆63Updated 2 weeks ago
- ☆62Updated 3 months ago
- Official implementation of paper "Multi-Level Collaboration in Model Merging"☆41Updated 2 months ago
- We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…☆126Updated this week
- [NeurIPS 2024] GuardT2I: Defending Text-to-Image Models from Adversarial Prompts☆53Updated 3 weeks ago
- ☆84Updated last week
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆39Updated 4 months ago
- ☆136Updated 4 months ago
- ☆36Updated 11 months ago
- ☆157Updated 2 weeks ago
- [EMNLP 2024] DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models☆70Updated 2 weeks ago
- OpenAPIDesigner is an open-source OpenAPI specification design tool that allows developers to design, write, and validate OpenAPI specifi…☆193Updated 3 weeks ago
- ☆107Updated this week
- ☆23Updated 9 months ago
- Repository of "Modal-NexT: toward unified heterogeneous cellular data integration"☆80Updated last week
- 以jax为后端的类似keras的框架☆98Updated 2 years ago
- AutoRLAIF is a cutting-edge framework designed to revolutionize the fine-tuning of large language models through Reinforcement Learning …☆92Updated 7 months ago