Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"
☆22Oct 31, 2025Updated 4 months ago
Alternatives and similar repositories for FSPO
Users that are interested in FSPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"☆19Oct 3, 2024Updated last year
- ☆12Sep 23, 2024Updated last year
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- GAN paper list in text generation (2017-2020) Say it Often...☆12Jul 10, 2020Updated 5 years ago
- ☆24Jun 18, 2025Updated 9 months ago
- ☆11Oct 25, 2024Updated last year
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆33Nov 1, 2025Updated 4 months ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆18Nov 24, 2025Updated 3 months ago
- Efficient Scaling laws and collaborative pretraining.☆21Sep 18, 2025Updated 6 months ago
- IAN: An Intelligent System for Omics Data Analysis and Discovery☆10Feb 23, 2026Updated last month
- ☆13Jun 25, 2025Updated 8 months ago
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆40Dec 1, 2025Updated 3 months ago
- ☆16Aug 14, 2022Updated 3 years ago
- MPC_controllr based on ROS☆10Feb 12, 2019Updated 7 years ago
- ☆12Sep 14, 2023Updated 2 years ago
- The official implementation of the paper "Free Fine-tuning: A Plug-and-Play Watermarking Scheme for Deep Neural Networks".☆19Apr 19, 2024Updated last year
- Procedural symbolic reasoning data generators suite for synthetic pretraining☆34Mar 13, 2026Updated last week
- DataSciCamp — Data Science Challenge / Competition Deadlines☆15May 26, 2020Updated 5 years ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆20Mar 2, 2026Updated 3 weeks ago
- ☆23Oct 30, 2025Updated 4 months ago
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆68Oct 28, 2025Updated 4 months ago
- ☆58Jun 30, 2023Updated 2 years ago
- Analysis on the MS-MARCO leaderboard regarding the machine reading comprehension task.☆21Dec 14, 2020Updated 5 years ago
- [CCS-LAMPS'24] LLM IP Protection Against Model Merging☆16Oct 14, 2024Updated last year
- ☆24Nov 20, 2025Updated 4 months ago
- A Neural Net for Nudity Detection. Classifier only.☆18Jan 23, 2023Updated 3 years ago
- ☆22Feb 4, 2026Updated last month
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated 11 months ago
- Ecoacoustic analysis platform empowering conservationists to analyze acoustic data and to derive insights about the ecosystem at scale☆17Updated this week
- ☆14Aug 28, 2024Updated last year
- ☆16Jun 25, 2025Updated 8 months ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆17Dec 17, 2025Updated 3 months ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆25Oct 11, 2025Updated 5 months ago
- [ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You☆17Mar 18, 2025Updated last year