Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"
☆23Oct 31, 2025Updated 5 months ago
Alternatives and similar repositories for FSPO
Users that are interested in FSPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"☆19Oct 3, 2024Updated last year
- codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"☆12Feb 10, 2025Updated last year
- Universal differential equations for ecologists☆14Mar 24, 2026Updated 2 weeks ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- ☆49Jan 7, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆11Dec 18, 2024Updated last year
- A Beamer Theme of UCAS for academic report, thesis and talk.☆18Oct 12, 2024Updated last year
- ☆15Mar 30, 2025Updated last year
- ☆24Jun 18, 2025Updated 9 months ago
- ☆11Oct 25, 2024Updated last year
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆34Nov 1, 2025Updated 5 months ago
- ☆10Nov 29, 2021Updated 4 years ago
- [CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang☆14Jan 5, 2024Updated 2 years ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Official Implementation for "Purifying Quantization-conditioned Backdoors via Layer-wise Activation Correction with Distribution Approxim…☆12Aug 14, 2024Updated last year
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 6 years ago
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆18Nov 24, 2025Updated 4 months ago
- Efficient Scaling laws and collaborative pretraining.☆22Sep 18, 2025Updated 6 months ago
- IAN: An Intelligent System for Omics Data Analysis and Discovery☆10Feb 23, 2026Updated last month
- ☆13Jun 25, 2025Updated 9 months ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆40Dec 1, 2025Updated 4 months ago
- Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)☆13Dec 23, 2019Updated 6 years ago
- [AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".☆22Mar 18, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The code implementation of MuScleLoRA (Accepted in ACL 2024)☆10Dec 1, 2024Updated last year
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆17Sep 20, 2025Updated 6 months ago
- MPC_controllr based on ROS☆10Feb 12, 2019Updated 7 years ago
- ☆12Sep 14, 2023Updated 2 years ago
- The official implementation of the paper "Free Fine-tuning: A Plug-and-Play Watermarking Scheme for Deep Neural Networks".☆19Apr 19, 2024Updated last year
- Procedural data generators suite for synthetic pretraining and symbolic reasoning☆36Updated this week
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆21Mar 2, 2026Updated last month
- ☆58Jun 30, 2023Updated 2 years ago
- ☆23Oct 30, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆71Oct 28, 2025Updated 5 months ago
- Analysis on the MS-MARCO leaderboard regarding the machine reading comprehension task.☆21Dec 14, 2020Updated 5 years ago
- [CCS-LAMPS'24] LLM IP Protection Against Model Merging☆16Oct 14, 2024Updated last year
- ☆24Nov 20, 2025Updated 4 months ago
- A Neural Net for Nudity Detection. Classifier only.☆19Jan 23, 2023Updated 3 years ago
- ☆25Jun 10, 2025Updated 10 months ago
- This repo contains visualization code of our ReplicaPano Dataset.☆15Feb 7, 2025Updated last year