Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"
☆24Oct 31, 2025Updated 6 months ago
Alternatives and similar repositories for FSPO
Users that are interested in FSPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Sep 23, 2024Updated last year
- codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"☆12Feb 10, 2025Updated last year
- Universal differential equations for ecologists☆14Apr 24, 2026Updated last week
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- ☆50Jan 7, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- GAN paper list in text generation (2017-2020) Say it Often...☆12Jul 10, 2020Updated 5 years ago
- This is the code for Visual Reasoning Sequential Attack, which is a method to jailbreak Multimodal Large Language Models Based on their v…☆65Mar 16, 2026Updated last month
- ☆24Jun 18, 2025Updated 10 months ago
- ☆11Oct 25, 2024Updated last year
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆35Nov 1, 2025Updated 6 months ago
- [CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang☆15Jan 5, 2024Updated 2 years ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated 2 years ago
- [ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …☆35Oct 23, 2024Updated last year
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Efficient Scaling laws and collaborative pretraining.☆22Sep 18, 2025Updated 7 months ago
- IAN: An Intelligent System for Omics Data Analysis and Discovery☆10Feb 23, 2026Updated 2 months ago
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆44Dec 1, 2025Updated 5 months ago
- Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)☆13Dec 23, 2019Updated 6 years ago
- [AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".☆22Mar 18, 2026Updated last month
- The code implementation of MuScleLoRA (Accepted in ACL 2024)☆10Dec 1, 2024Updated last year
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆17Sep 20, 2025Updated 7 months ago
- ☆12Sep 14, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆25Aug 29, 2025Updated 8 months ago
- Procedural data generators suite for synthetic pretraining and formal reasoning☆37Updated this week
- DataSciCamp — Data Science Challenge / Competition Deadlines☆15May 26, 2020Updated 5 years ago
- Geometric Problem Solving Integrating FormalGeo Symbolic System and Hypergraph Neural Network.☆15Sep 23, 2025Updated 7 months ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆22Mar 2, 2026Updated 2 months ago
- ☆58Jun 30, 2023Updated 2 years ago
- ☆24Oct 30, 2025Updated 6 months ago
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆73Oct 28, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Analysis on the MS-MARCO leaderboard regarding the machine reading comprehension task.☆21Dec 14, 2020Updated 5 years ago
- [CCS-LAMPS'24] LLM IP Protection Against Model Merging☆16Oct 14, 2024Updated last year
- ☆22Feb 4, 2026Updated 2 months ago
- This repository will contain links to the most famous available books of ML that are online☆13Oct 15, 2024Updated last year
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆14Dec 3, 2021Updated 4 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year