RFTT: Reasoning with Reinforced Functional Token Tuning
☆29Feb 12, 2026Updated 2 weeks ago
Alternatives and similar repositories for RFTT
Users that are interested in RFTT are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆32Jan 7, 2026Updated last month
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- [SIGGRAPH 2021] DiffAqua: A Differentiable Computational Design Pipeline for Soft Underwater Swimmers with Shape Interpolation☆37Sep 1, 2022Updated 3 years ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆18Jan 11, 2026Updated last month
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- Multi-Organ Foundation Model for Universal Ultrasound Image Segmentation with Task Prompt and Anatomical Prior☆16Sep 30, 2024Updated last year
- ☆18Sep 5, 2024Updated last year
- ☆13Jan 19, 2026Updated last month
- ☆12Jul 4, 2024Updated last year
- ☆10Oct 11, 2022Updated 3 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆11Jan 15, 2020Updated 6 years ago
- NeurIPS 2024: Bidirectional Recurrence for Cardiac Motion Tracking with Gaussian Process Latent Coding☆16Jun 20, 2025Updated 8 months ago
- ☆13Jun 25, 2022Updated 3 years ago
- ☆11Jun 5, 2023Updated 2 years ago
- My implementation of https://arxiv.org/abs/1910.02600 in pytorch. Based on https://github.com/aamini/evidential-deep-learning☆10Jan 26, 2021Updated 5 years ago
- The original code for SCARA: Scalable Graph Neural Networks with Feature-Oriented Optimization (VLDB 2022) and Scalable Decoupling Graph …☆13Mar 8, 2024Updated last year
- ☆11May 20, 2022Updated 3 years ago
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆23Oct 14, 2025Updated 4 months ago
- Code repository for our paper, "Medical Large Language Models are Vulnerable to Data Poisoning Attacks" (Nature Medicine, 2024).☆12Jan 5, 2025Updated last year
- ☆20May 24, 2025Updated 9 months ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆13Mar 11, 2025Updated 11 months ago
- ☆14Jul 18, 2025Updated 7 months ago
- 本工具包裝了DeepSeek、豆包、kimi等AI大模型,可以在群組裡指揮他們,讓他們為你工作;可以一鍵導出到word、excel、txt等,並保留美化後的樣式。☆13May 31, 2025Updated 9 months ago
- Learning to Exploit the Prior Network Knowledge for Weakly-Supervised Semantic Segmentation☆10Mar 13, 2019Updated 6 years ago
- ☆10Apr 20, 2016Updated 9 years ago
- ☆12Jun 16, 2023Updated 2 years ago
- Rendering code for ShapeNet models☆11Apr 20, 2017Updated 8 years ago