RFTT: Reasoning with Reinforced Functional Token Tuning
☆29Feb 12, 2026Updated 4 months ago
Alternatives and similar repositories for RFTT
Users that are interested in RFTT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆23Jan 24, 2026Updated 5 months ago
- This repository contains the code and pre-trained models for our paper☆27Jun 29, 2025Updated last year
- Odyssey: Empowering Minecraft Agents with Open-World Skills☆396Oct 22, 2025Updated 8 months ago
- to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550☆14Nov 15, 2024Updated last year
- ☆14Jun 24, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 8 months ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated 2 years ago
- Make use of 51 microcontroller to make a few small tutorial, let just contact 51 microcontroller friend understand microcontroller better☆22Sep 17, 2018Updated 7 years ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆14Mar 11, 2025Updated last year
- Reinforcement Learning from Text Feedback☆48Feb 17, 2026Updated 4 months ago
- ☆11Oct 2, 2023Updated 2 years ago
- ☆55Feb 11, 2025Updated last year
- Code for Representation Bending Paper☆17Jul 15, 2025Updated 11 months ago
- Analyzing LLM Alignment via Token distribution shift