Reinforcement Learning from Text Feedback
☆33Feb 17, 2026Updated 2 months ago
Alternatives and similar repositories for rltf
Users that are interested in rltf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆43Aug 3, 2025Updated 8 months ago
- ☆12Jul 4, 2024Updated last year
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"☆17Mar 18, 2025Updated last year
- ☆14Nov 5, 2024Updated last year
- [ICLR 2026] Meta-RL Induces Exploration in Language Agents☆35Feb 1, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…☆16Feb 15, 2024Updated 2 years ago
- 收集的同济软院的项目目录。☆14Sep 18, 2023Updated 2 years ago
- ☆14Jun 24, 2024Updated last year
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 5 months ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆14Mar 11, 2025Updated last year
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- A unified, extensible, and reproducible benchmark for collaborative filtering (CF) research.☆27Jun 7, 2025Updated 10 months ago
- ARM emulator written in C++☆12Oct 4, 2014Updated 11 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The repository contains code for Non-additive reward functions in Reinforcement Learning.☆14Jul 25, 2023Updated 2 years ago
- Tongji select courses 同济抢课(捡漏)程序--适用于四轮选课☆20Jan 8, 2024Updated 2 years ago
- ☆11Oct 2, 2023Updated 2 years ago
- Text analyzer that extracts tokens from text for use in full-text search queries and indexes.☆12Nov 25, 2022Updated 3 years ago
- [WWW2023] PyTorch implementation of "DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation Framework for Efficient Device …☆26Mar 3, 2025Updated last year
- [ACL 2026] From Word to World: Can Large Language Models be Implicit Text-based World Models?☆60Updated this week
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆17Aug 1, 2025Updated 8 months ago
- Code for Representation Bending Paper☆17Jul 15, 2025Updated 9 months ago
- Analyzing LLM Alignment via Token distribution shift☆18Jan 26, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 6 months ago
- code☆14Dec 9, 2024Updated last year
- ☆21Sep 5, 2024Updated last year
- [NeurIPS 2023] Token-Scaled Logit Distillation for Ternary Weight Generative Language Models☆18Dec 6, 2023Updated 2 years ago
- ☆19May 11, 2023Updated 2 years ago
- ☆14Feb 12, 2024Updated 2 years ago
- [NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…☆14Oct 12, 2023Updated 2 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆16Apr 29, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NeurIPS 2023] The implementation of paper "Empowering Collaborative Filtering Generalization via Principled Adversarial Contrastive Loss…☆21Feb 21, 2024Updated 2 years ago
- A file downloader that leverages Go concurrency to download public files off the Internet☆11Jun 11, 2023Updated 2 years ago
- The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Pos…☆70Sep 29, 2025Updated 6 months ago
- Code exploring the use of reward machines in the context of cooperative multi-agent reinforcement learning.☆14Apr 29, 2023Updated 2 years ago
- A python package that is a wrapper for Plotly to generate football tracking and event data plots☆14Aug 11, 2021Updated 4 years ago
- jupyter / jupyterlite kernel for Haskell powered by WebAssembly☆65Updated this week
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)☆40Sep 8, 2025Updated 7 months ago