Reinforcement Learning from Text Feedback
☆36Feb 17, 2026Updated 2 months ago
Alternatives and similar repositories for rltf
Users that are interested in rltf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆44Updated this week
- ☆12Jul 4, 2024Updated last year
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"☆19Mar 18, 2025Updated last year
- Graph Source Localization Library☆20Dec 7, 2024Updated last year
- The code of ICDM'21 paper "Deep Generation of Heterogeneous Networks"☆14Sep 1, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆14Nov 5, 2024Updated last year
- [ICLR 2026] Meta-RL Induces Exploration in Language Agents☆36Feb 1, 2026Updated 3 months ago
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…☆16Feb 15, 2024Updated 2 years ago
- 收集的同济软院的项目目录。☆14Sep 18, 2023Updated 2 years ago
- ☆14Jun 24, 2024Updated last year
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 6 months ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆14Mar 11, 2025Updated last year
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A unified, extensible, and reproducible benchmark for collaborative filtering (CF) research.☆27Jun 7, 2025Updated 11 months ago
- ARM emulator written in C++☆12Oct 4, 2014Updated 11 years ago
- The repository contains code for Non-additive reward functions in Reinforcement Learning.☆14Jul 25, 2023Updated 2 years ago
- Tongji select courses 同济抢课(捡漏)程序--适用于四轮选课☆20Jan 8, 2024Updated 2 years ago
- ☆11Oct 2, 2023Updated 2 years ago
- Text analyzer that extracts tokens from text for use in full-text search queries and indexes.☆12Nov 25, 2022Updated 3 years ago
- [WWW2023] PyTorch implementation of "DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation Framework for Efficient Device …☆26Mar 3, 2025Updated last year
- [ACL 2026] From Word to World: Can Large Language Models be Implicit Text-based World Models?☆61Apr 13, 2026Updated 3 weeks ago
- Code for Representation Bending Paper☆17Jul 15, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Analyzing LLM Alignment via Token distribution shift☆18Jan 26, 2024Updated 2 years ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆38Oct 1, 2025Updated 7 months ago
- ☆21Sep 5, 2024Updated last year
- code☆14Dec 9, 2024Updated last year
- [NeurIPS 2023] Token-Scaled Logit Distillation for Ternary Weight Generative Language Models☆18Dec 6, 2023Updated 2 years ago
- ☆19May 11, 2023Updated 2 years ago
- ☆14Feb 12, 2024Updated 2 years ago
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆20Aug 1, 2025Updated 9 months ago
- [NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…☆14Oct 12, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆16Apr 29, 2025Updated last year
- [NeurIPS 2023] The implementation of paper "Empowering Collaborative Filtering Generalization via Principled Adversarial Contrastive Loss…☆21Feb 21, 2024Updated 2 years ago
- A file downloader that leverages Go concurrency to download public files off the Internet☆11Jun 11, 2023Updated 2 years ago
- The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Pos…☆71Sep 29, 2025Updated 7 months ago
- Code exploring the use of reward machines in the context of cooperative multi-agent reinforcement learning.☆14Apr 29, 2023Updated 3 years ago
- A python package that is a wrapper for Plotly to generate football tracking and event data plots☆14Aug 11, 2021Updated 4 years ago