rl-explainer
☆184Mar 9, 2026Updated last month
Alternatives and similar repositories for rl-explainer
Users that are interested in rl-explainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Jan 8, 2025Updated last year
- Color Prompting for Data-Free Continual Unsupervised Domain Adaptive Person Re-Identification☆10Aug 22, 2023Updated 2 years ago
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset☆14Feb 2, 2025Updated last year
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"☆82Mar 18, 2026Updated 3 weeks ago
- Language Models as Semantic Indexers (ICML 2024)☆40May 2, 2024Updated last year
- Create Persona dataset from reddit en movie category comment☆11Aug 6, 2021Updated 4 years ago