☆24 · Jun 11, 2025 · Updated last year
Alternatives and similar repositories for RL-LLM-Prior
Users that are interested in RL-LLM-Prior are comparing it to the libraries listed below.
- ☆15 · May 4, 2024 · Updated 2 years ago
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs ☆18 · Jul 19, 2025 · Updated 10 months ago
- Improving Token-Based World Models with Parallel Observation Prediction (ICML 2024) ☆14 · Feb 23, 2026 · Updated 3 months ago
- [ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER) ☆56 · Mar 9, 2025 · Updated last year
- My low-quality and poor-performance codes submitted to several online judges, such as ZeroJudge, GreenJudge, UVa, TIOJ, AtCoder, CSES pro… ☆15 · Apr 6, 2026 · Updated 2 months ago
- PyTorch Implementations of Augmented Random Search ☆17 · Feb 28, 2019 · Updated 7 years ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs). ☆247 · Dec 11, 2025 · Updated 6 months ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs. ☆14 · Nov 4, 2025 · Updated 7 months ago
- ☆14 · May 30, 2019 · Updated 7 years ago
- Flying bird object detection in surveillance video ☆17 · Apr 24, 2025 · Updated last year
- [NeurIPS 2025, Spotlight] An official implementation of the paper Quantization-Free Autoregressive Action Transformer ☆11 · Mar 3, 2026 · Updated 3 months ago
- Hands-On Reinforcement Learning with TensorFlow & TRFL ☆14 · Jan 18, 2021 · Updated 5 years ago
- ☆19 · Oct 27, 2025 · Updated 7 months ago
- An ergonomic, opinionated memory interface for AI agents ☆39 · Dec 18, 2025 · Updated 6 months ago
- ☆13 · Mar 29, 2026 · Updated 2 months ago
- Discrete Flow Matching implemented in PyTorch ☆35 · Mar 23, 2025 · Updated last year
- Google Research Football MARL Benchmark and Research Toolkit ☆60 · May 19, 2024 · Updated 2 years ago
- ☆10 · Nov 6, 2024 · Updated last year
- A lightweight computational physics framework, based on the organization of turboWAVE. Implements a "Simulation, PhysicsModule, ComputeTo… ☆12 · Apr 1, 2026 · Updated 2 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆18 · Aug 8, 2022 · Updated 3 years ago
- ☆68 · Jul 15, 2024 · Updated last year
- Uplifted Contextual Multi-Armed Bandit ☆19 · May 4, 2022 · Updated 4 years ago
- Hands-on Reinforcement Learning with TensorFlow by Packt Publishing ☆13 · Jan 15, 2021 · Updated 5 years ago
- Implementation of Neurips 2023 Paper "Multi Time Scale World Models" ☆17 · Nov 8, 2024 · Updated last year
- A simple 2D ball collision engine. ☆12 · Jun 15, 2023 · Updated 3 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23. ☆16 · Aug 14, 2023 · Updated 2 years ago
- Mixtures of Gaussian Process Experts in GPflow/TensorFlow ☆12 · Aug 1, 2022 · Updated 3 years ago
- Code accompanying the paper "Information Directed Reward Learning for Reinforcement Learning" (NeurIPS 2021). ☆13 · Nov 16, 2021 · Updated 4 years ago
- Deep Learning (FS 2020) ☆17 · Oct 10, 2022 · Updated 3 years ago
- ☆12 · May 17, 2021 · Updated 5 years ago
- Multi-group Gaussian process (MGGP) ☆23 · Jul 24, 2024 · Updated last year
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations" ☆61 · May 8, 2026 · Updated last month
- JData: a language-independent data annotation for portable storage and interchange ☆17 · Apr 21, 2026 · Updated last month
- ☆20 · Feb 18, 2022 · Updated 4 years ago
- A gym environment for Super Smash Bros. Melee ☆27 · May 12, 2020 · Updated 6 years ago
- A modified version of the cart-pole OpenAI Gym environment for testing different control policies ☆13 · May 4, 2026 · Updated last month
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239 ☆22 · Jun 24, 2023 · Updated 2 years ago
- code for icml paper: https://arxiv.org/abs/1711.03243v3 ☆12 · Jul 8, 2018 · Updated 7 years ago
- Model-Agnostic Meta-Learning in PyTorch ☆12 · Jul 31, 2020 · Updated 5 years ago