[ICML 2023] Code for paper "Internally Rewarded Reinforcement Learning"
☆13Jul 21, 2023Updated 2 years ago
Alternatives and similar repositories for internally-rewarded-rl
Users that are interested in internally-rewarded-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation for the LABOR (LAnguage-model-based Bimanual ORchestration) Agent.☆21Nov 23, 2024Updated last year
- ☆25Dec 8, 2022Updated 3 years ago
- [ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)☆52Mar 9, 2025Updated last year
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆196Aug 6, 2025Updated 7 months ago
- Official implementation of Zero-Hero paper☆29Feb 13, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- LUMOS: Language-Conditioned Imitation Learning with World Models☆16Updated this week
- 河南方言语音识别☆13Apr 1, 2018Updated 7 years ago
- Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic"☆30Mar 13, 2024Updated 2 years ago
- Lifelong Reinforcement Learning codes. Python implementation for the SR-LLRL Algorithm, proposed in our 2021 IEEE SMC Conference Paper "A…☆23Mar 30, 2022Updated 4 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- noise reduction☆17Jul 3, 2024Updated last year
- environments for reinforcement learning based on panda-gym☆19Aug 22, 2022Updated 3 years ago
- Screen capture tool to capture or grab an specific window on Desktop in Java☆11Jan 23, 2020Updated 6 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- 机器学习生成音乐☆17Aug 14, 2018Updated 7 years ago
- ☆20Apr 19, 2021Updated 4 years ago
- Official implementation of Matcha-agent, https://arxiv.org/abs/2303.08268☆28Aug 25, 2024Updated last year
- codemirror extensions includes toolbar, helper, image-upload, event-emitter☆12Jan 15, 2026Updated 2 months ago
- VAE with Attention Mechanism for a more powerful representation of interactions☆21Jun 29, 2019Updated 6 years ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Jan 27, 2026Updated 2 months ago
- Websockets <-> Riva proxy service. Audiocodes compatible.☆20Mar 31, 2023Updated 2 years ago
- The multiagent extension for the PDDL parser☆36Aug 12, 2019Updated 6 years ago
- ☆10Oct 3, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 声音场景识别☆20Jan 25, 2018Updated 8 years ago
- Code for generating options for planning and reinforcement learning☆12Feb 18, 2021Updated 5 years ago
- This repository contains the code for the publication "Harnessing the Power of Multi-Task Pretraining for Ground-Truth Level Natural Lang…☆10Oct 26, 2023Updated 2 years ago
- Implements QuickLook in Zotero☆10Dec 4, 2017Updated 8 years ago
- Adaptive Machine Learning-Based Stock Prediction using Financial Time Series Technical Indicators☆10Dec 21, 2019Updated 6 years ago
- A curated list of personalized Language model / Large language model (continually updated)☆10Nov 17, 2023Updated 2 years ago
- Inference code for facebook LLaMA models with Wrapyfi support☆129Mar 16, 2023Updated 3 years ago
- Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020☆13May 2, 2022Updated 3 years ago
- Learning about objects and their properties by interacting with them☆12Oct 21, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆32Jan 30, 2023Updated 3 years ago
- A Deep Reinforcement Learning model for high volume and frequency Forex Portfolio Management☆13Jan 11, 2023Updated 3 years ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Jul 24, 2023Updated 2 years ago
- VR Joystick Teleoperation for Isaac Lab with Meta Quest☆19May 10, 2025Updated 10 months ago
- Official implementation of LLM+MAP: Bimanual Robot Task Planning using Large Language Models (LLMs) and Planning Domain Definition Langua…☆21Mar 24, 2025Updated last year
- ☆45Feb 5, 2023Updated 3 years ago
- implementation of paper "Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners"☆20Aug 17, 2023Updated 2 years ago