vkurenkov / hierarchical-skill-acquisitionView external linksLinks
Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong, and Richard Socher
☆11Jun 18, 2018Updated 7 years ago
Alternatives and similar repositories for hierarchical-skill-acquisition
Users that are interested in hierarchical-skill-acquisition are comparing it to the libraries listed below
Sorting:
- This repository accompanies the following paper: A Workflow for Offline Model-Free Robotic RL☆12Nov 5, 2021Updated 4 years ago
- Repository for AI Student-Teacher Abnormaly Detection☆11Jun 23, 2022Updated 3 years ago
- Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.☆25Feb 16, 2023Updated 3 years ago
- An RPG Maker MZ plugin☆12Nov 2, 2023Updated 2 years ago
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- Matlab implementation of inverse transform sampling in 1D and 2D☆10Jan 14, 2015Updated 11 years ago
- Pytorch implementation of NASA: NEURAL ARTICULATED SHAPE APPROXIMATION☆12May 4, 2021Updated 4 years ago
- ☆16Mar 14, 2025Updated 11 months ago
- ☆11Oct 25, 2021Updated 4 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- Simple implementation of an AABB Tree (Axis Aligned Bounding Box Tree) to optimize 3d collision detection☆10Oct 22, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- [ACML2023] Towards Better Explanations for Object Detection☆10Jan 10, 2024Updated 2 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- [Review] Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environment☆10Dec 22, 2018Updated 7 years ago
- ☆11Mar 5, 2024Updated last year
- Attend - to what matters.☆17Feb 22, 2025Updated 11 months ago
- DOOM source port in Eiffel with SDL2☆11Sep 9, 2025Updated 5 months ago
- code for icml paper: https://arxiv.org/abs/1711.03243v3☆12Jul 8, 2018Updated 7 years ago
- Inverse kinematic solver (FABRIK) for a simple 3D chain☆12Apr 23, 2021Updated 4 years ago
- Reusable, Easy-to-use Uncertainty module package built with Tensorflow, Keras☆14Dec 31, 2018Updated 7 years ago
- MATLAB implementation of my Bayesian Optimization algorithms☆12Mar 17, 2018Updated 7 years ago
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D☆15Nov 9, 2023Updated 2 years ago
- Optimization program for G-code☆12Nov 29, 2015Updated 10 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- [CVPR 2025 - HuMoGen] "MDMP: Multi-modal Diffusion for supervised Motion Predictions with uncertainty"☆16Mar 12, 2025Updated 11 months ago
- A short guide and example on how to fine-tune OpenAI's gpt-3.5-turbo for better roleplay☆14Aug 26, 2023Updated 2 years ago
- [ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…☆17Jun 12, 2025Updated 8 months ago
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- Locust on k8s example for scalable load tests☆14Apr 16, 2022Updated 3 years ago
- research and implementations of recurrent neural networks and their applications☆14Aug 31, 2025Updated 5 months ago
- MABA stable PD controller☆17Oct 20, 2020Updated 5 years ago
- This packages provides a simple python implementation of Invariant Causal Prediction (ICP)☆13Mar 22, 2024Updated last year
- Sources for a Medium article☆11Dec 2, 2021Updated 4 years ago
- Local LLM Discord Bot☆18Jun 20, 2025Updated 7 months ago