hollygrimm / cs294-homework
Assignments for CS294-112.
☆16Jul 13, 2018Updated 7 years ago
Alternatives and similar repositories for cs294-homework
Users who are interested in cs294-homework are comparing it to the repositories listed below.
- My solutions to Berkeley's Deep Reinforcement Learning class CS294-112.☆17Mar 16, 2020Updated 5 years ago
- GUI for adjusting the geometry of Mujoco models☆19Jan 21, 2020Updated 6 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆53Oct 18, 2021Updated 4 years ago
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning☆28Sep 13, 2023Updated 2 years ago
- ☆30Jun 7, 2021Updated 4 years ago
- ☆28Apr 30, 2019Updated 6 years ago
- Python - 100 Days from Novice to Master☆12May 7, 2020Updated 5 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- parsers for navigation data for oplab_standard and acfr_standard formats☆11Updated this week
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆34Mar 29, 2023Updated 2 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆33Mar 24, 2017Updated 8 years ago
- ☆42May 11, 2022Updated 3 years ago
- ☆14Apr 14, 2025Updated 10 months ago
- LEAP is a novel tool for discovering latent temporal causal relations.☆17Oct 18, 2021Updated 4 years ago
- ☆10Sep 3, 2020Updated 5 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- ☆11Jun 4, 2024Updated last year
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- ☆10Sep 21, 2021Updated 4 years ago
- Task Success is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors☆12Aug 11, 2024Updated last year
- ☆11Apr 10, 2019Updated 6 years ago
- Searching for a Strategy: Modelling Player Trajectories in Soccer Games using Social LSTM☆16Dec 20, 2017Updated 8 years ago
- Lecture notes of cs294-2017Fall☆10Feb 28, 2018Updated 7 years ago
- Official implementation of GLSO: Robot Design Automation (CoRL 2022)☆11Sep 21, 2022Updated 3 years ago
- ☆14Dec 15, 2025Updated 2 months ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning (C51-DDPG).☆11Sep 14, 2017Updated 8 years ago
- A collection of simple worlds☆12Nov 26, 2019Updated 6 years ago
- Teaching a quadruped robot to walk using a spiking neural network based architecture☆13May 12, 2024Updated last year
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 15 years ago
- ☆11Sep 17, 2025Updated 4 months ago
- Fast interpolative decompositions in Python☆10Jan 4, 2021Updated 5 years ago
- This is a repository for the autopilot of WAM-V 20 USV model in Gazebo simulation environment provided by https://github.com/osrf/vrx☆10Apr 22, 2022Updated 3 years ago
- CENTAURO model for simulation☆11Apr 24, 2020Updated 5 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- ROS integration for Franka Emika research robots in Cognitive Robotics TU Delft☆10Jan 18, 2023Updated 3 years ago
- Recommendation engine and its algorithms in Python and R.☆12Oct 26, 2018Updated 7 years ago
- yet another reinforcement learning package☆12May 24, 2022Updated 3 years ago