ambujtewari / stats701-winter2021View external linksLinks
Theory of Reinforcement Learning
☆17Apr 20, 2021Updated 4 years ago
Alternatives and similar repositories for stats701-winter2021
Users that are interested in stats701-winter2021 are comparing it to the libraries listed below
Sorting:
- ☆10Oct 15, 2020Updated 5 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆28Sep 28, 2024Updated last year
- Benchmarked implementations of Offline RL Algorithms.☆76Mar 4, 2025Updated 11 months ago
- Re-implementations of SOTA RL algorithms.☆136Sep 7, 2023Updated 2 years ago
- ☆29Oct 3, 2023Updated 2 years ago
- python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago
- ☆11Oct 10, 2017Updated 8 years ago
- sokoban solver☆10Feb 6, 2014Updated 12 years ago
- C++ code to help assign papers to reviewers, area chairs, etc in conferences like NIPS.☆14Jun 18, 2018Updated 7 years ago
- ☆18Jan 15, 2024Updated 2 years ago
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissions☆14Nov 23, 2025Updated 2 months ago
- ☆17Dec 23, 2025Updated last month
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆12Jun 12, 2025Updated 8 months ago
- Source code for paper "PRiSM: Enhancing Low-Resource Document-Level Relation Extraction with Relation-Aware Score Calibration", Findings …☆11Jun 20, 2025Updated 7 months ago
- ☆15May 24, 2021Updated 4 years ago
- Computer Vision Project, stitching different perspective images into a single smooth panorama using Laplacian Blending.☆10Oct 28, 2017Updated 8 years ago
- ☆12Sep 15, 2021Updated 4 years ago
- PLSA for sparse matrices implemented with Numba☆11Oct 18, 2016Updated 9 years ago
- ☆13Feb 4, 2022Updated 4 years ago
- MaxSum is an algorithm about Distributed Constraint Optimization Problems (DCOPs)☆11Jan 15, 2018Updated 8 years ago
- ☆11Dec 6, 2022Updated 3 years ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- ellipsoid method python code☆12Feb 12, 2024Updated 2 years ago
- JAX implementation of the Mistral 7b v0.1 model☆13Mar 27, 2024Updated last year
- ☆13Feb 6, 2026Updated last week
- ☆12Mar 2, 2020Updated 5 years ago
- ☆10Jul 13, 2021Updated 4 years ago
- Quadruped Robot controller design and simulation on Webots☆12Apr 28, 2020Updated 5 years ago
- This is the repo of NeurIPS 2022 paper: "Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning"☆15Sep 21, 2023Updated 2 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind☆11Aug 28, 2017Updated 8 years ago
- Paper introducing deep RL approaches for conservation problems☆13Jul 22, 2022Updated 3 years ago
- Connected Component Labelling tutorial☆11Aug 11, 2016Updated 9 years ago
- Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)☆12Jun 13, 2023Updated 2 years ago
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Feb 15, 2023Updated 3 years ago
- ☆13Mar 29, 2019Updated 6 years ago
- My algorithm templates and Markdown generator for print☆11Apr 20, 2021Updated 4 years ago
- 인스타그램 태그를 Word2vec으로 학습시킨 태그 벡터 공간입니다.☆12Aug 20, 2016Updated 9 years ago
- A boilerplate (dbs, envs, teleop, models, web-apps) for robotic learning experiments & a Pytorch Implementation of "Learning Latent Plans…☆11Oct 23, 2020Updated 5 years ago