Paper notes for my PhD on Machine Learning (mostly focused on Reinforcement Learning)
☆17Jul 22, 2019Updated 6 years ago
Alternatives and similar repositories for paper_notes
Users that are interested in paper_notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Decoupling Dynamics and Reward for Transfer Learning☆16Sep 7, 2018Updated 7 years ago
- Labs for understanding and coding Standard Reinforcement Learning concepts☆60Jan 17, 2019Updated 7 years ago
- Automatic script to parse bibtex to mardown to create manageable bibliography github repository. We take the example of continual learnin…☆11Nov 1, 2020Updated 5 years ago
- Online demo of DRLViz, an interactive tool to understand decisions and memory in Deep Reinforcement Learning☆16Dec 8, 2022Updated 3 years ago
- ☆17Jun 30, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment☆11Oct 20, 2018Updated 7 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆23Apr 26, 2023Updated 3 years ago
- Docker container to compile https://github.com/udacity/CarND-Capstone☆11Nov 3, 2017Updated 8 years ago
- ☆11Nov 23, 2018Updated 7 years ago
- Real-time interaction with deep generative models. Allows for a novel Convolutional Layer Reconnection technique on pretrained or your ow…☆11Jan 4, 2024Updated 2 years ago
- E2C implementation in PyTorch☆43Jul 5, 2017Updated 8 years ago
- ☆13Aug 13, 2021Updated 4 years ago
- Round 1 Starter Kit for the MarLo challenge☆21Sep 27, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Imagination Augmented Agents TensorFlow☆26Mar 30, 2020Updated 6 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆11Feb 28, 2023Updated 3 years ago
- ☆13Apr 11, 2022Updated 4 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- ☆12Jul 15, 2020Updated 5 years ago
- Pytorch implementation of SCAN: Learning Abstract Hierarchical Compositional Visual Concepts☆20Jan 27, 2018Updated 8 years ago
- Local search for NAS☆18Nov 3, 2020Updated 5 years ago
- pybullet grasping with time contrastive network embeddings☆22Jun 18, 2019Updated 6 years ago
- Incorporating Neuro-Inspired Adaptability for Continual Learning in Artificial Intelligence☆28Dec 12, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Sep 22, 2015Updated 10 years ago
- Simple optimal control framework for python☆13Aug 16, 2018Updated 7 years ago
- Code for Transformers are Adaptable Task Planners, CoRL 2022☆12Mar 28, 2023Updated 3 years ago
- ☆37Sep 14, 2025Updated 8 months ago
- Extension of MultivariatePolynomials to moments of multivariate measures☆15Apr 22, 2026Updated last month
- POMDP formulation of a pedestrian avoidance problem for autonomous driving☆51Apr 3, 2020Updated 6 years ago
- This is the codebase for our ICRA 2020 submission, GraphRQI: Classifying Driver Behaviors Using Graph Spectrums.☆13Dec 8, 2019Updated 6 years ago
- w.i.p.☆11Feb 8, 2021Updated 5 years ago
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C).☆24Apr 25, 2014Updated 12 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 股票高频数据(数据来源:新浪)☆13Jan 29, 2020Updated 6 years ago
- [VLDB 2022] Dash application for "Navigating the Labyrinth of Time Series Anomaly Detection"☆23Mar 20, 2023Updated 3 years ago
- ☆11Sep 11, 2023Updated 2 years ago
- ☆21May 13, 2019Updated 7 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- resources as addition to the updated version of the SSL4EO-S12 dataset, cf. https://arxiv.org/abs/2503.00168☆24Mar 24, 2026Updated 2 months ago