CS285 Homework
☆28Dec 20, 2020Updated 5 years ago
Alternatives and similar repositories for cs285_homework_fall2020
Users that are interested in cs285_homework_fall2020 are comparing it to the libraries listed below
- Pytorch solutions for UC Berkeley's cs285 assignments☆155Jan 21, 2022Updated 4 years ago
- Code for the NeurIPS 2021 paper "Higher Order Kernel Mean Embeddings to Capture Filtrations of Stochastic Processes".☆10Oct 27, 2021Updated 4 years ago
- Research repo for reinforcement learning–based deep hedging of SPX & SPY options☆18Dec 8, 2025Updated 2 months ago
- Cat Detection and Breed Recognition☆16Oct 27, 2018Updated 7 years ago
- This repository contains a backend service for fetching VIX index futures data using the vix_index_futures.py library. The app.py script …☆14Mar 19, 2023Updated 2 years ago
- ☆12Apr 1, 2025Updated 11 months ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 2 years ago
- A simple Python implementation of Bilateral Mesh Denoising☆10Dec 1, 2019Updated 6 years ago
- ☆11Apr 3, 2023Updated 2 years ago
- Code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago
- Code for Policy Bifurcation in Safe Reinforcement Learning☆10Jul 4, 2025Updated 8 months ago
- Task Success is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors☆12Aug 11, 2024Updated last year
- ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation☆15Jun 2, 2024Updated last year
- Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"☆29Dec 3, 2025Updated 3 months ago
- Robust deep hedging and Non-linear generalized affine processes☆14Mar 7, 2025Updated last year
- Introductory workshop to modeling and model fitting in cognitive and computational neuroscience☆15Nov 21, 2019Updated 6 years ago
- ☆10Jul 5, 2021Updated 4 years ago
- Official implementation of the DiffSkill and PASTA algorithms for long-horizon, skill-based deformable object manipulation.☆14Feb 25, 2023Updated 3 years ago
- TF-IDF with Spark for the Kaggle popcorn competition☆10Jul 1, 2015Updated 10 years ago
- This repo supports automatic line plots for multi-seed event files from TensorBoard☆12Jun 23, 2022Updated 3 years ago
- ☆12Jul 3, 2017Updated 8 years ago
- ☆23Feb 24, 2023Updated 3 years ago
- Apply move_base and teb planner to pedsim_ros☆14Nov 26, 2024Updated last year
- ☆13Feb 28, 2022Updated 4 years ago
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- papers about reinforcement learning☆13Jan 4, 2021Updated 5 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.☆13Jul 11, 2022Updated 3 years ago
- ☆15Nov 3, 2022Updated 3 years ago
- ☆11Mar 18, 2021Updated 4 years ago
- ☆11Jul 5, 2020Updated 5 years ago
- ☆16Dec 30, 2019Updated 6 years ago
- ☆16Aug 6, 2024Updated last year
- Benchmark on interactive safety☆12Dec 4, 2019Updated 6 years ago
- ☆15Mar 8, 2024Updated last year
- High-level Python Particle Sequential Convex Programming Model Predictive Control (SCP PMPC) interface☆17Oct 30, 2023Updated 2 years ago
- Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)☆12Jul 4, 2022Updated 3 years ago
- ☆15Oct 2, 2023Updated 2 years ago
- Prediction model for Kaggle/Rossmann competition.☆13Nov 23, 2015Updated 10 years ago
- Markov decision processes under model uncertainty☆17Jun 15, 2022Updated 3 years ago