This repo mainly contains CS234 (Spring 2024) assignment's coding problems
☆58Feb 4, 2025Updated last year
Alternatives and similar repositories for CS234-Reinforcement-Learning
Users that are interested in CS234-Reinforcement-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Supplementary Material to accompany the paper, DJ Warne, SA Sisson, C Drovandi (2019) Acceleration of expensive computations in Bayesian…☆13Oct 23, 2020Updated 5 years ago
- Code for utilising VAE as means of doing exact MCMC inference in complex high-dimensional space☆14Jun 20, 2023Updated 2 years ago
- A personal LaTeX class for writing journals☆17Jul 10, 2025Updated 9 months ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch☆22Nov 9, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code to related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- Probabilistic deep learning using JAX☆15Feb 8, 2025Updated last year
- ☆25Sep 7, 2025Updated 7 months ago
- Embedded segmental K-means (ES-KMeans) in Python.☆14Apr 22, 2024Updated last year
- ☆20Feb 14, 2025Updated last year
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- Perceiver (transformer variant) implemented in JAX and Flax☆13Mar 29, 2021Updated 5 years ago
- A LLM-powered agent for NetHack☆21Nov 4, 2024Updated last year
- ☆15Sep 8, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code of PriorVAE: encoding spatial priors with variational autoencoders☆12Jul 13, 2023Updated 2 years ago
- Automate dating apps with AI☆20Jan 18, 2024Updated 2 years ago
- Implementing the Vision Transformer paper from scratch for course project.☆12Apr 25, 2022Updated 3 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Modular pipeline for design assistance. PYPi Package: https://pypi.org/project/vai-lab☆14Dec 22, 2023Updated 2 years ago
- Official implementation for NeurIPS'24 paper "Generative Semi-supervised Graph Anomaly Detection"☆63Aug 13, 2025Updated 8 months ago
- Official Implementation of "D4Explainer: In-Distribution GNN Explanations via Discrete Denoising Diffusion"☆24Oct 29, 2023Updated 2 years ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆13Aug 13, 2025Updated 8 months ago
- An implementation of squared neural families in PyTorch☆14Oct 22, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Minimal, fast + educational reimplementation of the TabICLv2 architecture☆65Mar 25, 2026Updated 3 weeks ago
- MoveFormer: a Transformer-based model for step-selection animal movement modelling☆12Aug 5, 2023Updated 2 years ago
- ☆16Apr 8, 2025Updated last year
- ☆18Apr 2, 2023Updated 3 years ago
- RL algorithm implementations from scratch.☆17Nov 22, 2020Updated 5 years ago
- ☆38Jul 16, 2025Updated 8 months ago
- [ICDE 2023] Dynamic hypergraph structure learning for traffic flow forecasting☆21Oct 14, 2022Updated 3 years ago
- Implementation of FedDR algorithm for federated learning.☆11Mar 8, 2022Updated 4 years ago
- [NeurIPS '25] Multi-Token Prediction Needs Registers☆28Dec 14, 2025Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆26Jun 17, 2025Updated 9 months ago
- WIP: Unnoficial implementation of diffusion autoencoders, using pytorch☆11Feb 15, 2023Updated 3 years ago
- ☆22Mar 28, 2025Updated last year
- Code for novel methods for one-shot Federated Learning under high statistical heterogeneity.☆18Oct 2, 2023Updated 2 years ago
- Amortized Probabilistic Conditioning for Optimization, Simulation and Inference (Chang et al., AISTATS 2025)☆21Jan 27, 2026Updated 2 months ago
- ☆28Dec 16, 2022Updated 3 years ago
- Chapter 11: Transfer Learning/Domain Adaptation☆18Jul 23, 2019Updated 6 years ago