Notes from someone who achieved the highest score on the CS35L final exam by accident.
☆24Dec 11, 2023Updated 2 years ago
Alternatives and similar repositories for CS35L_notes
Users that are interested in CS35L_notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- ☆91Sep 21, 2022Updated 3 years ago
- Some basic examples of playing with RL☆1,274Feb 18, 2026Updated 3 months ago
- ☆216Feb 12, 2024Updated 2 years ago
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆244Apr 6, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🔥 A minimal training framework for scaling FLA models☆392Apr 22, 2026Updated last month
- The related works and background techniques about Openai o1☆222Jan 7, 2025Updated last year
- HOVER☆743Jul 30, 2025Updated 10 months ago
- ☆773Sep 18, 2025Updated 8 months ago
- jemdoc with MathJax support and more☆261Jun 30, 2024Updated last year
- A light-weight deep reinforcement learning framework for portfolio management. This project explores the possibility of applying deep rei…☆692Nov 6, 2024Updated last year
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆404Mar 19, 2025Updated last year
- Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data☆372Mar 21, 2023Updated 3 years ago
- GPU-optimized version of the MuJoCo physics simulator, designed for NVIDIA hardware.☆1,297Jun 9, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆430Apr 29, 2024Updated 2 years ago
- Related papers for reinforcement learning, including classic papers and latest papers in top conferences☆572Feb 6, 2026Updated 4 months ago
- [RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion☆4,258Dec 24, 2024Updated last year
- ☆762Dec 5, 2024Updated last year
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆766Mar 22, 2024Updated 2 years ago
- Real-time behaviour synthesis with MuJoCo, using Predictive Control☆1,662May 20, 2026Updated 3 weeks ago
- A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)☆879May 12, 2026Updated last month
- ML Collections is a library of Python Collections designed for ML use cases.☆1,031Mar 14, 2026Updated 3 months ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”☆1,002Jan 30, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A beautiful, simple, clean, and responsive Jekyll theme for academics☆15,711Jun 2, 2026Updated last week
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,086Jul 14, 2023Updated 2 years ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,242May 8, 2024Updated 2 years ago
- PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437☆1,239Feb 25, 2025Updated last year
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,321Feb 26, 2026Updated 3 months ago
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆1,382Oct 17, 2025Updated 7 months ago
- Hardware backdoors in some x86 CPUs☆2,397Oct 12, 2018Updated 7 years ago
- Isaac Gym Environments for Legged Robots☆3,010May 29, 2025Updated last year
- Isaac Gym Reinforcement Learning Environments☆2,943Oct 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Reference implementation for DPO (Direct Preference Optimization)☆2,886Aug 11, 2024Updated last year
- Official Implementation of Rectified Flow (ICLR2023 Spotlight)☆1,612Jul 20, 2024Updated last year
- Muon is an optimizer for hidden layers in neural networks☆2,656May 24, 2026Updated 3 weeks ago
- A curated list of Diffusion Model in RL resources (continually updated)☆1,611May 30, 2026Updated 2 weeks ago
- Source for the little book about OS development☆2,637Apr 22, 2023Updated 3 years ago
- 80x23 terminal tetris!☆3,283Jul 9, 2024Updated last year
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,912Jan 21, 2024Updated 2 years ago