Code for an intro to RL workshop. You'll be training a simple agent to play pong using policy gradients. Adapted from http://karpathy.github.io/2016/05/31/rl/
☆15Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for pong-with-policy-gradients
Users that are interested in pong-with-policy-gradients are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- use Tensorflow object detection API to detect hand and recognize different getures(5 types gestures)☆11Mar 30, 2018Updated 8 years ago
- I have targeted to solve the benchmark problem in Reinforcement learning literature using Deep Q-networks with images as the only input t…☆12Dec 2, 2019Updated 6 years ago
- Pytorch implementation of 2D and 3D deformable convolution specified in https://arxiv.org/abs/1703.06211.☆19Nov 22, 2025Updated 5 months ago
- 基于人体解析的行人属性识别☆27Jun 26, 2020Updated 5 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Hand segmentation model examples☆17Mar 10, 2019Updated 7 years ago
- Real-time fluid simulation using smoothed particle hydrodynamics (SPH) by taking advantage of GPU hardware ray tracing for particle neigh…☆13Apr 20, 2026Updated 2 weeks ago
- ☆12Nov 10, 2020Updated 5 years ago
- Inverse Rendering Toolkit☆14Feb 24, 2025Updated last year
- Data-enriching GAN for retrieving Representative Samples from aTrained Classifier☆14Sep 2, 2020Updated 5 years ago
- Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https//arxiv.org/abs/2410.03489)☆19Oct 22, 2024Updated last year
- A pytorch reimplementation of CheXNet.☆10Jun 26, 2018Updated 7 years ago
- Image based 3d object detection paper list☆34Apr 4, 2023Updated 3 years ago
- Core code of the paper "Unbiased Caustics Rendering Guided by Representative Specular Paths".☆11Sep 8, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for "The Layer Laboratory: A Calculus for Additive and Subtractive Composition of Anisotropic Surface Reflectance" (SIGGRAPH 2018) b…☆10Apr 7, 2021Updated 5 years ago
- 存放我的“信息内容安全”实验作业代码☆11May 11, 2019Updated 6 years ago
- A small, educational autograd system with deep neural networks support☆13Apr 29, 2023Updated 3 years ago
- Path tracer using DirectCompute.☆17Apr 26, 2026Updated last week
- ☆17Nov 15, 2021Updated 4 years ago
- XXE - VULNSPY PHP AUDIT☆18Oct 15, 2018Updated 7 years ago
- CUDA accelerated medical imaging algorithms☆16May 9, 2022Updated 3 years ago
- Contact model for 3D elastic rod simulations. Framework for flagella bundling.☆13Mar 29, 2024Updated 2 years ago
- Generalizable Implicit Hate Speech Detection using Contrastive Learning (COLING 2022)☆14Oct 9, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Large Scene Rendering Viewer☆11Jan 6, 2022Updated 4 years ago
- A collection of examples following the OptiX 7 Siggraph course that demonstrate how to use Slang with OptiX☆14Aug 26, 2021Updated 4 years ago
- Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs …☆37Feb 23, 2016Updated 10 years ago
- DoDoUI☆15Nov 11, 2023Updated 2 years ago
- Evaluating Adversarial Attacks on Driving Safety in Vision-Based Autonomous Vehicles☆20Jul 26, 2023Updated 2 years ago
- Library for representation and manipulation of generalised T-Spline surfaces written in C++.☆11Oct 31, 2019Updated 6 years ago
- An OptiX 7 implementation of SPCBPT: Subspace-based Probabilistic Connections for Bidirectional Path Tracing☆15Apr 15, 2024Updated 2 years ago
- Numpy/Scipy implementation of the (fast) Guided Filter☆60Jul 7, 2017Updated 8 years ago
- Face Recognition using Deep Learning and TensorFlow Framework☆10Jul 19, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- RLLaVA is a user-friendly framework for multi-modal RL research and optimized for resource-constrained teams.☆59Mar 18, 2026Updated last month
- Labs for deep learning course.☆16Jun 21, 2021Updated 4 years ago
- object detection, adversarial robustness, ICIP2021☆17Jan 10, 2021Updated 5 years ago
- 微博情感分析,使用flask制作restful api,毕业设计衍生项目☆17Dec 16, 2017Updated 8 years ago
- Siggraph Asia 2023 Paper "Extended Path Space Manifold for Physically Based Differentiable Rendering"☆17May 11, 2024Updated last year
- 为CSGO人工智能ai铺垫的csgo api 读取和执行必要的信息和决策☆18Apr 16, 2021Updated 5 years ago
- Code for our SIGGRAPH 2023 paper, "Acting as Inverse Inverse Planning"☆20Apr 21, 2023Updated 3 years ago