Snake game RL environment for Ubiquant competition 2022.
☆22May 23, 2022Updated 4 years ago
Alternatives and similar repositories for QSnakeGame
Users that are interested in QSnakeGame are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A unified robotic manipulation learning framework☆22Sep 4, 2025Updated 8 months ago
- 个人科研笔记网站☆21Mar 29, 2025Updated last year
- code for RIM☆22Nov 18, 2022Updated 3 years ago
- Code to reproduce experiments from:☆10Dec 11, 2020Updated 5 years ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"☆11Aug 7, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Statistical mechanics models such as random cluster models, random growth models and related processes.☆12Jan 24, 2025Updated last year
- Paper list for constrained policy optimization in reinforcement learning.☆73Nov 7, 2023Updated 2 years ago
- Hardware-efficient learning of quantum many-body states. Code for simulating a U(1) lattice gauge theory and classifying topological orde…☆12Dec 8, 2022Updated 3 years ago
- Material for the course Theories of Quantum Matter at the University of Cambridge☆12Jan 20, 2023Updated 3 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Feb 21, 2019Updated 7 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- A PyTorch Lightning template to try out a wide range of ideas on the Ubiquant Market Prediction competition without modifying any code!☆12Mar 24, 2022Updated 4 years ago
- Code for Expert Supervised Reinforcement Learning☆10Apr 7, 2021Updated 5 years ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆33Mar 27, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13Oct 16, 2024Updated last year
- ☆11Oct 19, 2018Updated 7 years ago
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- A simple 2D ball collision engine.☆12Jun 15, 2023Updated 2 years ago
- Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand.☆12Apr 12, 2024Updated 2 years ago
- A LaTeX template for replying to paper reviews☆16May 15, 2019Updated 7 years ago
- Code for the Cardiac MRI Reconstruction Challenge 2025 (CMRxRecon2025)☆21Apr 17, 2026Updated last month
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- # Algorithms compared: # > Proximal Gradient Descent # > Accelerated Proximal Gradient Descent # > Coordinate Descent # > Alternating Di…☆11Feb 4, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is an implementation of the paper "Coordinated Multi Agent Imitation Learning", or the Sloan version "Data-Driven Ghosting using Dee…☆41Jun 28, 2018Updated 7 years ago
- ☆12May 10, 2018Updated 8 years ago
- This repo is for reproducing our results in “Lipschitz Generative Adversarial Nets”.☆11Sep 26, 2020Updated 5 years ago
- Calibrate both hand-in-eye and hand-to-eye simultaneously with colmap☆15Oct 20, 2024Updated last year
- CineVN: Variational network reconstruction for rapid functional cardiac cine MRI☆14Oct 29, 2024Updated last year
- ☆15Mar 19, 2025Updated last year
- A Matab framework for GHD calculations.☆18Dec 10, 2024Updated last year
- Code for predicting ground state properties using new ML model.☆15Dec 16, 2023Updated 2 years ago
- Official repository for "CUPID: Curating Data your Robot Loves with Influence Functions," accepted to CoRL 2025.☆35Aug 9, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A project on Fourier grid Hamiltonian method and Quantum Monte Carlo method☆17Dec 12, 2016Updated 9 years ago
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Feb 15, 2023Updated 3 years ago
- Random unitary time evolution plus projective measurement in the one-dimensional quantum circuit model☆13May 12, 2020Updated 6 years ago
- A simple shellscript for splitting the PDF of a paper into the main body and an appendix.☆18Jun 1, 2020Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- ☆16Mar 27, 2025Updated last year
- ☆19Oct 30, 2023Updated 2 years ago