University Project for me to understand different RL Algos and how to implement a proper environment
☆55Jun 30, 2023Updated 2 years ago
Alternatives and similar repositories for ReinforcementLearning
Users that are interested in ReinforcementLearning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evolution of Discrete data with Reinforcement Learning☆13Dec 8, 2019Updated 6 years ago
- DQN with freezing target network in tensorflow on pygame FlappyBird☆11Dec 19, 2018Updated 7 years ago
- Multilingual Neural Machine Translation using Transformers with Conditional Normalization.☆18Mar 24, 2023Updated 3 years ago
- Reinforcement learning in pure JAX.☆13Dec 24, 2025Updated 3 months ago
- Pretrained TorchVision models on CIFAR10 dataset (with weights)☆23Aug 17, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for the paper "On the Minimal Supervision for Training Any Binary Classifier from Only Unlabeled Data".☆28May 7, 2019Updated 6 years ago
- Simple distribute job scheduler for multiple servers with only SSH. No additions.☆10Dec 8, 2022Updated 3 years ago
- ☆46Mar 24, 2023Updated 3 years ago
- Code for AAAI 2019 Network Interpretability workshop paper☆16Jul 5, 2021Updated 4 years ago
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow.☆22Feb 1, 2020Updated 6 years ago
- An experimental API for Extreme Learning machines Neural Networks made with TensorFlow.☆10Oct 23, 2018Updated 7 years ago
- Instance-Dependent Partial Label Learning(NIPS'21);Variational Label Enhancement for Instance-Dependent Partial Label Learning(TPAMI)☆27Jun 5, 2025Updated 10 months ago
- little balancing robot☆11Aug 1, 2019Updated 6 years ago
- [Re] Can gradient clipping mitigate label noise? (ML Reproducibility Challenge 2020)☆14Sep 3, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Hypervisor from scratch in linux☆13May 8, 2022Updated 3 years ago
- Implementation of the DenStream algorithm in Python.☆12Nov 24, 2025Updated 4 months ago
- Pytorch implentation of stock prediction via LSTMs☆26Mar 15, 2018Updated 8 years ago
- ☆13Dec 3, 2015Updated 10 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- FedUL: Federated Learning from Only Unlabeled Data with Class-Conditional-Sharing Clients☆32Jul 11, 2023Updated 2 years ago
- Starter notebooks and winning solutions for Challenges☆16Mar 3, 2026Updated last month
- Datasets for fairness-aware machine learning☆12Mar 4, 2025Updated last year
- [IJCAI'20][ICLR'19 Workshop] Flow-based Intrinsic Curiosity Module. Playing SuperMario with RL agent and FICM!☆104Dec 8, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Analyzing different ML model comparison metrics☆17Jan 20, 2024Updated 2 years ago
- ☆14Mar 15, 2021Updated 5 years ago
- ☆16Oct 26, 2018Updated 7 years ago
- ☆12Aug 25, 2020Updated 5 years ago
- Series of deep reinforcement learning algorithms 🤖☆29Jun 19, 2021Updated 4 years ago
- ☆16Jul 26, 2017Updated 8 years ago
- 🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.☆12Jul 25, 2022Updated 3 years ago
- A barely barebone NumPy implementation of Hierarchical Temporal Memory.☆11Mar 26, 2023Updated 3 years ago
- a super simple node.js JSON based roles management system☆57Jul 16, 2010Updated 15 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- How to use LSTM trained in Keras in your Java project.☆30Jun 30, 2016Updated 9 years ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- ☆20Oct 19, 2022Updated 3 years ago
- Prototype implementation of napari-integrated CNN training viewer☆14Dec 27, 2024Updated last year
- ☆14Oct 25, 2016Updated 9 years ago
- dentifying gender and regional accent from speech☆37Aug 21, 2018Updated 7 years ago
- Code for the "Bell-Boy" Project☆10Aug 31, 2019Updated 6 years ago