reinforcement learning
☆37Mar 20, 2018Updated 8 years ago
Alternatives and similar repositories for rl
Users that are interested in rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is made for medical image classification using Deep Reinforcement Learning☆17Jan 29, 2021Updated 5 years ago
- Deep Q-Learning Auto Market Maker☆12Jun 12, 2021Updated 5 years ago
- This project contains several Deep Reinforcement Learning method and some experiments basd on OpenAi gym.☆19Jan 28, 2018Updated 8 years ago
- 使用mobilenet改造HED实现在手机端进行文档的边缘检测☆24Jan 3, 2019Updated 7 years ago
- source code for China Commun. paper "Multi-Vehicle Cooperative Positioning Based on Edge-Computed Multidimensional Scaling"☆12Dec 31, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Posted at AAAI 2023☆11Sep 4, 2025Updated 9 months ago
- Deep Q-Network (DQN) and DDPG to address the problem of stall around the wing sail of an autonomous sailing robot☆11Sep 18, 2018Updated 7 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)☆16Oct 7, 2020Updated 5 years ago
- UAV offloading based on QMIX☆17Oct 12, 2023Updated 2 years ago
- 💳 Creates a new gym environment for credit-card anomaly detection using Deep Q-Networks (DQN) and leverages Open AI's Gym toolkit to all…☆19Nov 1, 2020Updated 5 years ago
- ☆19Oct 21, 2021Updated 4 years ago
- Implementation of RLHF (Reinforcement Learning with Human Feedback) and GAN (Generative Adversarial Network) on top of the T5 architectur…☆17Jan 2, 2023Updated 3 years ago
- Demonstration of a factory pattern where the types automatically register themselves☆13Mar 13, 2019Updated 7 years ago
- DyNet implementation of stack LSTM experiments by Grefenstette et al.☆21Oct 6, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Mar 6, 2023Updated 3 years ago
- ardrone simulation in gazebo(for kinetic and gazebo 7). Now it can run.☆10Oct 27, 2017Updated 8 years ago
- 日志可视化进阶☆13May 8, 2017Updated 9 years ago
- This is the pytorch implementation of FCL-Net, accepted by NN'2022.☆14May 25, 2022Updated 4 years ago
- rule matcher (context free grammar)☆10Dec 27, 2019Updated 6 years ago
- 小熠岩土勘察☆11Oct 29, 2017Updated 8 years ago
- Add attention layer to LSTM/word2vec model for sentiment analysis using tensorflow☆26Sep 30, 2017Updated 8 years ago
- Distantly Supervised NER with Partial Annotation Learning and Reinforcement Learning☆132Mar 21, 2019Updated 7 years ago
- https://www.kaggle.com/c/siim-acr-pneumothorax-segmentation☆11Sep 11, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Source code for paper Classification with Costly Features using Deep Reinforcement Learning.☆57Oct 5, 2021Updated 4 years ago
- Code for paper "Channel Pruning Guided by Spatial and Channel Attention for DNNs in Intelligent Edge Computing"☆20Aug 22, 2025Updated 9 months ago
- ☆14Dec 2, 2018Updated 7 years ago
- 数字中国华为OCR_Top6 使用pixel link+Densenet作为主体算法☆12Apr 22, 2019Updated 7 years ago
- Due to the postponement of the release of the woodscape dataset, I plan to create a soiling dataset myself for research. Deeplabv3+ train…☆15Sep 24, 2024Updated last year
- Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks☆39Feb 5, 2020Updated 6 years ago
- This repository contains a visual studio project for training a classifier on the mnist dataset using the libtorch c++ wrapper.☆12Oct 13, 2020Updated 5 years ago
- Crawling scores from education system of my school.☆10Apr 8, 2021Updated 5 years ago
- RL Recommendation System☆13Aug 30, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- convert model from mxnet to caffe without lossing precision☆19Jul 3, 2018Updated 7 years ago
- Proximal Policy Optimization(PPO) Algorithm and its distributed implementation in Pytorch☆16Nov 2, 2017Updated 8 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆12Aug 20, 2024Updated last year
- ☆23Apr 24, 2013Updated 13 years ago
- 2019年腾讯广告算法大赛rank68☆14Jun 14, 2019Updated 7 years ago
- PyTorch implementation of "Metric Learning with Adaptive Density Discrimination"☆13Mar 25, 2019Updated 7 years ago
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Apr 27, 2024Updated 2 years ago