强化学习课程,主要是如何用强化学习解决问题
☆15Dec 10, 2024Updated last year
Alternatives and similar repositories for RLCourse
Users that are interested in RLCourse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a deadly simple repl for cpp with ghci style commands☆14Jan 7, 2023Updated 3 years ago
- The improved SSD is used to detect small targets in CCSDB dataset☆12Dec 15, 2020Updated 5 years ago
- 深入理解计算机系统 课程实验 / CMU 15213: CSAPP Labs☆12Dec 13, 2019Updated 6 years ago
- java实现跨域SSO单点登录 springboot + SSO + JWT☆13Jun 17, 2022Updated 3 years ago
- ☆12Oct 18, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- The project covers common metrics for super-resolution performance evaluation.☆12Dec 27, 2021Updated 4 years ago
- Project for CVPR 21 paper: "Cross-MPI: Cross-scale Stereo for Image Super-Resolution using Multiplane Images"☆18Oct 8, 2021Updated 4 years ago
- SimKO: Simple Pass@K Policy Optimization☆28Oct 24, 2025Updated 5 months ago
- This repo consists all my RL work and learnings☆12Dec 5, 2021Updated 4 years ago
- 整理所有特征工程用到的方法,为了复用☆11Jan 11, 2021Updated 5 years ago
- BASE-SQL: A powerful open source Text-To-SQL baseline approach☆13Feb 18, 2025Updated last year
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Apr 21, 2024Updated last year
- 北京科技大学校园网自动登录程序☆12Jan 6, 2022Updated 4 years ago
- A Pytorch implementation of "An Efficient Unfolding Network with Disentangled Spatial-Spectral Representation for Hyperspectral Image Sup…☆13Jan 26, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- Node.js 我去图书馆自动预约脚本☆26Oct 27, 2021Updated 4 years ago
- Repo allows users to test different DL archictectures when applied to time series forecasting of weather data (TCN, LSTM, BiLSTM, GRU, Bi…☆20Mar 14, 2025Updated last year
- Implementation of the paper "SoftHGNN: Soft Hypergraph Neural Networks for General Visual Recognition".☆28Aug 11, 2025Updated 7 months ago
- Adaptive Machine Learning-Based Stock Prediction using Financial Time Series Technical Indicators☆10Dec 21, 2019Updated 6 years ago
- ☆30Sep 24, 2024Updated last year
- CPU Memory Compiler and Parallel programing☆26Nov 18, 2024Updated last year
- A Deep Reinforcement Learning model for high volume and frequency Forex Portfolio Management☆13Jan 11, 2023Updated 3 years ago
- The pytorch code of hyperspectral image super-resolution method CST.☆16Sep 11, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 基于deeplabv3plus网络实现了虹膜图像分割以及水果图像分割☆20Aug 28, 2020Updated 5 years ago
- A simple lucky draw machine.☆24Jun 8, 2021Updated 4 years ago
- Rust ChatGPT CLI is a user-friendly terminal interface for OpenAI's GPT chatbot. Start conversations, manage ongoing chats, and send text…☆11May 9, 2023Updated 2 years ago
- 收集自网络的电视直播源及可以播放m3u、m3u8、flv、ts的Chrome扩展程序。☆57Updated this week
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆22Feb 5, 2026Updated last month
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21Mar 10, 2026Updated 2 weeks ago
- A simple implementation of ReasonGenRM.☆19Apr 21, 2025Updated 11 months ago
- 一款虚拟交换机软件,可以把不在同一地点的主机接入一个虚拟局域网下。☆17Oct 16, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- ☆12Sep 26, 2021Updated 4 years ago
- ☆15Jun 16, 2023Updated 2 years ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Oct 9, 2023Updated 2 years ago
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron☆30Apr 30, 2025Updated 11 months ago
- The python notebook is on googles new collabatory tool. Its a churn model being run on 3 different algorithms to compare.☆10Mar 3, 2018Updated 8 years ago
- The official implementation of SRM-Hair.☆35Mar 11, 2025Updated last year