To reproduce the experiments in Sutton's book
☆14Mar 28, 2025Updated last year
Alternatives and similar repositories for ReinforcementLearning-R.S.
Users that are interested in ReinforcementLearning-R.S. are comparing it to the libraries listed below.
- make LLM as a private assistant☆16Apr 11, 2025Updated last year
- Build Jekyll site with GitBook style!☆14May 26, 2025Updated 11 months ago
- Docker image for sending and receiving WeChat messages via Telegram☆11Aug 29, 2022Updated 3 years ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆48Aug 13, 2025Updated 8 months ago
- Spatial-reasoning CAPTCHA generator☆20Jun 26, 2019Updated 6 years ago
- Scrapy Universal Spider☆57Aug 26, 2017Updated 8 years ago
- Building a TamperMonkey plugin using Vue3.0 and Antd-Design-Vue to improve development …☆86May 30, 2023Updated 2 years ago
- Sougou Weixin Spider Using Proxy☆87May 30, 2021Updated 4 years ago
- ☆78May 27, 2024Updated last year
- ☆94Nov 9, 2024Updated last year
- The code of RouterDC☆71Apr 14, 2025Updated last year
- PingFang SC Fonts For Windows☆43Jan 21, 2016Updated 10 years ago
- fterm is a cross-platform terminal tool built with Flutter☆52Jul 27, 2023Updated 2 years ago
- One command to run ChatTTS☆61Jun 6, 2024Updated last year
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆196May 31, 2024Updated last year
- Documents, code, and programs mentioned in 《手机就是开发板》 ("The Phone Is the Dev Board")☆172Mar 12, 2018Updated 8 years ago
- LocalAGI: Locally run AGI powered by LLaMA, ChatGLM and more.☆82Jun 25, 2023Updated 2 years ago
- ChatGPT chatbot for WeCom (Enterprise WeChat), built on a self-built app☆85Apr 28, 2023Updated 3 years ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆265Apr 14, 2025Updated last year
- Chinese and English document layout analysis☆269Mar 24, 2026Updated last month
- Zero-barrier deployment of large models on domestic (Chinese) compute hardware; run Qwen, GLM-4.7, Minimax-2.1, DeepSeek-OCR, and other models with one click☆319Apr 28, 2026Updated last week
- Taobao shopping cart site☆206Mar 22, 2018Updated 8 years ago
- An easy-to-use self-signed certificate management system☆140Apr 5, 2025Updated last year
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆285Mar 18, 2025Updated last year
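- GPT-based chatbot for document search and assistance☆135Feb 20, 2023Updated 3 years ago
- Inference library for sequence-based table recognition, integrating table recognition algorithms from PP-Structure, modelscope, and others.☆415Apr 23, 2026Updated last week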
- A compact LLM pretrained in 9 days by using high quality data☆341Apr 9, 2025Updated last year
- Screen broadcast and remote control tool based on FFmpeg.☆199Sep 15, 2024Updated last year
- Chinese natural language inference and semantic similarity datasets☆366Jan 5, 2022Updated 4 years ago
- Two-way message forwarding between QQ and Telegram☆199Aug 1, 2025Updated 9 months ago
- PyTorch implementation of Trust Region Policy Optimization☆451Sep 13, 2018Updated 7 years ago
- A beautiful, cross-platform downloader for YouTube, TikTok, Instagram, and 1800+ sites (yt-dlp GUI) with AI video summaries and post-proc…☆795Apr 26, 2026Updated last week
- Design considerations for conversational bots (chatbots)☆805Sep 8, 2020Updated 5 years ago
- Note: I happened across an article I bookmarked about ten years ago; it still holds up today, so I am sharing it (typed on a phone, so please bear with me). Mortgage Karate. Last night I was drinking with a classmate who runs a pawnshop. I asked him, "How many apartments and shops have you bought?" "22 apartments and 12 shops," he answered, as if showing off. "And how much did you spend?" …☆631Aug 4, 2022Updated 3 years ago
- Chinese text analysis toolkit (text classification, text clustering, text similarity, keyword extraction, key phrase extraction, sentiment analysis, text correction, text summarization, topic keywords, synonyms and near-synonyms, event triple extraction)☆732Oct 3, 2023Updated 2 years ago
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…☆641Feb 10, 2024Updated 2 years ago
- Walrus is an open-source application management platform based on IaC tools including OpenTofu, Terraform and others. It helps platform e…☆438Jun 11, 2024Updated last year
- Build Jekyll site with GitBook style!☆630Aug 11, 2024Updated last year
- IROS2020 paperlist by paopaorobot☆325Dec 2, 2020Updated 5 years ago