weipeilun / texasholdempocker-rl
A Texas hold'em poker agent training project using reinforcement learning.
☆23May 6, 2024Updated last year
Alternatives and similar repositories for texasholdempocker-rl
Users who are interested in texasholdempocker-rl are comparing it to the libraries listed below
- 《凯易生活》 WeChat mini program, WX+ThinkPHP (community store + garbage sorting and recycling)☆12Dec 6, 2022Updated 3 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 7 months ago
- Mini program for recycling waste.☆12Sep 27, 2018Updated 7 years ago
- A purer, higher-compression-ratio tokenizer in Rust☆13Dec 21, 2024Updated last year
- ☆11May 21, 2022Updated 3 years ago
- Edit PDF documents online☆11Jun 21, 2022Updated 3 years ago
- Pydantic AI RLM - Handle Extremely Large Contexts with Any LLM Provider☆34Feb 4, 2026Updated last week
- ☆12May 18, 2024Updated last year
- Snake game with bevy or wasm☆12Feb 4, 2023Updated 3 years ago
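- Commissioned graduation project: a WeChat mini program for scheduling garbage collection and for querying and modifying recycling orders☆10May 28, 2019Updated 6 years ago
- The code for Twitter @xiaolintemple - a bot that scrapes jokes from the internet and forwards them to Twitter☆12Feb 5, 2017Updated 9 years ago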
- exception handler library for webman plugin☆16Jul 30, 2025Updated 6 months ago
- Agents for intelligence and coordination☆20Jan 4, 2026Updated last month
- Rune Factory 3 Special Internal Trainer/RF3 Internal☆18Jan 21, 2026Updated 3 weeks ago
- Partial Least Squares Path Modeling, Structural Equation Modeling☆18Dec 20, 2021Updated 4 years ago
- A docker compose file tool for webman.☆21Jun 1, 2023Updated 2 years ago
- ☆19Jan 7, 2024Updated 2 years ago
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆24Oct 8, 2024Updated last year
- ☆19May 4, 2024Updated last year
- Vue-inspired reactive building blocks for Flutter☆58Feb 4, 2026Updated last week
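- 📝 Summary of recommendation, advertising and search models. [Roundup of recommendation, advertising and search techniques ⭐]☆24Feb 2, 2023Updated 3 years ago
- Android PDF viewer with annotation and online multi-user annotation. An Android PDF reader and editor based on the PSPDFkit SDK, supporting real-time online reading and editing by multiple users. For the server-side source code, see https://github…☆28Dec 18, 2016Updated 9 years ago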
- AI Prompt Optimization Tool built with React and Cloudflare☆44Aug 11, 2025Updated 6 months ago
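- Python data-processing course project for the Computer Science and Technology major at Beijing University of Chemical Technology: a custom shopping search engine. Given a search keyword, the program automatically crawls product information from shopping sites such as JD.com, Amazon, and Suning and shows it on the results page. It uses asynchronous crawling to display results while crawling, which reduces waiting time, and adds fun features such as adding items to a cart and randomly choosing one to buy. The project uses Djan…☆27Dec 9, 2018Updated 7 years ago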
- [ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium"☆34Nov 23, 2025Updated 2 months ago
- ☆31Oct 1, 2018Updated 7 years ago
- A modern, feature-rich Next.js starter template to kickstart your web development projects with best practices and powerful tools pre-con…☆25Dec 27, 2025Updated last month
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆44Dec 17, 2021Updated 4 years ago
- Budget Constrained Bidding for Display Advertising using Model-free Reinforcement Learning☆47Dec 13, 2019Updated 6 years ago
- An IntelliJ plugin for editing Zephir code☆38Apr 5, 2024Updated last year
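- A website project for showcasing a company's products: a simple site with a home page, news display, product display, online chat, a guestbook, and back-end administration☆37Dec 19, 2015Updated 10 years ago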
- codes for SORL framework for auto-bidding☆43Aug 19, 2025Updated 5 months ago
- Workflow diagrams based on bpmn.js, jeecg-boot, antd, and vue☆49May 13, 2022Updated 3 years ago
- A diffusion model implemented in 200 lines of code. Denoising Diffusion Probabilistic Models (DDPM)☆46Jun 6, 2023Updated 2 years ago
- yet another jy copy game☆41Oct 19, 2022Updated 3 years ago
- ☆38Aug 5, 2019Updated 6 years ago
- Multiplayer online game powered by Godot 4 and Rust 🤖🦀☆45Feb 9, 2026Updated last week
- Convert the SearXNG service output from HTML to JSON for publicly available services on the internet.☆47Apr 6, 2025Updated 10 months ago
- 🔒 An authorization library that supports access control models like ACL, RBAC, ABAC for webman plugin☆52Dec 30, 2025Updated last month