☆20Jan 7, 2024Updated 2 years ago
Alternatives and similar repositories for TH_LLM
Users that are interested in TH_LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A game engine for texas holdem no limit with an AI computer player☆10Oct 26, 2020Updated 5 years ago
- ☆88May 29, 2024Updated last year
- Android-Anti-AntiTrace☆11Jun 11, 2019Updated 6 years ago
- ☆10Apr 23, 2021Updated 4 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 6 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- flutter饿了么客户端(正在建设中...)☆18Jul 11, 2018Updated 7 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆16Jan 5, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Extensions to the Elo algorithm implemented in JAX☆14Jan 1, 2023Updated 3 years ago
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- ☆71Jan 3, 2023Updated 3 years ago
- Poker multitable tournament ICM estimation☆12Sep 21, 2017Updated 8 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated 2 years ago
- ☆18Nov 16, 2020Updated 5 years ago
- clear single-file JAX implementations of common RL algorithms☆16Sep 5, 2021Updated 4 years ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆19Sep 17, 2019Updated 6 years ago
- Parser for hands from AI poker bot Pluribus https://science.sciencemag.org/content/early/2019/07/10/science.aay2400☆19Jul 13, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A collection of self-attention modules and pre-trained backbones☆13Nov 28, 2020Updated 5 years ago
- ☆17Sep 28, 2023Updated 2 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- ☆16Jul 13, 2022Updated 3 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- Ant design system blueprint for JHipster client☆20Aug 4, 2018Updated 7 years ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆86May 21, 2025Updated 10 months ago
- ☆16Feb 23, 2024Updated 2 years ago
- A simple key-value database, fast and lightweight. Support Linux/Mac/IOS/Android☆12Dec 18, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Pytorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Jan 15, 2020Updated 6 years ago
- ☆11Jan 22, 2022Updated 4 years ago
- ☆12Apr 19, 2020Updated 5 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Pollard-Rho-kangaroo solved cuda☆14Oct 9, 2019Updated 6 years ago
- Use YubiKeys (or other security tokens) like a physical key☆18Sep 20, 2019Updated 6 years ago
- Using Natural Language for Reward Shaping in Reinforcement Learning☆24Dec 11, 2023Updated 2 years ago