Ludic – an LLM-RL library for the era of experience
☆62Jan 9, 2026Updated 2 months ago
Alternatives and similar repositories for ludic
Users that are interested in ludic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Nov 6, 2024Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆33Oct 8, 2025Updated 5 months ago
- OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents [NeurIPS 2025 Spotlight]☆57Sep 18, 2025Updated 6 months ago
- AI eXplainable Inference & Search. Open Sourcing on-premise, ultra-fast latency intelligence to all.☆37Feb 28, 2025Updated last year
- ☆120Feb 25, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Harness for running and evaluating AI agents against RL environments☆135Mar 6, 2026Updated 3 weeks ago
- NeuroBLAST v3 architecture code☆36Jan 6, 2026Updated 2 months ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- ☆77Feb 18, 2026Updated last month
- MoE training for Me and You and maybe other people☆381Mar 15, 2026Updated last week
- Research agents harness☆68Mar 8, 2026Updated 2 weeks ago
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- Synthetic Hypertext and Homomorphic Catalogue☆15Dec 28, 2024Updated last year
- ☆14Apr 16, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆45Jun 23, 2025Updated 9 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 5 months ago
- prediction market indexer with semantic search☆35Jan 27, 2026Updated last month
- A simple AI agent controlling a simulation of a smart home☆13Jun 13, 2024Updated last year
- ☆38Oct 23, 2025Updated 5 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆20Mar 9, 2025Updated last year
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆331Feb 18, 2026Updated last month
- ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement …☆45Aug 6, 2025Updated 7 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Fantastic Dungeons - 7DRL 2016☆10Mar 12, 2016Updated 10 years ago
- PyTorch-native post-training at scale☆656Updated this week
- ☆238Jan 5, 2026Updated 2 months ago
- A glowfic to epub converter.☆14Jan 25, 2026Updated 2 months ago
- Minimal open-source implementation of AlphaProof and HyperTree Proof Search.☆77Mar 9, 2026Updated 2 weeks ago
- Lightweight REST API for DuckDB with HTTP/2 streaming support.☆50Mar 13, 2026Updated 2 weeks ago
- Source code for ``How far can we go without convolution: Improving fully-connected networks'' published at ICLR 2016 workshop☆13Nov 10, 2015Updated 10 years ago
- ☆35Sep 22, 2025Updated 6 months ago
- ☆10Jan 10, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Jul 25, 2023Updated 2 years ago
- ☆94Jan 21, 2026Updated 2 months ago
- An unofficial MCP interface to interact with the PapersWithCode API☆18Jun 7, 2025Updated 9 months ago
- model UI experiments☆13Aug 20, 2024Updated last year
- ☆42Updated this week
- Measuring General Intelligence With Generated Games (Preprint)☆25Jul 30, 2025Updated 7 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year