pearls-lab/meow-tea-taro

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pearls-lab/meow-tea-taro)

pearls-lab / meow-tea-taro

A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning

☆83

Alternatives and similar repositories for meow-tea-taro

Users that are interested in meow-tea-taro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tinaliu0123 / speculative-verdict
View on GitHub
Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation (ICLR 2026)
☆21Apr 27, 2026Updated 2 months ago
microsoft / tale-suite
View on GitHub
Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.
☆30Updated this week
SimWorld-AI / DeliveryBench
View on GitHub
DeliveryBench: Can Agents Earn Profit in Real World?
☆18Feb 11, 2026Updated 5 months ago
SalesforceAIResearch / UserRL
View on GitHub
The raw UserRL repo under construction
☆111Jun 2, 2026Updated last month
SakanaAI / TransEvalnia
View on GitHub
Reasoning-based Evaluation and Ranking of Translations.
☆21Jun 2, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
sotopia-lab / sotopia-pi
View on GitHub
Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)
☆85May 7, 2024Updated 2 years ago
chentong0 / rl-binary-rar
View on GitHub
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15Nov 13, 2025Updated 8 months ago
BaohaoLiao / SAGE
View on GitHub
Self-Hinting Language Models Enhance Reinforcement Learning
☆26Mar 28, 2026Updated 3 months ago
howard-yen / SLIM
View on GitHub
☆27Jun 22, 2026Updated last month
OoDBag / VisTA
View on GitHub
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
☆27May 31, 2025Updated last year
Time-Search / TimeSearch-R
View on GitHub
[ICLR 2026] Official code for paper: TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinf…
☆27Jan 29, 2026Updated 5 months ago
WentseChen / Verlog
View on GitHub
Verlog: A Multi-turn RL framework for LLM agents
☆73Apr 28, 2026Updated 2 months ago
EvanZhuang / knowledge_flow
View on GitHub
Official Implementation of Knowledge Flow Prompting
☆35Oct 20, 2025Updated 9 months ago
weizhepei / WebAgent-R1
View on GitHub
[EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
☆94Nov 4, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
gangiswag / infogent
View on GitHub
☆24Mar 1, 2025Updated last year
THUDM / AgentRL
View on GitHub
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
☆321Jan 17, 2026Updated 6 months ago
Kwai-Klear / mini-swe-agent-plus
View on GitHub
mini-swe-agent-plus: a tiny (~100 LOC) GitHub issue fixer—now with a robust multi-line text edit tool.
☆24Jan 20, 2026Updated 6 months ago
SWE-Gym / SWE-Gym
View on GitHub
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆708Jul 29, 2025Updated 11 months ago
neulab / agent-data-protocol
View on GitHub
☆187Jul 14, 2026Updated last week
andrewkho / wordle-solver
View on GitHub
A Deep RL Wordle Bot
☆12Dec 6, 2022Updated 3 years ago
danieldritter / OAPL
View on GitHub
☆30Feb 24, 2026Updated 4 months ago
BKHMSI / mixture-of-cognitive-reasoners
View on GitHub
Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
☆46Feb 7, 2026Updated 5 months ago
NovaSky-AI / SkyRL
View on GitHub
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,085Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OpenRewardAI / openreward-cookbook
View on GitHub
Training and evaluating with OpenReward
☆33Apr 28, 2026Updated 2 months ago
Infini-AI-Lab / Sparrow
View on GitHub
☆16Jun 15, 2026Updated last month
ruiyiw / patient-psi
View on GitHub
PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals (EMNLP 2024)
☆115Feb 17, 2026Updated 5 months ago
mll-lab-nu / RAGEN
View on GitHub
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,753Apr 14, 2026Updated 3 months ago
SagnikMukherjee / sparsity_in_rl
View on GitHub
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
☆15Oct 20, 2025Updated 9 months ago
ernie-research / MEnvAgent
View on GitHub
Official Code of MEnvAgent
☆23Feb 3, 2026Updated 5 months ago
jasonyux / TriPosT
View on GitHub
☆12Jan 25, 2024Updated 2 years ago
HKUNLP / critic-rl
View on GitHub
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127May 6, 2025Updated last year
WujiangXu / EPO
View on GitHub
The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"
☆40Jul 13, 2026Updated last week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sotopia-lab / sotopia-rl
View on GitHub
Sotopia-RL: Reward Design for Social Intelligence
☆52Apr 1, 2026Updated 3 months ago
agentica-project / rllm
View on GitHub
☆403Sep 17, 2025Updated 10 months ago
Yikai-Liao / efficient_bpe
View on GitHub
An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation
☆13Sep 9, 2024Updated last year
upup-wei / RAG-ReasonAlignment
View on GitHub
☆20May 20, 2025Updated last year
ahjwang / messenger-emma
View on GitHub
Implements the Messenger environment and EMMA model.
☆25Jun 14, 2023Updated 3 years ago
open-thoughts / OpenThoughts-Agent
View on GitHub
Data recipes and robust infrastructure for training AI agents
☆260Updated this week
TsinghuaC3I / FS-GEN
View on GitHub
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
☆13Nov 19, 2024Updated last year