A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning
☆81Jan 16, 2026Updated 2 months ago
Alternatives and similar repositories for meow-tea-taro
Users that are interested in meow-tea-taro are comparing it to the libraries listed below.
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆26Apr 1, 2026Updated last week
- ☆23Apr 2, 2026Updated last week
- DUNL - Neuron 2025☆25Jan 18, 2026Updated 2 months ago
- A Deep RL Wordle Bot☆12Dec 6, 2022Updated 3 years ago
- [NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding☆86Dec 14, 2025Updated 3 months ago
- ☆24Mar 1, 2025Updated last year
- ☆21Sep 7, 2025Updated 7 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 7 months ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated 11 months ago
- ☆79Sep 15, 2025Updated 6 months ago
- An Open-Ended Agentic Simulator☆60Aug 11, 2024Updated last year
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- ☆32Jan 9, 2026Updated 3 months ago
- A repository for training nanogpt-based Chess playing language models.☆28Apr 25, 2024Updated last year
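- Txing is an online programming learning platform based on Vue3 + SpringBoot, integrating modules for online problem solving, programming contests, instant messaging, article writing, video tutorials, and a technical forum. The frontend is built with Vue3 + TypeScript + Arco Design, and the backend uses SpringBoot + …☆27Feb 27, 2025Updated last year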
- ☆35May 16, 2025Updated 10 months ago
- An Efficient BPE Algorithm Faster than Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- Data and codes for MetroGAN☆16Dec 23, 2024Updated last year
- ☆31Jun 12, 2024Updated last year
- A Qt-based chat and social networking application☆17Nov 19, 2022Updated 3 years ago
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…☆28Jan 21, 2025Updated last year
- ☆60Sep 17, 2025Updated 6 months ago
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Jun 6, 2024Updated last year
- DMALab's reading group slides and papers.☆16Jun 8, 2021Updated 4 years ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆98Oct 27, 2025Updated 5 months ago
- ☆29Nov 9, 2025Updated 5 months ago
- This is a Python implementation of AlphaZero (for chess) using a custom GUI☆20Aug 1, 2018Updated 7 years ago
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 8 months ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago
- ☆116Jan 21, 2025Updated last year
- Data for paper "CC-Riddle: A Question Answering Dataset of Chinese Character Riddles": https://arxiv.org/abs/2206.13778☆20Aug 19, 2023Updated 2 years ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆465Mar 26, 2026Updated 2 weeks ago
- Benchmark and optimize LLM inference across frameworks with ease☆178Sep 12, 2025Updated 6 months ago
- ☆13Oct 5, 2025Updated 6 months ago
- Resa: Transparent Reasoning Models via SAEs☆48Sep 23, 2025Updated 6 months ago
- Fine-tuning embedding models.☆14Nov 25, 2024Updated last year
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- Official Repository of Native Parallel Reasoner☆105Feb 5, 2026Updated 2 months ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Oct 19, 2025Updated 5 months ago