A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning
☆83Jan 16, 2026Updated 4 months ago
Alternatives and similar repositories for meow-tea-taro
Users that are interested in meow-tea-taro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025 D&B Track] MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆30May 8, 2026Updated last month
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆30May 9, 2026Updated last month
- A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on anal…☆40Nov 26, 2025Updated 6 months ago
- ☆24May 26, 2026Updated 2 weeks ago
- The Definitive guide to OpenSearch☆20Mar 2, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆24Mar 1, 2025Updated last year
- [NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding☆94Dec 14, 2025Updated 5 months ago
- ☆21Sep 7, 2025Updated 9 months ago
- ☆28Jul 29, 2025Updated 10 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 9 months ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated last year
- ☆84Sep 15, 2025Updated 8 months ago
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- PARL (Parallel-Agent Reinforcement Learning) is a training paradigm that teaches models to decompose complex tasks into parallel subtasks…☆47Mar 24, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Jan 27, 2024Updated 2 years ago
- ☆34Jan 9, 2026Updated 5 months ago
- ☆35May 16, 2025Updated last year
- Data and codes for MetroGAN☆16Dec 23, 2024Updated last year
- ☆32Jun 12, 2024Updated last year
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…☆28Jan 21, 2025Updated last year
- This is the code of experiments in paper Cross-Domain Adversarial Auto-Encoder(https://arxiv.org/abs/1804.06078)☆12Apr 18, 2018Updated 8 years ago
- DMALab's reading group slides and papers.☆16Jun 8, 2021Updated 5 years ago
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Jun 6, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 人脸表情识别系统☆14Jun 1, 2020Updated 6 years ago
- ☆30Nov 9, 2025Updated 7 months ago
- 基于 Spring Boot + Spring Cloud Alibaba 微服务 + Docker + RabbitMQ + Es + Vue 3 的 编程算法题目在线评测系统☆23Sep 20, 2024Updated last year
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 10 months ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆102Oct 27, 2025Updated 7 months ago
- Data for paper "CC-Riddle: A Question Answering Dataset of Chinese Character Riddles": https://arxiv.org/abs/2206.13778☆20Aug 19, 2023Updated 2 years ago
- ☆116Jan 21, 2025Updated last year
- Simplifying the autonomous vehicle development process.☆19Jul 22, 2025Updated 10 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆510Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Dec 4, 2024Updated last year
- Scaling Motion Generation Model with Million-Level Human Motions (ICML 2025)☆69May 14, 2025Updated last year
- ☆15Oct 5, 2025Updated 8 months ago
- Resa: Transparent Reasoning Models via SAEs☆49Sep 23, 2025Updated 8 months ago
- Benchmark and optimize LLM inference across frameworks with ease☆189Sep 12, 2025Updated 8 months ago
- [ICML 2026] Reasoning in Parallelism via Self-Distilled RL☆112Feb 5, 2026Updated 4 months ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated 2 years ago