A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning
☆83Jan 16, 2026Updated 5 months ago
Alternatives and similar repositories for meow-tea-taro
Users that are interested in meow-tea-taro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025 D&B Track] MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆31May 8, 2026Updated last month
- Region Encoder Network☆21Oct 2, 2025Updated 8 months ago
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆28Aug 7, 2025Updated 10 months ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆30May 9, 2026Updated last month
- The Definitive guide to OpenSearch☆21Mar 2, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Deep RL Wordle Bot☆12Dec 6, 2022Updated 3 years ago
- [NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding☆96Dec 14, 2025Updated 6 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 10 months ago
- ☆12Jan 25, 2024Updated 2 years ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated last year
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- ☆88Sep 15, 2025Updated 9 months ago
- ☆34Jan 9, 2026Updated 5 months ago
- ☆35May 16, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆32Jun 12, 2024Updated 2 years ago
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…☆28Jan 21, 2025Updated last year
- DMALab's reading group slides and papers.☆16Jun 8, 2021Updated 5 years ago
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Jun 6, 2024Updated 2 years ago
- ☆30Nov 9, 2025Updated 7 months ago
- Natural Language Reinforcement Learning☆101Jul 30, 2025Updated 11 months ago
- Official Code for MIMETIC^2☆13Nov 19, 2024Updated last year
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆103Oct 27, 2025Updated 8 months ago
- ☆116Jan 21, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆523Jun 20, 2026Updated last week
- ☆13Dec 4, 2024Updated last year
- ☆15Oct 5, 2025Updated 8 months ago
- Resa: Transparent Reasoning Models via SAEs☆49Sep 23, 2025Updated 9 months ago
- Benchmark and optimize LLM inference across frameworks with ease☆194Sep 12, 2025Updated 9 months ago
- source files for GloBI website☆10Updated this week
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated 2 years ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Oct 19, 2025Updated 8 months ago
- Implementation and modification of CartoonGAN☆20Mar 15, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆97Oct 30, 2025Updated 8 months ago
- segment a set of rasters using rasterio and skimage☆12Feb 13, 2017Updated 9 years ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago
- Neutral landscape generator that allows users to set targets on landscape indices.☆13May 22, 2024Updated 2 years ago
- All information and news with respect to Falcon-H1 series☆119Oct 9, 2025Updated 8 months ago
- ☆10Dec 17, 2020Updated 5 years ago
- 🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…☆141Nov 10, 2025Updated 7 months ago