A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning
☆82Jan 16, 2026Updated 4 months ago
Alternatives and similar repositories for meow-tea-taro
Users that are interested in meow-tea-taro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025 D&B Track] MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆30May 8, 2026Updated last week
- DUNL - Neuron 2025☆25Jan 18, 2026Updated 4 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 8 months ago
- ☆81Sep 15, 2025Updated 8 months ago
- Tensorflow 2.0 Implement of AnimeGAN☆12Apr 26, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- ☆20Jan 27, 2024Updated 2 years ago
- ☆35May 16, 2025Updated last year
- Data and codes for MetroGAN☆16Dec 23, 2024Updated last year
- An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…☆28Jan 21, 2025Updated last year
- DMALab's reading group slides and papers.☆16Jun 8, 2021Updated 4 years ago
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Jun 6, 2024Updated last year
- ☆29Nov 9, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 9 months ago
- ☆116Jan 21, 2025Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆40Dec 27, 2022Updated 3 years ago
- Simplifying the autonomous vehicle development process.☆19Jul 22, 2025Updated 9 months ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆492May 12, 2026Updated last week
- ☆13Oct 5, 2025Updated 7 months ago
- Fine-tuning embedding models.☆14Nov 25, 2024Updated last year
- [ICML 2026] Reasoning in Parallelism via Self-Distilled RL☆110Feb 5, 2026Updated 3 months ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆19Apr 2, 2024Updated 2 years ago
- All information and news with respect to Falcon-H1 series☆117Oct 9, 2025Updated 7 months ago
- ☆95Oct 30, 2025Updated 6 months ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago
- 🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…☆137Nov 10, 2025Updated 6 months ago
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- ☆10Dec 17, 2020Updated 5 years ago
- A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization☆18Dec 22, 2024Updated last year
- Codebase for LangNav paper☆19Jun 13, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆22Oct 8, 2023Updated 2 years ago
- ICS Seminar 21, 2018 @ PKU☆17Jan 2, 2019Updated 7 years ago
- Open-sourced evaluation suite from the Monitoring Monitorability paper☆75Apr 22, 2026Updated 3 weeks ago
- Multimodal RewardBench☆68Feb 21, 2025Updated last year
- ROS 2 interface and imitation learning pipeline for the Tidybot++ mobile manipulator☆29Mar 30, 2026Updated last month
- A collection of AWESOME language modeling techniques on tabular data applications.☆34Oct 14, 2024Updated last year
- A framework for evaluating Machine Translation models.☆12Apr 21, 2026Updated last month