☆92Jun 30, 2025Updated 9 months ago
Alternatives and similar repositories for pokemon-gym
Users that are interested in pokemon-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 8 months ago
- Official repository of the NeurIPS 2025 Competition: The PokeAgent Challenge: Competitive and Long-Context Learning at Scale. (Track 2, S…☆81Updated this week
- A paper list for box embeddings☆17Jun 9, 2021Updated 4 years ago
- [EMNLP-2025] R1-Zero on ANY TASK☆30Nov 9, 2025Updated 4 months ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo…☆12Feb 23, 2025Updated last year
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆56Jul 11, 2025Updated 8 months ago
- ☆27Mar 10, 2026Updated 2 weeks ago
- ☆15Feb 23, 2026Updated last month
- ☆79Mar 6, 2026Updated 3 weeks ago
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated 2 months ago
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated 9 months ago
- NLP2025 のチュートリアル「地理情報と言語処理 実践入門」の資料とソースコード☆17Updated this week
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆142Mar 11, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆24Sep 23, 2025Updated 6 months ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆337Nov 2, 2025Updated 4 months ago
- Fine-tune of Florence-2 for shot categorization.☆26Mar 6, 2025Updated last year
- [WWW2022] Geometric Graph Representation Learning via Maximizing Rate Reduction☆26May 27, 2022Updated 3 years ago
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆45Sep 19, 2025Updated 6 months ago
- ☆87Aug 16, 2025Updated 7 months ago
- ☆48Oct 20, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆34Jul 29, 2025Updated 8 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆62Oct 24, 2025Updated 5 months ago
- ☆43Apr 10, 2025Updated 11 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 5 months ago
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆17Mar 26, 2025Updated last year
- ☆23Apr 4, 2024Updated last year
- An optimization algorithm for the design of pneumatic soft robots.☆14Jul 30, 2025Updated 8 months ago
- R1-like Computer-use Agent☆89Mar 21, 2025Updated last year
- Official implementation of TBA for async LLM post-training.☆29Nov 5, 2025Updated 4 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)☆25Apr 11, 2022Updated 3 years ago
- AI powered Virtual Desktop☆17Updated this week
- ☆163Nov 24, 2025Updated 4 months ago
- Small, simple agent task environments for training and evaluation☆19Nov 1, 2024Updated last year
- Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis.☆13Mar 20, 2025Updated last year
- ☆14Dec 12, 2024Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago