Lightly-reviewed collection of community environments
☆215Updated this week
Alternatives and similar repositories for community-environments
Users that are interested in community-environments are comparing it to the libraries listed below
Sorting:
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆161Updated this week
- Async RL Training at Scale☆1,096Updated this week
- Our library for RL environments + evals☆3,869Updated this week
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆25May 31, 2025Updated 9 months ago
- Swift Implementation of the Model Context Protocol (MCP) Spec☆10Mar 28, 2025Updated 11 months ago
- ☆17Apr 11, 2025Updated 10 months ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆17Mar 1, 2023Updated 3 years ago
- Modded vLLM to run pipeline parallelism over public networks☆40May 20, 2025Updated 9 months ago
- Generate VM's with kernel tracing, code sandboxing and security profiles for long running agents.☆24Jan 5, 2026Updated last month
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆32May 30, 2025Updated 9 months ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆14Apr 25, 2024Updated last year
- Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training☆54Jul 28, 2025Updated 7 months ago
- NeuroBLAST v3 architecture code☆36Jan 6, 2026Updated last month
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Handwritten Number Recognition using CNN and Character Segmentation☆18Apr 20, 2018Updated 7 years ago
- Goldfish: Monolingual language models for 350 languages.☆23Aug 25, 2024Updated last year
- ☆47Jun 20, 2024Updated last year
- Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization☆20Dec 1, 2025Updated 3 months ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Over 60 figures and diagrams of LLMs, quantization, low-rank adapters (LoRA), and chat templates FREE TO USE in your blog posts, slides, …☆22Feb 18, 2025Updated last year
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19May 10, 2024Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆131Feb 21, 2026Updated last week
- ☆16May 5, 2022Updated 3 years ago
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆22Apr 26, 2023Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- rl from zero pretrain, can it be done? yes.☆287Sep 28, 2025Updated 5 months ago
- Reproducible, flexible LLM evaluations☆340Jan 28, 2026Updated last month
- Convert GitHub PRs into Harbor tasks☆44Updated this week
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆27Jul 23, 2025Updated 7 months ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆64Dec 10, 2025Updated 2 months ago
- Collection of resources for RL and Reasoning☆27Feb 3, 2025Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration☆67Feb 12, 2025Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Nov 4, 2024Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆28Dec 10, 2024Updated last year
- Microprocessor 2 Lab Template☆11Apr 29, 2024Updated last year
- Leader Arm Design for ARX R5/X5 and Trossen WidowX AI☆47Sep 5, 2025Updated 5 months ago
- This tool allows local LLM usage that can automate tasks without human interventention. The agent can call itself recursively and work on…☆20May 5, 2025Updated 9 months ago
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆79Jul 31, 2025Updated 7 months ago
- Official Code Release for "Training a Generally Curious Agent"☆45May 18, 2025Updated 9 months ago