Framework and toolkits for building and evaluating collaborative agents that can work together with humans.
☆124Dec 4, 2025Updated 3 months ago
Alternatives and similar repositories for collaborative-gym
Users that are interested in collaborative-gym are comparing it to the libraries listed below
Sorting:
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆17Oct 15, 2025Updated 4 months ago
- ☆13Feb 4, 2025Updated last year
- Reproducible Language Agent Research☆34Jun 25, 2025Updated 8 months ago
- ☆11Jan 3, 2024Updated 2 years ago
- Azure Command-Line Interface☆12Dec 10, 2023Updated 2 years ago
- code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations☆11Oct 13, 2023Updated 2 years ago
- Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"☆16Aug 2, 2025Updated 7 months ago
- An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero huma…☆21Feb 25, 2026Updated last week
- ☆10Jun 15, 2024Updated last year
- Keras Implementation of DDPG(Deep Deterministic Policy Gradient) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆13Mar 25, 2023Updated 2 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆78May 2, 2025Updated 10 months ago
- ☆16Apr 19, 2021Updated 4 years ago
- A data construction and evaluation framework to quantify privacy norm awareness of language models (LMs) and emerging privacy risk of LM …☆43Mar 4, 2025Updated last year
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40May 26, 2025Updated 9 months ago
- ☆15Mar 26, 2024Updated last year
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Jun 10, 2021Updated 4 years ago
- Repository containing code for the NAACL 2021 paper (Incorporating External Knowledge to Enhance Tabular Reasoning)☆17Jun 20, 2021Updated 4 years ago
- A toolkit to induce interpretable workflows from raw computer-use activities.☆39Nov 13, 2025Updated 3 months ago
- The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …☆14Jun 23, 2024Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Jun 11, 2025Updated 8 months ago
- 🐝 SwarmBench: Benchmarking LLMs' Swarm Intelligence☆30May 21, 2025Updated 9 months ago
- ☆27Nov 27, 2025Updated 3 months ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆28Feb 25, 2025Updated last year
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization☆16Dec 22, 2024Updated last year
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆644Jul 29, 2025Updated 7 months ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".☆16Aug 14, 2020Updated 5 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆20Feb 9, 2024Updated 2 years ago
- The course introduces the use of open-source large language models (LLMs) from the Hugging Face ecosystem for research in the behavioral …☆20Jun 13, 2025Updated 8 months ago
- PRODIGy is a collection of dialogues in which each conversation is aligned with speaker profile representations.☆19Jan 8, 2025Updated last year
- ☆53Feb 19, 2025Updated last year
- ☆24Apr 3, 2025Updated 11 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆281Jan 23, 2026Updated last month
- Drop-in environment replacements that make your RL algorithm train faster.☆21Jun 19, 2024Updated last year
- ☆19Jan 3, 2025Updated last year
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆29Oct 18, 2024Updated last year
- ☆25May 28, 2025Updated 9 months ago
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 7 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆56Jul 11, 2025Updated 7 months ago