arklexai / Agent-First-Organization
The official Python library for the Arklex framework
☆689 · Updated last week
Alternatives and similar repositories for Agent-First-Organization
Users who are interested in Agent-First-Organization are comparing it to the libraries listed below.
- 👩‍⚖️ Agent-as-a-Judge: The Magic for Open-Endedness ☆703 · Updated 7 months ago
- An agent benchmark with tasks in a simulated software company. ☆619 · Updated last month
- ☆641 · Updated 2 months ago
- Together Open Deep Research ☆356 · Updated 8 months ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle ☆302 · Updated 3 weeks ago
- 🦀 CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/ ☆388 · Updated last week
- AIDE: AI-Driven Exploration in the Space of Code. The machine learning engineering agent that automates AI R&D. ☆1,108 · Updated 2 months ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems ☆598 · Updated 4 months ago
- Agentic Web: Weaving the Next Web with AI Agents. ☆402 · Updated 3 months ago
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe… ☆266 · Updated 7 months ago
- AWM: Agent Workflow Memory ☆376 · Updated 2 weeks ago
- [ICLR 2025] Automated Design of Agentic Systems ☆1,485 · Updated 11 months ago
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects. ☆476 · Updated 2 months ago
- Readymade evaluators for agent trajectories ☆438 · Updated 4 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge! ☆461 · Updated last year
- ☆866 · Updated 4 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆346 · Updated last year
- ☆175 · Updated 10 months ago
- End-to-end Generative Optimization for AI Agents ☆704 · Updated last month
- AI benchmark runtime framework that allows you to integrate and evaluate AI tasks using Docker-based benchmarks. ☆169 · Updated 3 weeks ago
- Integrating Tool Use into LLM Reasoning ☆704 · Updated 10 months ago
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling ☆627 · Updated last month
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆650 · Updated 9 months ago
- [Up-to-date] Awesome Agentic Deep Research Resources ☆592 · Updated last week
- 🐉 Loong: Synthesize Long CoTs at Scale through Verifiers. ☆476 · Updated this week
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re… ☆492 · Updated this week
- MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating frontier models and agents… ☆680 · Updated this week
- Code and Data for Tau-Bench ☆1,048 · Updated 4 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆509 · Updated this week
- Quantalogic ReAct Agent - Coding Agent Framework - Gives a ⭐️ if you like the project ☆459 · Updated last month