๐ง Compare how Agent systems perform on several benchmarks. ๐๐
โ103Aug 4, 2025Updated 7 months ago
Alternatives and similar repositories for agent_reasoning_benchmark
Users that are interested in agent_reasoning_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Beating the GAIA benchmark with Transformers Agents. ๐โ150Feb 19, 2025Updated last year
- โ18Jun 26, 2024Updated last year
- Baker is an AI powered app that helps you find recipes and avoid food wasteโ14Jan 4, 2025Updated last year
- โ15Jan 19, 2023Updated 3 years ago
- LLM as a Chatbot Serviceโ17Aug 28, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive โข AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- The motion model software development kit for the LandSAR search and rescue software platformโ18Feb 3, 2026Updated last month
- The LandSAR search and rescue platformโ11Dec 19, 2025Updated 3 months ago
- โ25May 28, 2025Updated 9 months ago
- โ12May 7, 2022Updated 3 years ago
- โ126Aug 13, 2024Updated last year
- Sets up ComfyUI on MacOS/Linux/Windows and runs a workflow json.โ32May 7, 2025Updated 10 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology Viewโ120Jun 6, 2025Updated 9 months ago
- Synthetic QA generation for long documents.โ16Jul 22, 2022Updated 3 years ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"โ13Jun 22, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- โ24Oct 23, 2025Updated 5 months ago
- โ27Apr 3, 2025Updated 11 months ago
- website repo for agent-based social movement simulationโ27Jun 17, 2024Updated last year
- โ13May 16, 2019Updated 6 years ago
- This project contains the original white paper for Language Construct Modeling (LCM) v1.13, authored by Vincent Shing Hin Chong. It introโฆโ15Jul 23, 2025Updated 8 months ago
- You like pytorch? You like micrograd? You love tinygrad! โค๏ธโ18Feb 14, 2025Updated last year
- Bridge for audio transcription between Open-WebUI and Whisper, returns text in JSON format.โ16Nov 5, 2024Updated last year
- โ10Oct 18, 2021Updated 4 years ago
- Generate Python docstrings automatically with LLM and syntax treesโ20Jun 13, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean โข AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- โ97Dec 16, 2024Updated last year
- We define and estimate smooth unique information of samples with respect to classifier weights and predictions. We compute these quantitiโฆโ11Mar 9, 2021Updated 5 years ago
- This repository contains PyTorch implemenation of WWW 2023 research paper: Optimizing Feature Set for Click-through Rate Prediction.โ12Oct 23, 2023Updated 2 years ago
- Community evolution on Stack Overflowโ10Dec 10, 2018Updated 7 years ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environmentsโ14Oct 25, 2023Updated 2 years ago
- ไธไธชๅบไบ Flask ็้ฎๅท่ฐๆฅๅบ็จใโ11Feb 2, 2023Updated 3 years ago
- โ11Dec 11, 2024Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPOโ116Dec 30, 2023Updated 2 years ago
- Closed-loop simulator of complex behavior and learning based on reinforcement learning and deep neural networksโ12Mar 20, 2026Updated last week
- DigitalOcean Gradient AI Platform โข AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Task-Guided Pair Embedding in Heterogeneous Network (CIKM 2019)โ12Aug 19, 2021Updated 4 years ago
- Metaprompt is an AI-powered prompt generator developed by Anthropic. This is the unofficial Metaprompt Community Github repo. All PRs areโฆโ13Mar 19, 2024Updated 2 years ago
- ๐ฑ Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMsโ71Mar 21, 2025Updated last year
- โ11Oct 17, 2024Updated last year
- Multi-objective reinforcement learning for covid-19 controlโ12Aug 12, 2021Updated 4 years ago
- โ14Dec 26, 2023Updated 2 years ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]โ148Nov 26, 2024Updated last year