ππ§ Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
β53Jul 9, 2025Updated 8 months ago
Alternatives and similar repositories for agential
Users that are interested in agential are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automationβ33May 23, 2025Updated 10 months ago
- β15Jun 30, 2025Updated 8 months ago
- Small, simple agent task environments for training and evaluationβ19Nov 1, 2024Updated last year
- Open Agent Computer Interfaceβ94Nov 26, 2024Updated last year
- code for "Natural Language to Code Translation with Execution"β41Nov 2, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β45Feb 15, 2024Updated 2 years ago
- LLM reads a paper and produce a working prototypeβ62Apr 12, 2025Updated 11 months ago
- [ICLR 2025] SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Modelsβ17Sep 17, 2025Updated 6 months ago
- Repository with the code for training and evaluating the baseline model of the SynthSOD dataset.β12Oct 13, 2025Updated 5 months ago
- β23Sep 19, 2024Updated last year
- Access Jina AI news via ssh guest@news.jina.aiβ13May 3, 2024Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)β37Dec 29, 2024Updated last year
- Source code and data for Counterfactual Recipe Generation: Exploring Modelsβ Compositional Generalization Ability in a Realistic Scenarioβ¦β15Oct 25, 2022Updated 3 years ago
- β31Jul 3, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- β31Jan 18, 2025Updated last year
- Code for Columbia University COMS 3997 β LLM Ethics and Foundationsβ14Jan 7, 2025Updated last year
- Your Python AI Coder!β36May 21, 2025Updated 10 months ago
- WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Webβ11Mar 8, 2021Updated 5 years ago
- A Python library to orchestrate LLMs in a neural network-inspired structureβ52Oct 4, 2024Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)β93Dec 12, 2025Updated 3 months ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.β13Apr 21, 2022Updated 3 years ago
- π AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resourceβ¦β392Feb 17, 2026Updated last month
- A Docusaurus plugin that generates a concatenated markdown file from your documentation under /llms.txtβ31Nov 15, 2024Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-β¦β25Nov 17, 2024Updated last year
- β15Sep 24, 2022Updated 3 years ago
- Natural language dataset for training a Conversational Recommender Systemβ11Jul 9, 2019Updated 6 years ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive argumentsβ98Oct 3, 2025Updated 5 months ago
- This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to aβ¦β29Feb 23, 2025Updated last year
- Demonstration-Free: Towards More Practical Log Parsing with Large Language Modelsβ26Jun 17, 2025Updated 9 months ago
- Makes it easy to use altair from FastHTMLβ28Oct 9, 2024Updated last year
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language modelsβ10Feb 20, 2025Updated last year
- π Logging utilities for spaCyβ12Nov 3, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023β¦β31Jun 4, 2023Updated 2 years ago
- [COLM '25] Single-Pass Document Scanning for Question Answeringβ12Aug 20, 2025Updated 7 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception oβ¦β29Jul 9, 2025Updated 8 months ago
- Official Implementation for "EmojiLM: Modeling the New Emoji Language"β12Feb 23, 2024Updated 2 years ago
- The 4th rank system of the SemEval 2021 Task4.β10May 7, 2022Updated 3 years ago
- ACM UMAP2020 Hands-on Tutorial on Data and Algorithmic Bias in Recommender Systemsβ10May 23, 2021Updated 4 years ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ118Apr 9, 2025Updated 11 months ago