Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
☆53 · Jul 9, 2025 · Updated 9 months ago
Alternatives and similar repositories for agential
Users that are interested in agential are comparing it to the libraries listed below.
- Evals meant to evaluate language models' ability to reason over long contexts. ☆10 · Sep 12, 2024 · Updated last year
- OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation ☆35 · Apr 1, 2026 · Updated 2 weeks ago
- Small, simple agent task environments for training and evaluation ☆19 · Nov 1, 2024 · Updated last year
- Jason Meridth's blog ☆13 · Updated this week
- Open Agent Computer Interface ☆97 · Nov 26, 2024 · Updated last year
- code for "Natural Language to Code Translation with Execution" ☆41 · Nov 2, 2022 · Updated 3 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆45 · Feb 15, 2024 · Updated 2 years ago
- LLM reads a paper and produces a working prototype ☆62 · Apr 12, 2025 · Updated last year
- Repository with the code for training and evaluating the baseline model of the SynthSOD dataset. ☆12 · Oct 13, 2025 · Updated 6 months ago
- ☆23 · Sep 19, 2024 · Updated last year
- Access Jina AI news via ssh guest@news.jina.ai ☆13 · May 3, 2024 · Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024) ☆37 · Dec 29, 2024 · Updated last year
- How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions? ☆13 · Aug 16, 2023 · Updated 2 years ago
- ☆32 · Jul 3, 2025 · Updated 9 months ago
- ☆31 · Jan 18, 2025 · Updated last year
- ☆21 · Apr 9, 2026 · Updated last week
- Code for Columbia University COMS 3997 - LLM Ethics and Foundations ☆15 · Jan 7, 2025 · Updated last year
- ☆108 · Oct 9, 2025 · Updated 6 months ago
- Modern markdown blogging platform built with Next.js 14 and Supabase. Features rich content editing with live preview, one-click SEO opti… ☆19 · Apr 8, 2026 · Updated last week
- Your Python AI Coder! ☆36 · May 21, 2025 · Updated 10 months ago
- WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Web ☆11 · Mar 8, 2021 · Updated 5 years ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆92 · Apr 1, 2026 · Updated 2 weeks ago
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper). ☆20 · Oct 10, 2022 · Updated 3 years ago
- Training GPTs to solve interaction nets ☆18 · Aug 14, 2024 · Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆53 · Oct 4, 2024 · Updated last year
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning ☆16 · Apr 30, 2023 · Updated 2 years ago
- ☆16 · Mar 25, 2022 · Updated 4 years ago
- Natural language dataset for training a Conversational Recommender System ☆11 · Jul 9, 2019 · Updated 6 years ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆99 · Oct 3, 2025 · Updated 6 months ago
- This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to a… ☆29 · Feb 23, 2025 · Updated last year
- Demonstration-Free: Towards More Practical Log Parsing with Large Language Models ☆28 · Jun 17, 2025 · Updated 9 months ago
- A toolkit for building computer use AI agents ☆193 · Jun 26, 2025 · Updated 9 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM. ☆38 · Sep 11, 2024 · Updated last year
- The Library for LLM-based multi-agent applications ☆102 · Jul 18, 2025 · Updated 8 months ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023… ☆31 · Jun 4, 2023 · Updated 2 years ago
- AI powered Chatbot with real time updates. ☆75 · Oct 25, 2024 · Updated last year
- [COLM '25] Single-Pass Document Scanning for Question Answering ☆13 · Aug 20, 2025 · Updated 7 months ago
- Repo for the BBCAVS10k distribution ☆10 · Nov 27, 2024 · Updated last year
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o… ☆29 · Jul 9, 2025 · Updated 9 months ago