agential-ai/agential

🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!

☆54

Alternatives and similar repositories for agential

Users that are interested in agential are comparing it to the libraries listed below.

JoshuaPurtell / LRCBench
Evals meant to evaluate language models' ability to reason over long contexts.
☆10 · Sep 12, 2024 · Updated last year
JoshuaPurtell / SmallBench
Small, simple agent task environments for training and evaluation
☆20 · Nov 1, 2024 · Updated last year
znah / tt09
☆15 · Jun 30, 2025 · Updated last year
jina-ai / terminal-news
Access Jina AI news via ssh guest@news.jina.ai
☆12 · May 3, 2024 · Updated 2 years ago
zlwang-cs / OfficeBench
OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation
☆41 · Apr 1, 2026 · Updated 3 months ago
phunterlau / paper_without_code
LLM reads a paper and produces a working prototype
☆65 · Apr 12, 2025 · Updated last year
xxxiaol / counterfactual-recipe-generation
Source code and data for Counterfactual Recipe Generation: Exploring Models’ Compositional Generalization Ability in a Realistic Scenario…
☆15 · Oct 25, 2022 · Updated 3 years ago
datacommonsorg / schema
☆21 · Updated this week
dinobby / MAgICoRE
☆23 · Sep 19, 2024 · Updated last year
mustafamariam / LLM-Connections-Solver
Code for Columbia University COMS 3997 – LLM Ethics and Foundations
☆16 · Jan 7, 2025 · Updated last year
biasinrecsys / wsdm2021
WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Web
☆11 · Mar 8, 2021 · Updated 5 years ago
zhudotexe / redel
ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)
☆95 · May 11, 2026 · Updated 2 months ago
truemagic-coder / nemo-agent
Your Python AI Coder!
☆36 · May 21, 2025 · Updated last year
mlpc-ucsd / XTRA
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
☆16 · Apr 30, 2023 · Updated 3 years ago
maitrix-org / dynamic-alignment-optimization
[EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…
☆24 · Nov 17, 2024 · Updated last year
koaning / fh-altair
Makes it easy to use altair from FastHTML
☆28 · Oct 9, 2024 · Updated last year
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆103 · Oct 3, 2025 · Updated 9 months ago
agentsea / surfkit
A toolkit for building computer use AI agents
☆194 · Jun 26, 2025 · Updated last year
COGS108 / FinalProjects-Sp20
COGS 108 Final Projects (Spring 2020) - Data Science in Practice by Prof. Shannon Ellis
☆12 · Aug 12, 2020 · Updated 5 years ago
zyxnlp / ACLT
☆16 · Mar 25, 2022 · Updated 4 years ago
tjaensch / streamlit_openai_chat_with_docs
Streamlit OpenAI app to chat with custom text documents of all kinds
☆13 · Apr 11, 2026 · Updated 3 months ago
KomeijiForce / MetaIE
This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to a…
☆30 · Feb 23, 2025 · Updated last year
xxxiaol / magic-if
Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…
☆31 · Jun 4, 2023 · Updated 3 years ago
bbc / dsrp_bbcavs10k_distribution
Repo for the BBCAVS10k distribution
☆10 · Nov 27, 2024 · Updated last year
weilicao / SPScanner
[COLM '25] Single-Pass Document Scanning for Question Answering
☆14 · Aug 20, 2025 · Updated 11 months ago
agentsea / osuniverse
Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agents
☆24 · May 7, 2025 · Updated last year
psunlpgroup / VisOnlyQA
This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…
☆29 · Jul 9, 2025 · Updated last year
UCSD-Historical-Enrollment-Data / webreg_scraper
A scraper and API for WebReg, UCSD's course enrollment system.
☆12 · Mar 7, 2026 · Updated 4 months ago
chili-lab / SPORTU
[ICLR 2025] SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Models
☆20 · Sep 17, 2025 · Updated 10 months ago
KomeijiForce / EmojiLM
Official Implementation for "EmojiLM: Modeling the New Emoji Language"
☆12 · Feb 23, 2024 · Updated 2 years ago
zjunlp / WorldMind
Aligning Agentic World Models via Knowledgeable Experience Learning
☆37 · May 15, 2026 · Updated 2 months ago
PathOnAIOrg / LiteMultiAgent
The Library for LLM-based multi-agent applications
☆103 · Jul 18, 2025 · Updated last year
zjunlp / SemEval2021Task4
The 4th rank system of the SemEval 2021 Task4.
☆10 · May 7, 2022 · Updated 4 years ago
planned-diffusion / planned-diffusion
☆20 · Nov 14, 2025 · Updated 8 months ago
biasinrecsys / umap2020
ACM UMAP2020 Hands-on Tutorial on Data and Algorithmic Bias in Recommender Systems
☆10 · May 23, 2021 · Updated 5 years ago
MohammadHeydari / Persian_FastText
Persian Word Embedding Using FastText Pre-trained Model
☆13 · May 29, 2026 · Updated 2 months ago
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆121 · Apr 27, 2026 · Updated 3 months ago
deep-diver / TFX-WandB
☆15 · Jun 30, 2023 · Updated 3 years ago
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆30 · Dec 10, 2024 · Updated last year