Comprehensive LLM evaluation framework: GPQA Diamond to Chatbot Arena. Tests all major models equally, easily extensible.
☆17Aug 22, 2024Updated last year
Alternatives and similar repositories for BenchmarkAggregator
Users that are interested in BenchmarkAggregator are comparing it to the libraries listed below.
- GOPHI: an AMR-to-English Verbalizer☆11Feb 5, 2020Updated 6 years ago
- ☆10Jun 11, 2019Updated 6 years ago
- a Haskell library that implements (Projective) Discourse Representation Theory (DRT)☆27Sep 15, 2022Updated 3 years ago
- ☆41Jan 25, 2026Updated 3 months ago
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 8 years ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆14May 25, 2023Updated 2 years ago
- Data and all☆14Sep 30, 2019Updated 6 years ago
- Implement the essential operators from Allen's Interval Algebra, and also some metaprogramming for combinatorial operators☆13Oct 17, 2018Updated 7 years ago
- Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.☆13Nov 17, 2020Updated 5 years ago
- Automated Semantic Analysis of Discourse Markers☆11May 30, 2022Updated 3 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Nov 9, 2021Updated 4 years ago
- An implementation of Defeasible Logic in Python☆15Sep 2, 2018Updated 7 years ago
- Allows two LLMs to communicate and run code in the terminal☆28Dec 8, 2024Updated last year
- ☆13Jul 28, 2023Updated 2 years ago
- 🐍A curated list of awesome python environment.☆13Apr 21, 2020Updated 6 years ago
- ☆18Feb 29, 2024Updated 2 years ago
- Collects a multimodal dataset of Wikipedia articles and their images☆16Mar 25, 2023Updated 3 years ago
- Analytic tableau based minimal model generator, model checker and theorem prover for first-order logic with modal extensions☆20Aug 22, 2025Updated 8 months ago
- Neural Unification for Logic Reasoning over Language☆22Nov 15, 2021Updated 4 years ago
- ☆22May 4, 2024Updated last year
- Final Year Masters Project: modal logic solver tableaux☆25May 26, 2022Updated 3 years ago
- Repository for DISRPT2023 shared task☆17Jul 26, 2024Updated last year
- ☆19Dec 26, 2022Updated 3 years ago
- Classical CHAT80 NLP system for Prolog☆25Feb 27, 2025Updated last year
- ☆26Aug 2, 2025Updated 9 months ago
- [ACL 2024] DiFiNet: Boundary-Aware Semantic Differentiation and Filtration Network for Nested Named Entity Recognition☆17Oct 2, 2024Updated last year
- Firecracker VM orchestration for Claude Code sessions☆27Mar 30, 2026Updated last month
- A web interactive tool for building proofs in the sequent calculus of Linear Logic, with its backend written in OCaml☆24Apr 7, 2025Updated last year
- ☆26Apr 15, 2023Updated 3 years ago
- ☆13Apr 6, 2025Updated last year
- Simple Streamlit application for demonstrating multimodal prompting with the Anthropic Claude 3 model family on Amazon Bedrock☆17Dec 5, 2024Updated last year
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper.☆12Nov 4, 2021Updated 4 years ago
- Create your own RVC v2 dataset from a youtube video☆31Jan 27, 2024Updated 2 years ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- ScribePal is an Open Source intelligent browser extension that leverages AI to empower your web experience by providing contextual insigh…☆21Apr 6, 2026Updated 3 weeks ago
- ☆18Feb 23, 2025Updated last year
- ☆12May 30, 2025Updated 11 months ago
- Write formal proofs in natural language and LaTeX.☆48Dec 18, 2025Updated 4 months ago
- ☆19Sep 24, 2024Updated last year