A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
☆32Mar 26, 2026Updated this week
Alternatives and similar repositories for clemcore
Users that are interested in clemcore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated 3 months ago
- Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…☆12Oct 21, 2022Updated 3 years ago
- ☆14Apr 10, 2024Updated last year
- ☆19Sep 16, 2025Updated 6 months ago
- PyPremise - Python tool for the Premise algorithm to identify patterns or explanations of where a machine learning classifier performs we…☆22Oct 27, 2025Updated 5 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆13Jul 26, 2023Updated 2 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Example☆23Nov 28, 2021Updated 4 years ago
- ☆12Nov 5, 2024Updated last year
- VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts.☆11Dec 20, 2018Updated 7 years ago
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking☆13Apr 11, 2025Updated 11 months ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 9 months ago
- Python library to add support for embedding natural code in Python with shared program state.☆24Jan 20, 2026Updated 2 months ago
- Video production for developers☆37Mar 19, 2026Updated last week
- Analyse des Pegida facebook Korpus☆10Jan 31, 2015Updated 11 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆36Jan 25, 2026Updated 2 months ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆17Apr 25, 2021Updated 4 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆16Aug 11, 2023Updated 2 years ago
- ☆17Aug 30, 2025Updated 6 months ago
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆12Apr 4, 2025Updated 11 months ago
- CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing☆14Jun 25, 2023Updated 2 years ago
- ☆18Apr 16, 2021Updated 4 years ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between…☆31Jun 12, 2023Updated 2 years ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Feb 27, 2026Updated last month
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for ECIR 2022 paper Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking☆26Jul 30, 2024Updated last year
- ☆18Oct 6, 2022Updated 3 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- Implementation of the GLOM model for text☆11Mar 4, 2021Updated 5 years ago
- An open-source framework for modeling real-time conversations in spoken dialogue systems.☆27Aug 12, 2022Updated 3 years ago
- AAAI 2022 paper - Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extraction☆17Dec 23, 2021Updated 4 years ago
- Simple TTF rasterizer☆11Mar 29, 2020Updated 6 years ago
- Guess the Hacker News titles☆12Mar 24, 2022Updated 4 years ago
- Attribute statements generated by LLMs to preceding tokens using attention weights.☆24Apr 22, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"☆18Oct 17, 2022Updated 3 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated 2 years ago
- Curated list of awesome datasets for various table understanding tasks☆18Sep 5, 2025Updated 6 months ago
- A prompt defence is a multi-layer defence that can be used to protect your applications against prompt injection attacks.☆21Mar 18, 2026Updated last week
- Kanban Tool Extension Development Kit☆10Feb 27, 2019Updated 7 years ago
- ☆27Mar 21, 2024Updated 2 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago