A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
☆32 Apr 15, 2026 Updated last month
Alternatives and similar repositories for clemcore
Users that are interested in clemcore are comparing it to the libraries listed below.
- Repository for "Training Language Models To Explain Their Own Computations" ☆22 Dec 22, 2025 Updated 4 months ago
- Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif… ☆12 Oct 21, 2022 Updated 3 years ago
- This project collects methods that enhance the comparison between AMR graphs. ☆18 Jun 15, 2023 Updated 2 years ago
- ☆19 Apr 26, 2026 Updated 3 weeks ago
- ☆19 Apr 22, 2026 Updated 3 weeks ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding" ☆14 Sep 8, 2022 Updated 3 years ago
- Do Multilingual Language Models Think Better in English? ☆42 Aug 3, 2023 Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021) ☆11 Oct 25, 2021 Updated 4 years ago
- ☆21 Feb 22, 2025 Updated last year
- A Test Collection of Computer Science Papers for Faceted Query by Example ☆23 Nov 28, 2021 Updated 4 years ago
- ☆12 Nov 5, 2024 Updated last year
- VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts. ☆12 Dec 20, 2018 Updated 7 years ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models ☆12 May 30, 2025 Updated 11 months ago
- Analysis of the Pegida Facebook corpus ☆10 Jan 31, 2015 Updated 11 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. ☆17 Apr 25, 2021 Updated 5 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales" ☆16 Aug 11, 2023 Updated 2 years ago
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl… ☆12 Apr 4, 2025 Updated last year
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021) ☆15 Feb 24, 2026 Updated 2 months ago
- ☆19 Apr 16, 2021 Updated 5 years ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between… ☆31 Jun 12, 2023 Updated 2 years ago
- Find informative examples to efficiently (human)-evaluate NLG models. ☆18 Apr 22, 2026 Updated 3 weeks ago
- Code for ECIR 2022 paper Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking ☆25 Jul 30, 2024 Updated last year
- ☆18 Oct 6, 2022 Updated 3 years ago
- Train and run transformers directly on Apple's Neural Engine in Swift, bypassing Core ML entirely ☆108 Apr 18, 2026 Updated last month
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning… ☆22 Nov 2, 2021 Updated 4 years ago
- Implementation of the GLOM model for text ☆11 Mar 4, 2021 Updated 5 years ago
- An open-source framework for modeling real-time conversations in spoken dialogue systems. ☆27 Aug 12, 2022 Updated 3 years ago
- Simple TTF rasterizer ☆11 Mar 29, 2020 Updated 6 years ago
- AAAI 2022 paper - Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extraction ☆17 Dec 23, 2021 Updated 4 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning ☆31 Mar 5, 2024 Updated 2 years ago
- A framework for evaluating Machine Translation models. ☆12 Apr 21, 2026 Updated last month
- Extract clips of audio and video from source files according to a DaVinci Resolve 16 exported EDL file, using ffmpeg ☆16 Dec 4, 2024 Updated last year
- Record animations on HTML5 canvas ☆14 Apr 16, 2024 Updated 2 years ago
- Attribute statements generated by LLMs to preceding tokens using attention weights. ☆26 Apr 22, 2025 Updated last year
- Kanban Tool Extension Development Kit ☆10 Feb 27, 2019 Updated 7 years ago
- ☆27 Mar 21, 2024 Updated 2 years ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022 ☆13 Oct 20, 2022 Updated 3 years ago
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?" ☆21 Oct 13, 2020 Updated 5 years ago
- ☆22 Jul 31, 2012 Updated 13 years ago