A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark
☆32Feb 20, 2026Updated last week
Alternatives and similar repositories for clemcore
Users that are interested in clemcore are comparing it to the libraries listed below
Sorting:
- ☆12Nov 5, 2024Updated last year
- ☆14May 7, 2025Updated 9 months ago
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated 2 months ago
- Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…☆12Oct 21, 2022Updated 3 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Sep 8, 2022Updated 3 years ago
- Python library to add support for embedding natural code in Python with shared program state.☆23Jan 20, 2026Updated last month
- Video production for developers☆34Feb 20, 2026Updated last week
- ☆19Sep 16, 2025Updated 5 months ago
- Do Multilingual Language Models Think Better in English?☆42Aug 3, 2023Updated 2 years ago
- PyPremise - Python tool for the Premise algorithm to identify patterns or explanations of where a machine learning classifier performs we…☆22Oct 27, 2025Updated 4 months ago
- A prompt defence is a multi-layer defence that can be used to protect your applications against prompt injection attacks.☆21Dec 12, 2025Updated 2 months ago
- AFlow & MathAI☆19Feb 24, 2025Updated last year
- ☆52Jul 4, 2023Updated 2 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated last year
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 8 months ago
- ☆27Mar 21, 2024Updated last year
- A collection of beautiful, ready-made Liquid Glass UI components you can preview, copy, and drop into any web app. It offers a refined fr…☆73Feb 23, 2026Updated last week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Apr 17, 2025Updated 10 months ago
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆26Mar 3, 2025Updated last year
- ☆37Oct 15, 2024Updated last year
- A set of tools to create synthetically-generated data from documents☆39Aug 15, 2025Updated 6 months ago
- A simple, 100% Rust implementation of a vector storage database with on disk persistency.☆31Jul 5, 2024Updated last year
- Code for ECIR 2022 paper Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking☆25Jul 30, 2024Updated last year
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between…☆31Jun 12, 2023Updated 2 years ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- Ludus FastMCP enables AI-powered management of Ludus cyber ranges through natural language commands. The server exposes **157 tools** acr…☆72Dec 31, 2025Updated 2 months ago
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- Mod merging tool for The Witcher 3: Wild Hunt [C++, Qt5]☆12Nov 4, 2016Updated 9 years ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Feb 19, 2026Updated last week
- TOON as DSPy adapter☆25Feb 1, 2026Updated last month
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 9 months ago
- Low code framework to build and launch a crew of AI agents with shared state. Built with https://axllm.dev.☆41Feb 11, 2026Updated 3 weeks ago
- ASTRA is an end-to-end system for synthesizing agentic trajectories and rule-verifiable environments for SFT and RL training, developed b…☆114Jan 30, 2026Updated last month
- ☆89Oct 7, 2025Updated 4 months ago
- MCP server for Grok AI API integration☆21Jun 2, 2025Updated 9 months ago
- ☆17Jun 8, 2025Updated 8 months ago