DigitalHarborFoundation / FlexEval
FlexEval is an LLM evaluation tool designed for practical quantitative analysis.
☆12 · Updated last week
Alternatives and similar repositories for FlexEval
Users who are interested in FlexEval are comparing it to the libraries listed below.
- SpanMarker for Named Entity Recognition ☆446 · Updated 7 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆434 · Updated last year
- Code examples and jupyter notebooks for the Cohere Platform ☆505 · Updated 6 months ago
- A collection of datasets and tasks for legal machine learning ☆391 · Updated last year
- The prime repository for state-of-the-art Multilingual Question Answering research and development. ☆736 · Updated 7 months ago
- Late Interaction Models Training & Retrieval ☆524 · Updated this week
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆98 · Updated 3 months ago
- ☆33 · Updated 2 years ago
- Clustering sentence embeddings to extract message intent ☆175 · Updated 3 years ago
- ☆47 · Updated this week
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3 ☆323 · Updated 2 years ago
- 🦙 Integrating LLMs into structured NLP pipelines ☆1,296 · Updated 7 months ago
- An open science effort to benchmark legal reasoning in foundation models ☆463 · Updated 11 months ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆223 · Updated 2 years ago
- Deliver safe & effective language models ☆532 · Updated last week
- ⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍 ☆581 · Updated this week
- Neural Search ☆363 · Updated 5 months ago
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document … ☆186 · Updated last year
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings ☆63 · Updated 5 months ago
- ☆207 · Updated last year
- SPLADE: sparse neural search (SIGIR21, SIGIR22) ☆884 · Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆327 · Updated 9 months ago
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024) ☆443 · Updated 2 weeks ago
- Prompt templating and versioning using jinja2 and litellm 🔥 ☆16 · Updated last year
- ☆167 · Updated last year
- ☆367 · Updated last year
- The robust European language model benchmark. ☆114 · Updated last week
- ☆195 · Updated last year
- Bayesian IRT models in Python ☆150 · Updated last month
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment" ☆366 · Updated last year