TheAtticusProject / maud
☆77 · Updated 2 years ago
Alternatives and similar repositories for maud:
Users who are interested in maud are comparing it to the libraries listed below.
- A Python library aimed at dissecting and augmenting NER training data. ☆58 · Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆65 · Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆104 · Updated 9 months ago
- Generalist and Lightweight Model for Text Classification ☆90 · Updated this week
- Vespa application making an index of the CORD-19 dataset. ☆39 · Updated last month
- SpaCy wrapper for ConceptNet ☆90 · Updated last year
- ☆41 · Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆151 · Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset. ☆93 · Updated 2 years ago
- Command Line Interface for Hugging Face Inference Endpoints ☆66 · Updated 11 months ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆60 · Updated last year
- ☆42 · Updated last year
- Reference-Free automatic summarization evaluation with potential hallucination detection ☆100 · Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 · Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 · Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and software ☆66 · Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆28 · Updated 2 months ago
- Information extraction from English and German texts based on predicate logic ☆135 · Updated last year
- ☆47 · Updated last year
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP ☆21 · Updated last year
- Explainable Zero-Shot Topic Extraction ☆62 · Updated 6 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆330 · Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆189 · Updated 5 months ago
- Experiments with generating open-source language model assistants ☆97 · Updated last year
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… ☆108 · Updated 6 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated 11 months ago
- Dataset collection and preprocessing framework for NLP extreme multitask learning ☆175 · Updated 2 months ago
- Pre-train Static Word Embeddings ☆48 · Updated this week
- This is the repo for the container that holds the models for the text2vec-transformers module ☆49 · Updated last month
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆40 · Updated 3 years ago