TheAtticusProject / maud
☆75 Updated last year
Related projects
Alternatives and complementary repositories for maud
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆65 Updated last year
- A Python library aimed at dissecting and augmenting NER training data. ☆56 Updated last year
- Generalist and Lightweight Model for Text Classification ☆49 Updated this week
- SpaCy wrapper for ConceptNet ☆88 Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆103 Updated 6 months ago
- ☆66 Updated this week
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆23 Updated 3 months ago
- A diff tool for language models ☆42 Updated 10 months ago
- Experiments with generating opensource language model assistants ☆97 Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆92 Updated last year
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆60 Updated last year
- Vespa application making an index of the CORD-19 dataset. ☆39 Updated this week
- Web UI & Backend for Data Annotations in Aya ☆26 Updated 8 months ago
- ☆48 Updated 2 weeks ago
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP ☆20 Updated 10 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆27 Updated 2 months ago
- Source code and data for Like a Good Nearest Neighbor ☆28 Updated 9 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆149 Updated 4 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆122 Updated 8 months ago
- Command Line Interface for Hugging Face Inference Endpoints ☆66 Updated 7 months ago
- Documentation effort for the BookCorpus dataset ☆33 Updated 3 years ago
- ☆45 Updated 2 years ago
- Efficiently find the best-suited language model (LM) for your NLP task ☆91 Updated this week
- ☆83 Updated 2 months ago
- ☆42 Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 Updated 8 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆72 Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors) ☆90 Updated 8 months ago
- The Python library with command line tools to interact with Dynabench (https://dynabench.org/), such as uploading models. ☆55 Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆151 Updated 5 months ago