Toloka / crowd-kitLinks
Control the quality of your labeled data with the Python tools you already know.
☆227Updated last month
Alternatives and similar repositories for crowd-kit
Users that are interested in crowd-kit are comparing it to the libraries listed below
Sorting:
- Toloka-Kit is a Python library for working with Toloka API.☆210Updated 11 months ago
- Active learning☆78Updated 2 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆21Updated 2 months ago
- Interface for easier topic modelling.☆140Updated 10 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆155Updated 6 months ago
- Question answering on russian with XLMRobertaLarge as a service☆21Updated 3 years ago
- Pipeline for fast building text classification TF-IDF + LogReg baselines.☆62Updated 3 years ago
- ☆12Updated 3 years ago
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Updated last year
- Tools for shrinking fastText models (in gensim format)☆178Updated last year
- ☆29Updated 2 years ago
- Russian Corpus of Linguistic Acceptability☆44Updated 8 months ago
- ☆57Updated last year
- BSNLP 2021☆33Updated 7 months ago
- A repository for Toloka tools.☆13Updated last year
- A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision☆238Updated 2 weeks ago
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆31Updated last year
- 2nd place solution for Next Like prediction task☆57Updated 2 years ago
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Updated 2 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Updated last year
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆157Updated 2 years ago
- Russian RoBERTa☆29Updated 5 years ago
- Train punctuation and capitalization models for different languages☆25Updated 3 years ago
- Russian coreference resolution competition☆11Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 8 months ago
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆35Updated 3 years ago
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆51Updated 3 years ago
- Framework for probing tasks☆27Updated last year
- RuSimpleSentEval (RSSE) shared task repo☆21Updated 4 years ago
- nlp workshop at datafest siberia 2019☆22Updated 2 years ago