Toloka / crowd-kit
Control the quality of your labeled data with the Python tools you already know.
☆221 Updated 2 months ago
Alternatives and similar repositories for crowd-kit:
Users who are interested in crowd-kit are comparing it to the libraries listed below:
- Toloka-Kit is a Python library for working with the Toloka API. ☆206 Updated 9 months ago
- Active learning ☆78 Updated 2 years ago
- Tools for shrinking fastText models (in gensim format) ☆178 Updated 11 months ago
- A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision ☆236 Updated 2 weeks ago
- A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision ☆91 Updated 2 years ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆151 Updated 3 months ago
- BSNLP 2021 ☆33 Updated 5 months ago
- 2nd place solution for Next Like prediction task ☆56 Updated 2 years ago
- Question answering in Russian with XLMRobertaLarge as a service ☆21 Updated 3 years ago
- Interface for easier topic modelling. ☆138 Updated 8 months ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora ☆20 Updated last month
- http://nlp.seas.harvard.edu/2018/04/03/attention.html ☆63 Updated 3 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper ☆47 Updated this week
- A repository for Toloka tools. ☆13 Updated 9 months ago
- Multi-label classification of cow diseases by text and symptoms recognition (NER) ☆12 Updated 2 years ago
- RuSimpleSentEval (RSSE) shared task repo ☆22 Updated 3 years ago
- Russian Corpus of Linguistic Acceptability ☆42 Updated 6 months ago
- Augmentex — a library for augmenting texts with errors ☆63 Updated 9 months ago
- RuTransform: Python framework for adversarial attacks and text data augmentation for Russian ☆19 Updated last year
- Pipeline for quickly building TF-IDF + LogReg text classification baselines. ☆61 Updated 3 years ago
- Infrastructure for starting a TG bot project. Postgres, Minio, Grafana, Alembic ☆21 Updated 2 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification ☆23 Updated last year
- Russian RoBERTa ☆29 Updated 5 years ago
- ☆57 Updated last year
- A Russian data set for question answering over Wikidata ☆47 Updated 3 years ago
- ☆12 Updated 2 years ago
- Russian Artificial Text Detection ☆17 Updated 2 years ago
- ABacus: fast hypothesis testing and experiment design solution ☆47 Updated 10 months ago
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events ☆28 Updated last year
- Train punctuation and capitalization models for different languages ☆24 Updated 3 years ago