Toloka / crowd-kit
Control the quality of your labeled data with the Python tools you already know.
☆215Updated last month
Alternatives and similar repositories for crowd-kit:
Users that are interested in crowd-kit are comparing it to the libraries listed below
- Toloka-Kit is a Python library for working with Toloka API.☆203Updated 7 months ago
- Active learning☆78Updated 2 years ago
- A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision☆234Updated this week
- A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision☆91Updated 2 years ago
- BSNLP 2021☆33Updated 3 months ago
- RuSimpleSentEval (RSSE) shared task repo☆22Updated 3 years ago
- Question answering on russian with XLMRobertaLarge as a service☆21Updated 3 years ago
- Augmentex — a library for augmenting texts with errors☆61Updated 7 months ago
- A repository for Toloka tools.☆13Updated 8 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆147Updated 2 months ago
- 2nd place solution for Next Like prediction task☆54Updated 2 years ago
- Tools for shrinking fastText models (in gensim format)☆175Updated 9 months ago
- Probing suite for evaluation of Russian embedding and language models☆33Updated 4 months ago
- Pipeline for fast building text classification TF-IDF + LogReg baselines.☆62Updated 3 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆22Updated last year
- Interface for easier topic modelling.☆138Updated 6 months ago
- MOdel ResOurCe COnsumption. Evaluate Russian SuperGLUE models performance: inference speed, RAM usage. Reproducible scores using Docker☆22Updated 2 years ago
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Updated last year
- Efficient DL/ML Models Seminars☆27Updated last month
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆52Updated 3 years ago
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆34Updated 3 years ago
- Курс по глубокому обучению в обработке естественных языков для магистров компьютерной лингвистики Высшей Школы Экономики☆47Updated 2 years ago
- Russian Corpus of Linguistic Acceptability☆42Updated 4 months ago
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆27Updated last year
- Framework for probing tasks☆25Updated 10 months ago
- YSDA course in Speech Processing.☆219Updated this week
- nlp workshop at datafest siberia 2019☆22Updated 2 years ago
- A Russian data set for question answering over Wikidata☆47Updated 3 years ago
- ☆12Updated 2 years ago
- Russian dialog datasets parsers and crawlers.☆16Updated 3 years ago