Toloka / crowd-kit
Control the quality of your labeled data with the Python tools you already know.
☆222Updated 3 months ago
Alternatives and similar repositories for crowd-kit:
Users that are interested in crowd-kit are comparing it to the libraries listed below
- Toloka-Kit is a Python library for working with Toloka API.☆207Updated 9 months ago
- Active learning☆78Updated 2 years ago
- BSNLP 2021☆33Updated 5 months ago
- A repository for Toloka tools.☆13Updated 10 months ago
- Russian Corpus of Linguistic Acceptability☆43Updated 6 months ago
- ☆12Updated 2 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆20Updated 3 weeks ago
- Interface for easier topic modelling.☆139Updated 8 months ago
- RuSimpleSentEval (RSSE) shared task repo☆21Updated 4 years ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆151Updated 4 months ago
- Question answering on russian with XLMRobertaLarge as a service☆21Updated 3 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆47Updated 3 weeks ago
- A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision☆91Updated 3 years ago
- Tools for shrinking fastText models (in gensim format)☆178Updated 11 months ago
- A Russian data set for question answering over Wikidata☆47Updated 3 years ago
- ☆83Updated 2 years ago
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Updated 2 years ago
- 2nd place solution for Next Like prediction task☆56Updated 2 years ago
- Примеры пропозалов для подачи заявки в Open.TLab☆27Updated 2 years ago
- RuREBus shared task repo☆30Updated 4 years ago
- A list of pretrained Transformer models for the Russian language.☆174Updated 5 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Updated 6 months ago
- MOdel ResOurCe COnsumption. Evaluate Russian SuperGLUE models performance: inference speed, RAM usage. Reproducible scores using Docker☆22Updated 2 years ago
- Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.☆125Updated 3 years ago
- A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision☆235Updated this week
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆35Updated 3 years ago
- Pipeline for fast building text classification TF-IDF + LogReg baselines.☆61Updated 3 years ago
- "Rossiya Segodnya" news dataset☆45Updated 5 years ago
- 🔬 Очистка датасетов от мусора (нормализация, препроцессинг)☆40Updated 4 years ago
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Updated last year