Toloka/crowd-kit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Toloka/crowd-kit)

Toloka / crowd-kit

Control the quality of your labeled data with the Python tools you already know.

☆251

Alternatives and similar repositories for crowd-kit

Users that are interested in crowd-kit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Toloka / toloka-kit
View on GitHub
Toloka-Kit is a Python library for working with Toloka API.
☆212Jul 2, 2024Updated 2 years ago
maqqbu / MMSR
View on GitHub
The code for NeurIPS 2020 paper: Adversarial Crowdsourcing Through Robust Rank-One Matrix Completion.
☆10Oct 26, 2020Updated 5 years ago
pilot7747 / VoxDIY
View on GitHub
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Jul 22, 2021Updated 4 years ago
Toloka / BestPrompts
View on GitHub
Best Prompts for Text-to-Image Models
☆25Jan 20, 2024Updated 2 years ago
heolin / agreement
View on GitHub
Implementation of popular agreement metrics such as Cohen kappa, Fleiss kappa, Krippendorff alpha
☆16Apr 2, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fmpr / CrowdLayer
View on GitHub
A neural network layer that enables training of deep neural networks directly from crowdsourced labels (e.g. from Amazon Mechanical Turk)…
☆69Dec 13, 2021Updated 4 years ago
utir / square-2.0
View on GitHub
SQUARE-2.0 (Statistical QUality Assurance Robustness Evaluation)
☆20Oct 26, 2015Updated 10 years ago
UKPLab / arxiv2018-bayesian-ensembles
View on GitHub
☆27Dec 23, 2023Updated 2 years ago
TrentoCrowdAI / crowdsourced-datasets
View on GitHub
Crowdsourced datasets including the individual crowd votes.
☆45Feb 3, 2020Updated 6 years ago
zhydhkcws / crowd_truth_infer
View on GitHub
This is the framework with 17 existing crowdsourced truth inference algorithms.
☆29Nov 17, 2017Updated 8 years ago
nlpub / hyperstar
View on GitHub
Hyperstar: Negative Sampling Improves Hypernymy Extraction Based on Projection Learning.
☆24Jan 2, 2020Updated 6 years ago
pilot7747 / sldl
View on GitHub
Single-line inference of SOTA deep learning models
☆28Jan 22, 2023Updated 3 years ago
UKPLab / nessie
View on GitHub
Automatically detect errors in annotated corpora.
☆48Sep 8, 2023Updated 2 years ago
AIRI-Institute / AI4TALK
View on GitHub
☆13Dec 7, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
deeppavlov / ru_sentence_tokenizer
View on GitHub
A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.
☆52Jul 4, 2018Updated 8 years ago
LuisaMaerz / KnowMAN
View on GitHub
KnowMAN: Weakly Supervised Multinomial Adversarial Networks
☆12Nov 9, 2021Updated 4 years ago
vladislavneon / RuBQ
View on GitHub
A Russian data set for question answering over Wikidata
☆51Jun 6, 2021Updated 5 years ago
zhuowangsylu / ColluEagle
View on GitHub
Group review spammer detection
☆10Sep 9, 2019Updated 6 years ago
ritikamangla / QSalience
View on GitHub
https://arxiv.org/abs/2404.10917
☆14Mar 18, 2025Updated last year
argilla-io / distilabel-spin-dibt
View on GitHub
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Mar 12, 2024Updated 2 years ago
blester125 / iobes
View on GitHub
Tool for parsing and converting various span encoding schemes.
☆23Jan 13, 2024Updated 2 years ago
multitel-ai / urban-sound-tagging
View on GitHub
1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context
☆16Dec 8, 2022Updated 3 years ago
yandex-research / RuLeanALBERT
View on GitHub
RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.
☆92May 27, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
Esmidth / CCS_TA_implement
View on GitHub
a implement and derivation of "CCS-TA: quality-guaranteed online task allocation in compressive crowdsensing"
☆12Jun 12, 2022Updated 4 years ago
fchest / Speech-Transformer-multi-GPUs
View on GitHub
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10Dec 25, 2019Updated 6 years ago
BinWang28 / Sentence-Embedding-S3E
View on GitHub
Efficient Sentence Embedding via Semantic Subspace Analysis
☆14Feb 25, 2020Updated 6 years ago
vinid / quica
View on GitHub
quica is a tool to run inter coder agreement pipelines in an easy and effective ways. Multiple measures are run and results are collected…
☆23Nov 9, 2020Updated 5 years ago
Illumaria / made-computer-vision
View on GitHub
Computer Vision course materials
☆23Jun 10, 2021Updated 5 years ago
IlyaGusev / russ
View on GitHub
Package for word stress detection
☆11Jan 27, 2023Updated 3 years ago
oleges1 / quartznet-pytorch
View on GitHub
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
☆27Jul 16, 2021Updated 5 years ago
lab260ru / balalaika
View on GitHub
[INTERSPEECH 2026] Official code for "Balalaika: Data-Centric, Prosody-Aware Annotation Pipeline for Russian Speech"
☆21Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AliOsm / semantic-question-similarity
View on GitHub
Official implementation of: Tha3aroon at NSURL-2019 Task 8: Semantic Question Similarity in Arabic
☆13Aug 2, 2024Updated last year
google / timecast
View on GitHub
Performant, composable online learning
☆16Feb 22, 2021Updated 5 years ago
VolodymyrPavliukevych / YoutubeSummarizer
View on GitHub
Simple summarize ML model
☆16Dec 21, 2018Updated 7 years ago
RossiyaSegodnya / ria_news_dataset
View on GitHub
"Rossiya Segodnya" news dataset
☆46Sep 25, 2019Updated 6 years ago
gergelyk / peepshow
View on GitHub
Python data explorer.
☆12Dec 13, 2024Updated last year
shreyasl10 / Blockchain-based-MCS-system
View on GitHub
A decentralized and privacy preserving Mobile Crowdsensing system based on Blockchain Oracles.
☆10May 23, 2021Updated 5 years ago
nlpub / rdt
View on GitHub
RDT: Russian Distributional Thesaurus (Русский Дистрибутивный Тезаурус)
☆30Feb 28, 2019Updated 7 years ago