lolpa1n / digital-peter-ocrv
1st place (public LB) solution of AIJ2020 Sberbank competition (Digital Peter)
☆18 Updated 3 years ago
Related projects
Alternatives and complementary repositories for digital-peter-ocrv
- Repository containing our datasets for the HTR (handwritten text recognition) task. ☆24 Updated 2 years ago
- Official baseline solutions to Yandex Cup ML challenge ☆31 Updated 3 years ago
- Question answering in Russian with XLMRobertaLarge as a service ☆21 Updated 3 years ago
- Train punctuation and capitalization models for different languages ☆24 Updated 2 years ago
- Russian RoBERTa ☆29 Updated 4 years ago
- ☆42 Updated last year
- https://arxiv.org/abs/2201.06499 ☆28 Updated 7 months ago
- Augmentex, a library for augmenting texts with errors ☆52 Updated 4 months ago
- Some augmentations that I haven't found in other repositories and libraries. ☆26 Updated last year
- An easy-to-run OCR model pipeline based on CRNN and CTC loss ☆45 Updated last year
- Text reading pipeline that combines segmentation and OCR models. ☆26 Updated last year
- Gazeta: Dataset for automatic summarization of Russian news ☆32 Updated 3 years ago
- Russian dialog dataset parsers and crawlers. ☆16 Updated 3 years ago
- NLP workshop at Datafest Siberia 2019 ☆22 Updated last year
- A project for converting numbers written out as text in Russian. ☆100 Updated 3 years ago
- Pipeline for fast building of TF-IDF + LogReg text classification baselines. ☆63 Updated 3 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification ☆22 Updated last year
- ☆13 Updated last year
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper ☆46 Updated 2 months ago
- A small library with distillation, quantization and pruning pipelines ☆26 Updated 3 years ago
- Accentor and transcriptor for the Russian language ☆118 Updated 2 years ago
- Infrastructure for starting a TG bot project. Postgres, Minio, Grafana, Alembic ☆21 Updated 2 years ago
- CLIP implementation for the Russian language ☆139 Updated last year
- RuTransform: Python framework for adversarial attacks and text data augmentation for Russian ☆17 Updated last year
- Probing suite for evaluation of Russian embedding and language models ☆32 Updated last month
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora ☆20 Updated 2 years ago
- Examples of distributed machine learning using the AICloud service ☆34 Updated last month
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture. ☆93 Updated last year
- Git Hooks Tutorial. ☆18 Updated 2 years ago