dayyass / latent-semantic-analysis
Pipeline for training LSA models using Scikit-Learn.
☆24Updated 2 years ago
Related projects: ⓘ
- Pipeline for fast building text classification TF-IDF + LogReg baselines.☆63Updated 2 years ago
- Git Hooks Tutorial.☆18Updated 2 years ago
- Distributed File System written in Python☆15Updated 2 years ago
- Graph-Based Clustering using connected components and spanning trees.☆26Updated 2 years ago
- Pipeline for training Stanford Seq2Seq Neural Machine Translation using PyTorch.☆13Updated 3 years ago
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆52Updated 3 years ago
- Pipeline for training Language Models using PyTorch.☆12Updated 2 years ago
- Reinforcement Learning Library.☆29Updated 2 years ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Updated 3 years ago
- Pipeline for training NER models using PyTorch.☆54Updated 2 years ago
- ☆23Updated last year
- nlp workshop at datafest siberia 2019☆22Updated last year
- ☆21Updated 3 years ago
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆30Updated 2 years ago
- Layers, datasets and utilities for PyTorch☆10Updated 9 months ago
- Official baseline solutions to Yandex Cup ML challenge☆30Updated 2 years ago
- Project on solving the Calculus of variations problems using symbolic mathematics (2018).☆16Updated 7 months ago
- RuREBus shared task repo☆30Updated 3 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆22Updated last year
- ☆17Updated 3 years ago
- Question answering on russian with XLMRobertaLarge as a service☆21Updated 2 years ago
- ☆25Updated 2 months ago
- Russian RoBERTa☆29Updated 4 years ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Updated last month
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago
- A barebones (Distil)BERT pipeline for token classification tasks driven by catalyst☆13Updated 4 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10Updated last year
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆17Updated last year
- Infrastructure for starting TG bot project. Postgres, Minio, Grafana, Alembic☆21Updated 2 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆20Updated 2 years ago