Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке
☆36Oct 6, 2021Updated 4 years ago
Alternatives and similar repositories for gazeta
Users that are interested in gazeta are comparing it to the libraries listed below
Sorting:
- ☆18Jun 18, 2021Updated 4 years ago
- Russian coreference resolution competition☆11Mar 24, 2023Updated 2 years ago
- Models for automatic abstractive summarization☆174Jul 3, 2022Updated 3 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆73Jul 24, 2023Updated 2 years ago
- Russian coreference resolution made as simple and accessible as could be☆12Sep 3, 2022Updated 3 years ago
- Repository with illustrations for cft-contest-2018☆12Nov 22, 2018Updated 7 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Feb 2, 2026Updated last month
- Mini-library for producing graph visualizations from embedding models☆28Sep 10, 2020Updated 5 years ago
- Meeting summary from meeting transcript using LLM via OpenAI-like completion API☆43Dec 2, 2025Updated 3 months ago
- Код для файнтюна LM (rugpt, LLaMa, FRED T5) средствами transformers + deepspeed + LoRa☆14May 22, 2023Updated 2 years ago
- Noise-Contrastive Visualization☆55Nov 25, 2023Updated 2 years ago
- Скрипты с примерами кода из книги "Визуализация данных с помощью ggplot2"☆10Jan 7, 2019Updated 7 years ago
- ☆14Apr 13, 2020Updated 5 years ago
- Russian dialog datasets parsers and crawlers.☆15Sep 6, 2021Updated 4 years ago
- The broad index of NLP resources for Eastern European languages. The best EEML 2021 project.☆19Jun 24, 2022Updated 3 years ago
- Deep Learning based NLP modeling for Russian language☆242Jul 24, 2023Updated 2 years ago
- Links to Russian corpora + Python functions for loading and parsing☆309Feb 9, 2026Updated 3 weeks ago
- ☆33Sep 20, 2017Updated 8 years ago
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Jun 27, 2023Updated 2 years ago
- ☆17Oct 9, 2023Updated 2 years ago
- Sberbank Data Science Journey 2018 LightGBM Baseline☆20Oct 1, 2018Updated 7 years ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Jul 25, 2024Updated last year
- The tiniest sentence encoder for Russian language☆247Jul 25, 2024Updated last year
- 🔬 Очистка датасетов от мусора (нормализация, препроцессинг)☆41Mar 18, 2021Updated 4 years ago
- Russian SuperGLUE benchmark☆112Jun 12, 2023Updated 2 years ago
- ANYKS Spell-Checker☆19Jan 3, 2023Updated 3 years ago
- Quick and dirty date parsing Python library to parse HTML dates really fast☆21Jan 3, 2026Updated 2 months ago
- ☆56Mar 10, 2021Updated 4 years ago
- Библиотека для извлечения статистик из текстов на русском языке.☆125Jan 21, 2023Updated 3 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Jul 21, 2023Updated 2 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆50Apr 2, 2025Updated 11 months ago
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- ☆25Jul 11, 2024Updated last year
- ☆30Dec 25, 2022Updated 3 years ago
- A Russian data set for question answering over Wikidata☆49Jun 6, 2021Updated 4 years ago
- Inspired by word2vec-pride-vis the replacement of words of Russian most valuable novels text with closest word2vec model words. By Boris …☆49Aug 1, 2024Updated last year
- ☆65Jan 20, 2026Updated last month
- MOdel ResOurCe COnsumption. Evaluate Russian SuperGLUE models performance: inference speed, RAM usage. Reproducible scores using Docker☆24Oct 11, 2022Updated 3 years ago