Dataset collected from popular Russian collective blog Habrahabr.ru
☆13Oct 24, 2016Updated 9 years ago
Alternatives and similar repositories for habrahabr-dataset
Users that are interested in habrahabr-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Краулеры для проекта Taiga Corpus и Taiga Parser, скачивание ресурсов из открытых источников☆14Apr 9, 2019Updated 7 years ago
- алгоритм, занявший второе место на кон курсе http://cardioqvark.ru/challenge/☆11Apr 3, 2016Updated 10 years ago
- ☆35Sep 20, 2017Updated 8 years ago
- VK-Top is used for getting popular posts of any public available page at VK.com☆38Mar 8, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- RUSSE: Russian Semantic Evaluation.☆15Mar 1, 2022Updated 4 years ago
- Mini-library for producing graph visualizations from embedding models☆28Sep 10, 2020Updated 5 years ago
- Recommender system test bench☆14Mar 8, 2019Updated 7 years ago
- Topic modeling with BigARTM: an interactive book☆61Dec 5, 2018Updated 7 years ago
- Large-Scale Graph Inference☆12Nov 6, 2024Updated last year
- ☆17Dec 16, 2015Updated 10 years ago
- Wikileaks Centipede Viewer to view Wikileaks emails☆10Jul 17, 2018Updated 7 years ago
- AdaGram (adaptive skip-gram) for Python☆74May 9, 2017Updated 9 years ago
- ☆26Feb 23, 2026Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Адаптивный сайт-конструктор для НКО☆10Oct 8, 2016Updated 9 years ago
- Natural language processing tools for English and Russian (postagging, syntax parsing, SRL, NER, language detection etc.)☆64Feb 5, 2026Updated 4 months ago
- A simple interface to the Project Gutenberg corpus.☆18Dec 23, 2015Updated 10 years ago
- A python implementation of the Habrahabr.ru API☆12Jun 28, 2016Updated 10 years ago
- Corpus of Russian news articles collected from Lenta.Ru☆145Nov 19, 2022Updated 3 years ago
- Interface for easier topic modelling.☆143Jul 29, 2024Updated last year
- A set of vulnerable PHP scripts used to test w3af's vulnerability detection features.☆28Apr 15, 2015Updated 11 years ago
- Командный проект "Закон Джунглей". Майнор ВШЭ "Интеллектуальный анализ данных", курс "Введение в программирование"☆13Oct 12, 2016Updated 9 years ago
- JupyterHub Playbook for the Computational Models class at Berkeley☆15Feb 19, 2016Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14May 13, 2020Updated 6 years ago
- NLP course @ CS Faculty, HSE☆15Mar 4, 2020Updated 6 years ago
- Convert GitHub to Habr or Dev Markdown with additional features☆23Jul 5, 2022Updated 3 years ago
- Yet another (very simple) approach for adversarial training.☆17Oct 2, 2017Updated 8 years ago
- ANYKS Spell-Checker☆19Jan 3, 2023Updated 3 years ago
- ☆18Mar 20, 2019Updated 7 years ago
- Classification and aggregation of russian news articles. University coursework.☆18Jan 21, 2019Updated 7 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Nov 24, 2014Updated 11 years ago
- Tools for fuzzy string search in text and dictionaries written in Java☆10Dec 24, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 8 years ago
- A sentiment analysis package for R.☆23Dec 10, 2023Updated 2 years ago
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 5 years ago
- Golang DKIM Verifier☆21May 2, 2017Updated 9 years ago
- Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources☆12Apr 12, 2018Updated 8 years ago
- basically all words, in a compressed form☆17Jan 9, 2023Updated 3 years ago
- Data and code for the experiments in the Outlier Detection task proposed by Camacho-Collados et al.☆13Aug 28, 2018Updated 7 years ago