Dataset collected from the popular Russian collective blog Habrahabr.ru
☆13Oct 24, 2016Updated 9 years ago
Alternatives and similar repositories for habrahabr-dataset
Users that are interested in habrahabr-dataset are comparing it to the libraries listed below.
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Crawlers for the Taiga Corpus and Taiga Parser project; download resources from open sources☆14Apr 9, 2019Updated 7 years ago
- Algorithm that took second place in the http://cardioqvark.ru/challenge/ competition☆11Apr 3, 2016Updated 10 years ago
- ☆35Sep 20, 2017Updated 8 years ago
- VK-Top is used for getting popular posts from any publicly available page on VK.com☆38Mar 8, 2023Updated 3 years ago
- RUSSE: Russian Semantic Evaluation.☆16Mar 1, 2022Updated 4 years ago
- Mini-library for producing graph visualizations from embedding models☆28Sep 10, 2020Updated 5 years ago
- Recommender system test bench☆14Mar 8, 2019Updated 7 years ago
- Topic modeling with BigARTM: an interactive book☆61Dec 5, 2018Updated 7 years ago
- Large-Scale Graph Inference☆12Nov 6, 2024Updated last year
- Simulation of nervous system☆10Oct 30, 2016Updated 9 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Natural language processing tools for English and Russian (postagging, syntax parsing, SRL, NER, language detection etc.)☆65Feb 5, 2026Updated 2 months ago
- A simple interface to the Project Gutenberg corpus.☆17Dec 23, 2015Updated 10 years ago
- Corpus of Russian news articles collected from Lenta.Ru☆145Nov 19, 2022Updated 3 years ago
- Interface for easier topic modelling.☆143Jul 29, 2024Updated last year
- A Parallel Russian-Simple Russian Dataset☆17Mar 30, 2023Updated 3 years ago
- A set of vulnerable PHP scripts used to test w3af's vulnerability detection features.☆28Apr 15, 2015Updated 11 years ago
- Coursera☆12Jan 26, 2016Updated 10 years ago
- ☆14May 13, 2020Updated 5 years ago
- NLP course @ CS Faculty, HSE☆15Mar 4, 2020Updated 6 years ago
- Convert GitHub to Habr or Dev Markdown with additional features☆23Jul 5, 2022Updated 3 years ago
- A complete computer science study plan to become a software engineer.☆12Feb 13, 2018Updated 8 years ago
- ANYKS Spell-Checker☆19Jan 3, 2023Updated 3 years ago
- Classification and aggregation of Russian news articles. University coursework.☆18Jan 21, 2019Updated 7 years ago
- My first habrahabr post☆14May 5, 2016Updated 9 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Nov 24, 2014Updated 11 years ago
- Audio captioning RNN model in Keras☆14Aug 27, 2016Updated 9 years ago
- Tools for fuzzy string search in text and dictionaries written in Java☆10Dec 24, 2015Updated 10 years ago
- Get Mixpanel event data to Pandas for custom analysis☆19Oct 13, 2018Updated 7 years ago
- Recommender systems in Python☆50Jan 21, 2015Updated 11 years ago
- Papers sources, pictures, presentations, and other stuff☆24Updated this week
- A DockerSwarm Jupyterhub setup, which uses a NFS Server running in a Docker Container for persistent storage☆20Apr 15, 2018Updated 8 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 8 years ago
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 5 years ago
- Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources☆12Apr 12, 2018Updated 8 years ago
- basically all words, in a compressed form☆17Jan 9, 2023Updated 3 years ago
- Data and code for the experiments in the Outlier Detection task proposed by Camacho-Collados et al.☆13Aug 28, 2018Updated 7 years ago
- Evaluation tools for the RUSSE evaluation campaign.☆37Jun 11, 2017Updated 8 years ago