Custom Russian tokenizer for spaCy
☆44May 14, 2019Updated 7 years ago
Alternatives and similar repositories for spacy_russian_tokenizer
Users that are interested in spacy_russian_tokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Russian language models for spaCy☆240Jul 14, 2021Updated 4 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- nlp workshop at datafest siberia 2019☆22Dec 8, 2022Updated 3 years ago
- Morphological Analyzer for Russian 💬☆41Jul 14, 2021Updated 4 years ago
- Pre-trained models for tokenization, sentence segmentation and so on☆15Aug 22, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Comparing quality and performance of NLP systems for Russian language☆50Jul 24, 2023Updated 2 years ago
- ☆18May 8, 2018Updated 8 years ago
- Узнай, хорошо или плохо говорят о тебе или твоей фирме в Интернете! Наша "Сорока" с искусственным интеллектом принесёт тебе это на своём …☆19May 24, 2018Updated 8 years ago
- RuREBus shared task repo☆28Jan 18, 2021Updated 5 years ago
- Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке☆42Oct 10, 2025Updated 8 months ago
- Russian Law as Open Data☆62May 15, 2026Updated last month
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- ☆56May 12, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Samsung Natural Language Processing Pipeline (basically for Russian language): morphology, dependency parser and much more☆59Oct 3, 2020Updated 5 years ago
- Links to Russian corpora + Python functions for loading and parsing☆313Apr 21, 2026Updated last month
- Part-of-Speech Tagger for Russian language☆23Jul 29, 2020Updated 5 years ago
- ☆50Nov 20, 2017Updated 8 years ago
- Wrapper of DevExtreme components for Plotly Dash☆11Mar 31, 2020Updated 6 years ago
- a small collection of models implemented in keras, including matrix factorization(recommendation system), topic modeling, text classifica…☆14Jul 12, 2017Updated 8 years ago
- KataLib are many programs in one Application: Librarian, Player, YouTube downloader, Converter, MetaData editor and more..☆17Updated this week
- python port of arc90's readability bookmarklet, updated to match latest readability.js!☆19Sep 13, 2011Updated 14 years ago
- Official releases of the TOROT treebank☆10Jan 16, 2020Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Term extraction for Russian language☆91Dec 1, 2018Updated 7 years ago
- PyCon 2016 Tutorial Session -- Making Connections with Natural Language Processing☆12May 26, 2016Updated 10 years ago
- Compact high quality word embeddings for Russian language☆219Apr 13, 2026Updated 2 months ago
- Data and code for the experiments in: "German in Flux: Detecting Metaphoric Change via Word Entropy". Dominik Schlechtweg, Stefanie Eckma…☆10Aug 26, 2019Updated 6 years ago
- ☆35Dec 8, 2022Updated 3 years ago
- A programming language for generative music composition using cellular automata☆17Apr 17, 2013Updated 13 years ago
- Программирование и теория алгоритмов 2019-2020, ФиКЛ ВШЭ☆12Jun 9, 2020Updated 6 years ago
- Topic modeling with BigARTM: an interactive book☆61Dec 5, 2018Updated 7 years ago
- Solves basic Russian NLP tasks, API for lower level Natasha projects☆1,336Apr 13, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Database for experiments with russian voxforge audio data (http://voxforge.org/ru/downloads).☆14Aug 31, 2021Updated 4 years ago
- Hungarian tokenizer.☆14Mar 15, 2022Updated 4 years ago
- A system for word sense induction and disambiguation based on JoBimText approach☆16Feb 14, 2018Updated 8 years ago
- NLP for Proteins - A paper collection☆13Sep 10, 2020Updated 5 years ago
- Rule-based token, sentence segmentation for Russian language☆284Apr 13, 2026Updated 2 months ago
- ☆14Dec 20, 2021Updated 4 years ago
- ☆30Aug 25, 2021Updated 4 years ago