AzBuki-ML / public-data
Custom-built Bulgarian language data sets, used by АзБуки.ML for sentiment analysis, text classification, summarisation and generation. Open-source & free to use in any ML project.
☆18Updated last year
Alternatives and similar repositories for public-data:
Users who are interested in public-data are comparing it to the repositories listed below
- Collection and resources for Bulgarian Corpus, Datasets and Models used in ASR, TTS or NLP tasks together with the links of corresponding…☆24Updated 4 years ago
- Bulgarian wordlists (a list of words in the Bulgarian language)☆85Updated 2 years ago
- Ancient Greek lemmatisation tool☆22Updated 3 years ago
- Shobhika is a Devanāgarī font for scholars.☆36Updated 5 years ago
- Ponomar: a liturgics suite for the Orthodox Church☆39Updated 2 months ago
- Extract data from German Wiktionary XML files.☆26Updated 3 weeks ago
- Tool(s) to help read Sanskrit (and other) metrical verse☆73Updated 3 months ago
- ☆10Updated 6 years ago
- Libraries and command-line tools for metrical analysis of epic Greek hexameter☆26Updated 6 years ago
- Open morphology for Finnish☆87Updated last week
- "Fundamentals of Computer Programming with C#" Book☆13Updated 4 years ago
- Morphological Dictionaries for German Language☆28Updated 6 years ago
- ☆18Updated 3 weeks ago
- A PHP library for comparing two or more Sanskrit TEI XML files and generating an apparatus with variants☆11Updated 3 months ago
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆33Updated last month
- Lexical data at Unicode☆67Updated 5 months ago
- All the words from Google Books, sorted by frequency☆112Updated last year
- Notes and useful code snippets for the Design and Analysis of Algorithms exercises, 2024.☆18Updated 10 months ago
- Corpus of Egyptian Texts for the AED - Ancient Egyptian Dictionary☆16Updated last year
- A dictionary of carefully selected translations of common terms from the IT world. Suggestions are welcome. Read below how you can…☆209Updated this week
- Bunachar Náisiúnta Moirfeolaíochta | Irish National Morphology Database☆22Updated 7 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆151Updated 2 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆29Updated 3 years ago
- German part-of-speech dictionary☆43Updated last year
- Browser extension adding shortcuts to DWDS queries☆8Updated last month
- Data for the quantitative study of (Vedic) Sanskrit☆116Updated 3 months ago
- Neural based model for automatic diacritics restoration.☆26Updated 6 years ago
- A reliable diacritics database with their associated ASCII characters☆11Updated 4 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆22Updated 2 years ago