An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.
☆32Jan 15, 2025Updated last year
Alternatives and similar repositories for KazNERD
Users that are interested in KazNERD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NLP tools for Kazakh language☆50Nov 15, 2020Updated 5 years ago
- NLP tools for Kazakh language☆35Apr 5, 2022Updated 4 years ago
- NLA-NU Kazakh Dependency Treebank☆10Dec 23, 2018Updated 7 years ago
- Open Source Kazakh Corpus☆20Apr 25, 2023Updated 2 years ago
- Apertium linguistic data for Kazakh☆21Nov 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An open-source parallel corpus for machine translation across Kazakh, English, Russian, and Turkish☆16Mar 29, 2024Updated 2 years ago
- the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTT…☆56Jul 30, 2021Updated 4 years ago
- An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has in…☆149Aug 1, 2025Updated 8 months ago
- Kyrgyz language processing software, models and datasets.☆33Dec 12, 2025Updated 4 months ago
- Bayesian Assessment of Hypotheses☆26Jul 6, 2023Updated 2 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆24Aug 23, 2019Updated 6 years ago
- LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)☆10Oct 18, 2021Updated 4 years ago
- ☆24Sep 25, 2024Updated last year
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repo for Turkish Wiki NER dataset.☆12Jul 11, 2023Updated 2 years ago
- This repository demonstrates how to implement a Django REST-based authentication system with the django-allauth and dj-rest-auth packages…☆13Mar 4, 2024Updated 2 years ago
- Make N-Gram for Uyghur language☆15Dec 24, 2020Updated 5 years ago
- ☆13Dec 22, 2023Updated 2 years ago
- A repository to keep tools, scripts, data for SMART task.☆11May 24, 2022Updated 3 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆35Aug 1, 2025Updated 8 months ago
- This repo contains "Azure Data Engineer Associate" Questions and related docs.☆13Jan 29, 2024Updated 2 years ago
- Label dialogue with Dialogue Acts and Adjacency Pairs☆12Jun 20, 2023Updated 2 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code and experiments for the COLING2020 paper "Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations".☆11Dec 9, 2020Updated 5 years ago
- A mock social networking platform made using Node.js and MongoDB.☆17Sep 19, 2021Updated 4 years ago
- simple crawler for some uyghur website such as uy.ts.cn,bbs.bagdax.cn,www.bagdax.cn(using python and scrapy)☆11Oct 4, 2020Updated 5 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- Source code for "An Empirical Study of Code Smells in Transformer-based Code Generation Techniques".☆11Oct 4, 2022Updated 3 years ago
- ☆19Apr 13, 2024Updated 2 years ago
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- a corpus containing 4.5K conversations from the Conversational Question-Answering dataset CoQA, for a total of 53K follow-up question-ans…☆16Jun 12, 2023Updated 2 years ago
- ☆10Sep 9, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Course materials for 11-767☆13Nov 10, 2022Updated 3 years ago
- examples of plugins for use by Coursera partners☆13Jun 18, 2025Updated 9 months ago
- ☆20May 1, 2019Updated 6 years ago
- ☆10May 26, 2022Updated 3 years ago
- Code of telegram bot for tracking your new subscribers☆11Sep 7, 2019Updated 6 years ago
- A docker for run Wine though VNC remote manage☆15Dec 29, 2018Updated 7 years ago
- Opal is a toolkit for that enables rapid deployment of scientific applications as Web services☆12Jul 11, 2022Updated 3 years ago