An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.
☆33Jan 15, 2025Updated last year
Alternatives and similar repositories for KazNERD
Users that are interested in KazNERD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NLP tools for Kazakh language☆52Nov 15, 2020Updated 5 years ago
- NLA-NU Kazakh Dependency Treebank☆10Dec 23, 2018Updated 7 years ago
- Open Source Kazakh Corpus☆21Apr 25, 2023Updated 3 years ago
- Apertium linguistic data for Kazakh☆23Nov 1, 2023Updated 2 years ago
- An open-source parallel corpus for machine translation across Kazakh, English, Russian, and Turkish☆17Mar 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTT…☆56Jul 30, 2021Updated 4 years ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆84Aug 21, 2023Updated 2 years ago
- An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has in…☆153Aug 1, 2025Updated 10 months ago
- Kyrgyz language processing software, models and datasets.☆33May 8, 2026Updated last month
- Headphone-use screening test developed by Chait lab (UCL). The JS version is implemented by Sijia Zhao.☆17Apr 6, 2022Updated 4 years ago
- Bayesian Assessment of Hypotheses☆26Jul 6, 2023Updated 2 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆25Aug 23, 2019Updated 6 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi…☆18May 2, 2025Updated last year
- This is the Placeholder for Llama. Starting with Llama 3☆11May 20, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆24Sep 25, 2024Updated last year
- Make N-Gram for Uyghur language☆15Dec 24, 2020Updated 5 years ago
- ☆13Dec 22, 2023Updated 2 years ago
- Rezonator: Dynamics of human engagement☆34Jun 6, 2026Updated last week
- Run NASA's General Mission Analysis Tool (GMAT) from Julia☆10Sep 3, 2020Updated 5 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆36Aug 1, 2025Updated 10 months ago
- i tried to solve as many tasks as possible to make my SQL skills better☆15Apr 26, 2024Updated 2 years ago
- This repo contains "Azure Data Engineer Associate" Questions and related docs.☆13Jan 29, 2024Updated 2 years ago
- Label dialogue with Dialogue Acts and Adjacency Pairs☆12Jun 20, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Jupyter notebooks for course "Computational Morphology with HFST".☆21Oct 5, 2022Updated 3 years ago
- Code and experiments for the COLING2020 paper "Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations".☆11Dec 9, 2020Updated 5 years ago
- ☆15Jul 2, 2020Updated 5 years ago
- Realtime Face detection demo using YOLO v2 and OpenCV DNN module☆17Mar 10, 2018Updated 8 years ago
- simple crawler for some uyghur website such as uy.ts.cn,bbs.bagdax.cn,www.bagdax.cn(using python and scrapy)☆11Oct 4, 2020Updated 5 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- List of useful telegram groups about IT in Kazakhstan // Полезные Казахстанские IT каналы и группы в телеграм☆140Jan 29, 2026Updated 4 months ago
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- a corpus containing 4.5K conversations from the Conversational Question-Answering dataset CoQA, for a total of 53K follow-up question-ans…☆16Jun 12, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Sep 9, 2021Updated 4 years ago
- Course materials for 11-767☆13Nov 10, 2022Updated 3 years ago
- Svelte 5 AI context files 👽☆14Updated this week
- ☆10May 26, 2022Updated 4 years ago
- Code of telegram bot for tracking your new subscribers☆11Sep 7, 2019Updated 6 years ago
- A docker for run Wine though VNC remote manage☆15Dec 29, 2018Updated 7 years ago
- Opal is a toolkit for that enables rapid deployment of scientific applications as Web services☆12Jul 11, 2022Updated 3 years ago