An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.
☆33Jan 15, 2025Updated last year
Alternatives and similar repositories for KazNERD
Users that are interested in KazNERD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NLP tools for Kazakh language☆36Apr 5, 2022Updated 4 years ago
- Open Source Kazakh Corpus☆21Apr 25, 2023Updated 3 years ago
- An open-source parallel corpus for machine translation across Kazakh, English, Russian, and Turkish☆17Mar 29, 2024Updated 2 years ago
- the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTT…☆56Jul 30, 2021Updated 4 years ago
- ☆16Aug 1, 2025Updated 11 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Handwritten Kazakh and Russian (HKR) database for text recognition☆79Aug 17, 2021Updated 4 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆25Aug 23, 2019Updated 6 years ago
- Docker container for UDPipe (https://github.com/ufal/udpipe) REST server.☆12Jun 23, 2020Updated 6 years ago
- Sentiment analysis in Uzbek language and new Datasets of Uzbek App reviews for Sentiment Classification☆19Dec 26, 2022Updated 3 years ago
- This is the Placeholder for Llama. Starting with Llama 3☆11May 20, 2024Updated 2 years ago
- LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)☆10Oct 18, 2021Updated 4 years ago
- ☆31Jul 23, 2022Updated 3 years ago
- GPT Table Semantic Parsing with complex & non-intuitive structure.☆17Jul 16, 2025Updated 11 months ago
- This repo contains "Azure Data Engineer Associate" Questions and related docs.☆13Jan 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Label dialogue with Dialogue Acts and Adjacency Pairs☆12Jun 20, 2023Updated 3 years ago
- ☆12Jun 9, 2025Updated last year
- Jupyter notebooks for course "Computational Morphology with HFST".☆21Oct 5, 2022Updated 3 years ago
- ☆15Jul 2, 2020Updated 6 years ago
- A mock social networking platform made using Node.js and MongoDB.☆17Sep 19, 2021Updated 4 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- Language Models for Code Completion: a Practical Evaluation☆13Jan 19, 2024Updated 2 years ago
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- ☆11Sep 9, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- a corpus containing 4.5K conversations from the Conversational Question-Answering dataset CoQA, for a total of 53K follow-up question-ans…☆16Jun 12, 2023Updated 3 years ago
- Course materials for 11-767☆14Nov 10, 2022Updated 3 years ago
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆13May 19, 2025Updated last year
- ☆11Jun 23, 2022Updated 4 years ago
- ☆14Aug 9, 2021Updated 4 years ago
- ☆17Sep 30, 2019Updated 6 years ago
- A scraper that i made to get predictions from multiple websites and then convert them all into a xlsx file.☆11Jul 19, 2022Updated 3 years ago
- On-the-fly Definition Augmentation of LLMs for Biomedical NER☆14Apr 14, 2025Updated last year
- Online book for sits☆32May 16, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Aug 10, 2023Updated 2 years ago
- 🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B…☆11Jun 8, 2021Updated 5 years ago
- ☆13Aug 3, 2024Updated last year
- a function of experience divided by time☆15Jul 9, 2010Updated 15 years ago
- Repository for DISRPT2021 shared task☆16Sep 5, 2022Updated 3 years ago
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago