DuyguA / computational_linguisticsLinks
☆26Updated 3 years ago
Alternatives and similar repositories for computational_linguistics
Users that are interested in computational_linguistics are comparing it to the libraries listed below
Sorting:
- This repo contains the software that was used to conduct the experiments reported in our article titled "Improving Named Entity Recogniti…☆20Updated 3 years ago
- Turkish NER, Question-Answer and Sentence datasets☆16Updated 5 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 3 years ago
- This repository contains the Turkish word vectors and analogical reasoning task pairs produced and used during the study.☆15Updated 8 years ago
- A human-annotated morphosyntactic treebank for Turkish.☆34Updated 3 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Updated 4 years ago
- This repository☆30Updated 3 years ago
- Coqui STT (🐸STT) based forced alignment tool☆13Updated 3 years ago
- Automatic Dialect Detection Repository☆39Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆115Updated 6 years ago
- Pronounce Arabic words☆19Updated 6 years ago
- scipts for working with open.bible data☆26Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 4 years ago
- Word Error Rate Estimation☆15Updated 5 years ago
- Scripts to create speech corpora from open.bible☆13Updated 4 years ago
- Linguistic processing for Common Voice☆58Updated 2 years ago
- Text Classification Dataset for Turkish Language☆10Updated 4 years ago
- phone inventory library☆17Updated 2 years ago
- Calculates the Word Error Rate between two text files☆20Updated 3 years ago
- Proposed splits for the LREC Wikipron paper☆15Updated 5 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115Updated last year
- ☆22Updated 3 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Updated 6 years ago
- ☆20Updated 3 years ago
- Caucasus languages focused multilingual and monolingual corpuses for Natural Language Processing(NLP)☆35Updated last year
- docker for HF wav2vec2-sprint☆13Updated 4 years ago
- IPA tokeniser☆18Updated 6 months ago
- State-of-the-art NLP tools for Turkish☆71Updated last year