scarletcho/KoLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/scarletcho/KoLM)

scarletcho / KoLM

Korean text normalization and language preparation package for LM in Kaldi-based ASR system

☆64

Alternatives and similar repositories for KoLM

Users that are interested in KoLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

scarletcho / KoG2P
View on GitHub
Korean grapheme-to-phone conversion in Python
☆133Jan 27, 2020Updated 6 years ago
songys / 2021Langcon
View on GitHub
☆11Oct 3, 2021Updated 4 years ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
homink / speech.ko
View on GitHub
Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language
☆43Feb 28, 2018Updated 8 years ago
warnikchow / kosp2e
View on GitHub
Korean Speech to English Translation Corpus
☆45Sep 3, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lovit / flask_api_tutorial
View on GitHub
Flask 로 API 를 만들기 위한 튜토리얼
☆10Jun 22, 2020Updated 6 years ago
openkorpos / model-mecab
View on GitHub
MeCab model trained with OpenKorPos.
☆23Jun 19, 2022Updated 4 years ago
dobby-seo / kosr
View on GitHub
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
☆31Feb 19, 2021Updated 5 years ago
qqueing / pytorch-G2P
View on GitHub
(semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean
☆23Dec 17, 2017Updated 8 years ago
jeongukjae / korean-wikipedia-corpus
View on GitHub
문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.
☆24Sep 6, 2023Updated 2 years ago
cedar101 / twitter-korean-py
View on GitHub
Python port to the normalizer in https://github.com/twitter/twitter-korean-text
☆25Apr 26, 2016Updated 10 years ago
emotiontts / emotiontts_open_db
View on GitHub
로봇의 감정 및 개성을 표현할 수 있는 대화형 음성합성 오픈소스 플랫폼
☆108Feb 5, 2025Updated last year
insikk / namu_wiki_db_preprocess
View on GitHub
A python script to convert namu wiki database to huge Korean language corpus
☆29Apr 21, 2017Updated 9 years ago
Kyubyong / g2pK
View on GitHub
g2pK: g2p module for Korean
☆271Mar 1, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HGU-DLLAB / Korean-FastSpeech2-Pytorch
View on GitHub
Implementation of Korean FastSpeech2
☆215Jan 29, 2023Updated 3 years ago
lovit / namuwikitext
View on GitHub
Wikitext format dataset of Namuwiki (Most famous Korean wikipedia)
☆53Oct 25, 2020Updated 5 years ago
amazon-science / proteno
View on GitHub
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45May 25, 2021Updated 5 years ago
lovit / huggingface_konlpy
View on GitHub
Training Transformers of Huggingface with KoNLPy
☆68Aug 28, 2020Updated 5 years ago
triplet02 / KoNPron
View on GitHub
Convert Numerical Representations to Korean Pronunciation
☆14Apr 20, 2020Updated 6 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
baikalai / baikal-bert
View on GitHub
baikal.ai's pre-trained BERT models: descriptions and sample codes
☆12Jun 24, 2021Updated 5 years ago
songys / entity
View on GitHub
날짜, 장소, 사람, 기관, 시간
☆23Jan 10, 2023Updated 3 years ago
ModuNLP / hacking_transformers
View on GitHub
☆11Aug 12, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
songys / AwesomeKorean_Speech
View on GitHub
음성인식과 신호처리
☆14Sep 12, 2021Updated 4 years ago
upskyy / Paper-Review
View on GitHub
Paper Review about Speech Recognition · NLP
☆10Mar 25, 2021Updated 5 years ago
hyunwoongko / megatron-11b
View on GitHub
Megatron LM 11B on Huggingface Transformers
☆28Jul 11, 2021Updated 5 years ago
dobby-seo / korean-speech-recognition-quartznet
View on GitHub
Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식
☆22Jul 21, 2021Updated 5 years ago
forkonlp / newspaper
View on GitHub
대부분의 신문사 뉴스를 수집하는 것을 목적으로 하는 크롤러 제작 프로젝트
☆11Jul 29, 2019Updated 6 years ago
passing2961 / KMRE
View on GitHub
Korean Moview Review Emotion (KMRE) Dataset
☆21Sep 7, 2020Updated 5 years ago
zhaohb / MeloTTS-OV
View on GitHub
Using OpenVINO to speed up MeloTTS inference
☆15Nov 1, 2024Updated last year
JoungheeKim / kor-spacing
View on GitHub
This is project for korean auto spacing
☆12Aug 3, 2020Updated 5 years ago
noowad93 / chosung-translator
View on GitHub
초성 해석기 based on ko-BART
☆29Mar 31, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sooftware / kospeech
View on GitHub
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
☆637May 27, 2023Updated 3 years ago
SMART-TTS / SMART-G2P
View on GitHub
☆103Mar 24, 2023Updated 3 years ago
YongWookHa / kor-text-preprocess
View on GitHub
Korean text data preprocess toolkit for NLP
☆18Jun 11, 2019Updated 7 years ago
eagle705 / korean-ner-cnn-bilstm
View on GitHub
CNN+BiLSTM 기반 한국어 개체명 인식기입니다
☆57Nov 26, 2019Updated 6 years ago
knlee-voice / AI.Tech
View on GitHub
Trends, Tools, News timeline ...
☆20Oct 13, 2025Updated 9 months ago
lovit / soyspacing
View on GitHub
띄어쓰기 오류 교정 라이브러리입니다. CRF 와 같은 머신러닝 알고리즘이 아닌, 직관적인 접근법으로 띄어쓰기를 교정합니다.
☆149Sep 26, 2019Updated 6 years ago
daanzu / wav2vec2_stt_python
View on GitHub
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23Aug 16, 2021Updated 4 years ago