neubig/kylm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neubig/kylm)

neubig / kylm

The Kyoyo Language Modeling Toolkit

☆27

Alternatives and similar repositories for kylm

Users that are interested in kylm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

steveash / jg2p
View on GitHub
Grapheme to phoneme toolkit using joint-modelling + CRFs in java
☆15Jul 14, 2018Updated 8 years ago
nowlab / DALM
View on GitHub
An Efficient Language Model Using Double-Array Structures
☆17Aug 10, 2020Updated 5 years ago
neubig / latticelm
View on GitHub
Software for unsupervised word segmentation and language model learning using lattices
☆45Aug 17, 2016Updated 9 years ago
fgnt / LatticeWordSegmentation
View on GitHub
Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model
☆17Nov 24, 2016Updated 9 years ago
tarowatanabe / cicada
View on GitHub
cicada: a hypergraph-based toolkit for statistical machine translation based on {tree, string}-to-{tree, string} models
☆42Aug 9, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
adampauls / berkeleylm
View on GitHub
Automatically exported from code.google.com/p/berkeleylm
☆101Jan 15, 2016Updated 10 years ago
takuyaa / doublearray
View on GitHub
JavaScript implementation of Double-Array trie
☆23Oct 15, 2023Updated 2 years ago
jiyfeng / dclm
View on GitHub
Document context language models
☆21Nov 13, 2015Updated 10 years ago
neubig / pialign
View on GitHub
pialign - A Phrasal ITG Aligner
☆24Apr 29, 2019Updated 7 years ago
neologd / namelti
View on GitHub
Namelti : The automatic transcription generation library for person name in Katakana
☆24Jul 10, 2023Updated 3 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
ChenChengKuan / awesome_deep_language_style_transfer
View on GitHub
collections of language style transfer papers
☆10Jan 4, 2018Updated 8 years ago
CogComp / zoe
View on GitHub
Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.
☆43Jan 16, 2020Updated 6 years ago
allenai / tableilp
View on GitHub
Question Answering via Integer Programming (TableILP)
☆28Apr 22, 2016Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
google-research-datasets / wiki-links
View on GitHub
Automatically exported from code.google.com/p/wiki-links
☆43Dec 15, 2015Updated 10 years ago
i3thuan5 / FaNT
View on GitHub
Filtering and Noise Adding Tool
☆29May 27, 2022Updated 4 years ago
dansoutner / LSTMLM
View on GitHub
Simple LSTM language modelling toolkit
☆10Oct 21, 2022Updated 3 years ago
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
maxbane / simplegoodturing
View on GitHub
Python implementation of Gale and Sampson's (1995/2001) "Simple Good Turing" algorithm.
☆36Feb 1, 2019Updated 7 years ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
dan-wells / kiss-aligner
View on GitHub
Simple Kaldi recipe for forced alignment
☆11Jul 16, 2023Updated 3 years ago
cdcrabtree / nomine
View on GitHub
Classify names by gender, U.S. ethnicity, or leaf nationality
☆19Oct 13, 2018Updated 7 years ago
YangDK / openfst-android-ndk
View on GitHub
Build OpenFst using ndk-build
☆11Nov 22, 2018Updated 7 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
tkng / micter
View on GitHub
micter is a micro word segmenter which splits a sentence into words.
☆15Jun 21, 2014Updated 12 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
JuliaStrings / TinySegmenter.jl
View on GitHub
Julia version of TinySegmenter, compact Japanese tokenizer
☆21Nov 24, 2020Updated 5 years ago
CoEDL / kaldi_helpers
View on GitHub
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15May 19, 2020Updated 6 years ago
CUNY-CL / citylex
View on GitHub
An English lexical database from the Big 🍎, let's go Mets baby love da Mets
☆18Updated this week
Idlak / Living-Audio-Dataset
View on GitHub
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆43Aug 3, 2022Updated 3 years ago
jiangnanhugo / lmkit
View on GitHub
language models toolkits with hierarchical softmax setting
☆17Mar 23, 2018Updated 8 years ago
CogComp / illinois-sl
View on GitHub
A general-purpose Java library for performing structured learning.
☆23Jul 5, 2022Updated 4 years ago
athena-team / athena-transform
View on GitHub
☆21Jan 13, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
vsiivola / variKN
View on GitHub
A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…
☆42Sep 6, 2025Updated 10 months ago
NickRuiz / power-asr
View on GitHub
Phonetically-Oriented Word Error Rate
☆36May 4, 2019Updated 7 years ago
allenai / natural-perturbations
View on GitHub
Natural Perturbation for Robust Question Answering
☆12Apr 7, 2020Updated 6 years ago
JazminVidal / gop-pykaldi
View on GitHub
Goodness of Pronunciation algorithm using PyKaldi
☆19Jun 12, 2022Updated 4 years ago
talonvoice / speech
View on GitHub
speech engine training projects
☆29Apr 19, 2021Updated 5 years ago
UKPLab / acl2016-convincing-arguments
View on GitHub
Code and data for ACL2016 article "Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidi…
☆28Aug 2, 2016Updated 9 years ago
marcusklang / wikiforia
View on GitHub
A Utility Library for Wikipedia dumps
☆33Feb 24, 2017Updated 9 years ago