currentslab/fastlangid

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/currentslab/fastlangid)

currentslab / fastlangid

fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-hant)

☆43

Alternatives and similar repositories for fastlangid

Users that are interested in fastlangid are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

violet-zct / fairseq-dro-mnmt
View on GitHub
☆14Sep 10, 2021Updated 4 years ago
luca-ant / WhatsSee
View on GitHub
A simple and humble image captioning application, based on a neural network built with Keras
☆10Sep 23, 2022Updated 3 years ago
giuseppeporcelli / end-to-end-ml-sm
View on GitHub
End to end Machine Learning with Amazon SageMaker
☆42Feb 16, 2024Updated 2 years ago
HLTCHKUST / elderly_ser
View on GitHub
Transferability of cross-lingual and cross-age speech emotion recognition
☆21Jun 30, 2023Updated 3 years ago
chordify / tapcorrect
View on GitHub
Supplemental material for the paper "Towards Automatically Correcting Tapped Beat Annotations for Music Recordings"
☆20May 6, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
TuringTrain / lyrics_segmentation
View on GitHub
☆13Dec 3, 2019Updated 6 years ago
alvations / SeedLing
View on GitHub
Building and Using A Seed Corpus for the Human Language Project
☆11Feb 9, 2018Updated 8 years ago
fakerybakery / simpletts
View on GitHub
A lightweight Python library for running TTS models with a unified API.
☆20Feb 18, 2025Updated last year
bicici / FDA
View on GitHub
Feature Decay Algorithms
☆11Mar 5, 2014Updated 12 years ago
HLTCHKUST / UniVaR
View on GitHub
Official reposity for paper "High-Dimension Human Value Representation in Large Language Models" (NAACL'25 Main)
☆23Jul 9, 2024Updated 2 years ago
mojesty / professor_forcing
View on GitHub
Professor forcing future code
☆10Sep 22, 2018Updated 7 years ago
DineshRaghu / multi-level-memory-network
View on GitHub
Multi-Level Memory for Task Oriented Dialogs
☆15Jul 19, 2019Updated 7 years ago
mayhewsw / pytorch-truecaser
View on GitHub
A simple neural truecaser written in pytorch and allennlp.
☆35Jun 17, 2024Updated 2 years ago
evelynkyl / yue_nmt
View on GitHub
Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project
☆16Oct 28, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
liukun95 / Noisy-NER-Confidence-Estimation
View on GitHub
The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021
☆36May 8, 2021Updated 5 years ago
chutaklee / CantoASR
View on GitHub
Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)
☆16May 8, 2022Updated 4 years ago
jacopotagliabue / pixel_from_lambda
View on GitHub
Serve a 1x1 GIF pixel from an AWS lambda-powered endpoint
☆13Sep 7, 2017Updated 8 years ago
lmorgadodacosta / CantoneseWN
View on GitHub
The Cantonese Wordnet
☆15Dec 4, 2023Updated 2 years ago
musixmatchresearch / umberto
View on GitHub
UmBERTo: an Italian Language Model trained with Whole Word Masking.
☆112Feb 10, 2026Updated 5 months ago
stefan-it / gc4lm
View on GitHub
GC4LM: A Colossal (Biased) language model for German
☆13May 2, 2021Updated 5 years ago
qhungngo / EVBCorpus
View on GitHub
The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts.
☆52Jul 12, 2019Updated 7 years ago
zhuzilin / vllm-group
View on GitHub
☆12Nov 5, 2024Updated last year
bochen1106 / ISMIR2019-Tutorial3-Audiovisual-Music-Processing
View on GitHub
☆33Nov 7, 2019Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rgrishman / ice
View on GitHub
Ice is a rapid information extraction customizer
☆15Apr 26, 2021Updated 5 years ago
MTG / da-tacos
View on GitHub
A Dataset for Cover Song Identification and Understanding
☆66Feb 23, 2023Updated 3 years ago
aws-samples / reinvent2019-aim362-sagemaker-debugger-model-monitor
View on GitHub
Build, train & debug, and deploy & monitor with Amazon SageMaker
☆119Aug 9, 2022Updated 3 years ago
emirdemirel / ASA_ICASSP2021
View on GitHub
A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…
☆15Oct 13, 2022Updated 3 years ago
HITsz-TMG / VisionGraph
View on GitHub
The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…
☆17May 27, 2024Updated 2 years ago
toltoxgh / CoreNLP-jMWE
View on GitHub
Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word Expressions / collocations
☆15Jan 6, 2017Updated 9 years ago
Betswish / Cross-Lingual-Consistency
View on GitHub
Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…
☆28Aug 8, 2025Updated 11 months ago
yudiandoris / csi
View on GitHub
End-to-End Chinese Speaker Identification
☆11Nov 17, 2022Updated 3 years ago
pkunlp-icler / MLS
View on GitHub
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022
☆13Apr 13, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zhengzx-nlp / past-and-future-nmt
View on GitHub
Implementation of "Modeling Past and Future for Neural Machine Translation"
☆15Mar 16, 2018Updated 8 years ago
gpengzhi / CrossConST-MT
View on GitHub
Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …
☆10Jul 18, 2023Updated 3 years ago
SunbowLiu / PTvsBT
View on GitHub
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))
☆13Nov 21, 2021Updated 4 years ago
ayaka14732 / bert-tokenizer-cantonese
View on GitHub
BERT Tokenizer with vocabulary tailored for Cantonese
☆23Oct 27, 2022Updated 3 years ago
pangjh3 / AnLLM
View on GitHub
☆20Jun 17, 2024Updated 2 years ago
tenapril / kuaci
View on GitHub
Indonesian KTP Validator + Enrichment [Open Source]
☆27Mar 26, 2026Updated 3 months ago
xydaytoy / BMI-NMT
View on GitHub
☆11Jul 28, 2021Updated 4 years ago