fastlangid, the only language identification package that supports Cantonese (zh-yue), Simplified Chinese (zh-hans), and Traditional Chinese (zh-hant)
☆43 · Dec 6, 2022 · Updated 3 years ago
Alternatives and similar repositories for fastlangid
Users that are interested in fastlangid are comparing it to the libraries listed below.
- ☆14 · Sep 10, 2021 · Updated 4 years ago
- Experiments with Hugging Face 🔬 🤗 ☆46 · Apr 18, 2026 · Updated 2 weeks ago
- A simple and humble image captioning application, based on a neural network built with Keras ☆10 · Sep 23, 2022 · Updated 3 years ago
- End to end Machine Learning with Amazon SageMaker ☆43 · Feb 16, 2024 · Updated 2 years ago
- Transferability of cross-lingual and cross-age speech emotion recognition ☆21 · Jun 30, 2023 · Updated 2 years ago
- ☆14 · Dec 3, 2019 · Updated 6 years ago
- Supplemental material for the paper "Towards Automatically Correcting Tapped Beat Annotations for Music Recordings" ☆20 · May 6, 2021 · Updated 4 years ago
- Targeted language identifier, based on FastText and Hunspell. ☆38 · Sep 4, 2025 · Updated 7 months ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese ☆40 · Dec 30, 2020 · Updated 5 years ago
- Building and Using A Seed Corpus for the Human Language Project ☆11 · Feb 9, 2018 · Updated 8 years ago
- ☆22 · Sep 26, 2022 · Updated 3 years ago
- Memcached module for Nest framework (node.js) ☆19 · Apr 23, 2026 · Updated last week
- Aligntune: A Modular Toolkit for Post Training Alignment of LLMs ☆36 · Mar 23, 2026 · Updated last month
- Repo for the Wasabi datasets ☆117 · Apr 10, 2025 · Updated last year
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model" ☆14 · Jul 8, 2025 · Updated 9 months ago
- ☆10 · Feb 2, 2021 · Updated 5 years ago
- Professor forcing future code ☆10 · Sep 22, 2018 · Updated 7 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project ☆16 · Oct 28, 2022 · Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp. ☆33 · Jun 17, 2024 · Updated last year
- An ASGI function for proxying to a backend over HTTP ☆23 · Jul 14, 2025 · Updated 9 months ago
- The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021 ☆36 · May 8, 2021 · Updated 4 years ago
- The accompanying code and data for the Springer 2017 publication "What's missing in geographical parsing?" in Language Resources and Eval… ☆18 · Oct 17, 2019 · Updated 6 years ago
- ☆27 · May 15, 2024 · Updated last year
- Fine-tuning Wav2Vec2.0 on Common Voice (zh-HK) ☆16 · May 8, 2022 · Updated 3 years ago
- NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented an… ☆28 · Sep 27, 2024 · Updated last year
- The Cantonese Wordnet ☆14 · Dec 4, 2023 · Updated 2 years ago
- Build, train & debug, and deploy & monitor with Amazon SageMaker ☆119 · Aug 9, 2022 · Updated 3 years ago
- A Dataset for Cover Song Identification and Understanding ☆65 · Feb 23, 2023 · Updated 3 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi… ☆15 · Oct 13, 2022 · Updated 3 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts. ☆51 · Jul 12, 2019 · Updated 6 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer ☆40 · Jul 14, 2020 · Updated 5 years ago
- Zero-shot Cross-lingual Task-Oriented Dialogue Systems (EMNLP 2019) ☆24 · Nov 9, 2019 · Updated 6 years ago
- ☆33 · Nov 7, 2019 · Updated 6 years ago
- Search Engine Guided Non-Parametric Neural Machine Translation ☆14 · Oct 23, 2017 · Updated 8 years ago
- Model training tutorials for the Stanza Python NLP Library ☆41 · Jul 12, 2022 · Updated 3 years ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C… ☆17 · May 27, 2024 · Updated last year
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency … ☆10 · Jul 18, 2023 · Updated 2 years ago
- Code repository for the project "Automated Detection and Removal of Advertisements in Audio Clips" ☆13 · May 30, 2016 · Updated 9 years ago
- ☆13 · Apr 22, 2024 · Updated 2 years ago