Lightning Fast Language Prediction π
β168Aug 22, 2025Updated 10 months ago
Alternatives and similar repositories for whatthelang
Users that are interested in whatthelang are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Language detection extension for spaCy 2.0+β114Feb 12, 2019Updated 7 years ago
- β15Apr 28, 2020Updated 6 years ago
- Simple CORPORA list crawlerβ11Dec 2, 2016Updated 9 years ago
- A web application tagging and retrieval of arguments in textβ30May 1, 2023Updated 3 years ago
- Lazy python recipes.β10Apr 17, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [WWW 2026] πΈ GlotWeb: Web Indexing for Minority Languagesβ17Apr 14, 2026Updated 2 months ago
- Unsupervised concept extraction from clinical textβ14Jun 17, 2024Updated 2 years ago
- Tool to fix bitexts and tag near-duplicates for removalβ35Sep 4, 2025Updated 9 months ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.β39Feb 5, 2026Updated 4 months ago
- Multi-Langauge Identificationβ28Jul 25, 2024Updated last year
- Rust python bindings for symspellβ21Dec 25, 2023Updated 2 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of Germanβ12Feb 27, 2023Updated 3 years ago
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used stβ¦β25Jan 3, 2025Updated last year
- C++ implementation of Generalised Brown clustering and python scripts for feature generationβ41Apr 8, 2016Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Port of Google's language-detection library to Python.β1,893Mar 3, 2025Updated last year
- Hwyluso cyfieithu peirianyddol MosesSMT i'r Gymraeg // Making MosesSMT machine translation easier for Welsh (and other languages)β16Aug 25, 2021Updated 4 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.β160Jun 18, 2024Updated 2 years ago
- Calculates Word Mover's Distance Insanely Fastβ458Aug 17, 2023Updated 2 years ago
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extractionβ11Oct 25, 2017Updated 8 years ago
- Refactor your software using programming language independent, case-preserving string replacementβ17Jul 9, 2019Updated 6 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)β77Apr 1, 2025Updated last year
- A set of procedures to estimate the readability of a textβ15Apr 30, 2018Updated 8 years ago
- Textpipe: clean and extract metadata from textβ302Jun 9, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The Arabic NLP Python Library (Archived in favor of Matn library)β10Apr 28, 2017Updated 9 years ago
- A Python framework for exploring distributional semantic models.β85Dec 12, 2015Updated 10 years ago
- β28Jul 29, 2023Updated 2 years ago
- Detect and classify pagination linksβ15Sep 9, 2020Updated 5 years ago
- Natural language detection, Java bindings for CLD2β17Feb 26, 2026Updated 4 months ago
- A collection of Python scripts to download and extract rating datasets from Twitter for multiple websitesβ28Oct 17, 2020Updated 5 years ago
- a sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Modelsβ21Jan 18, 2016Updated 10 years ago
- Language detection using Spacy and Fasttextβ54Dec 17, 2023Updated 2 years ago
- Blazing fast language detection using fastText modelβ24Dec 18, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python bindings to the Compact Language Detectorβ33Apr 30, 2020Updated 6 years ago
- FastFormers - highly efficient transformer models for NLUβ706Mar 21, 2025Updated last year
- This repo contains the code used to generate the French Wikipedia sample used in the QA annotation project PIAFβ11Jun 15, 2021Updated 5 years ago
- Stand-alone language identification systemβ2,462Jan 1, 2020Updated 6 years ago
- Fast Word Clustering Softwareβ79Feb 8, 2025Updated last year
- πNeural Sentential Paraphrase Generation to Augment Chatbot Training Datasetβ21Dec 7, 2022Updated 3 years ago
- Faiss bindings for Javaβ24Oct 9, 2020Updated 5 years ago