Lightning Fast Language Prediction
☆167 · Aug 22, 2025 · Updated 8 months ago
Alternatives and similar repositories for whatthelang
Users that are interested in whatthelang are comparing it to the libraries listed below.
- Language detection extension for spaCy 2.0+ ☆114 · Feb 12, 2019 · Updated 7 years ago
- Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, … ☆23 · May 12, 2021 · Updated 5 years ago
- Source code for the Apple reproduction ☆33 · Apr 23, 2021 · Updated 5 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ ☆29 · May 15, 2020 · Updated 5 years ago
- Simple CORPORA list crawler ☆10 · Dec 2, 2016 · Updated 9 years ago
- ☆179 · Mar 28, 2025 · Updated last year
- Lazy python recipes. ☆10 · Apr 17, 2026 · Updated 3 weeks ago
- [WWW 2026] GlotWeb: Web Indexing for Minority Languages ☆17 · Apr 14, 2026 · Updated 3 weeks ago
- Unsupervised concept extraction from clinical text ☆14 · Jun 17, 2024 · Updated last year
- RocksDB Ops CLI ☆11 · Dec 17, 2016 · Updated 9 years ago
- A toolkit to create, launch and monitor SLURM jobs over existing python scripts. ☆12 · May 13, 2024 · Updated last year
- Tool to fix bitexts and tag near-duplicates for removal ☆35 · Sep 4, 2025 · Updated 8 months ago
- Rust python bindings for symspell ☆21 · Dec 25, 2023 · Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆30 · Nov 18, 2025 · Updated 5 months ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation ☆41 · Apr 8, 2016 · Updated 10 years ago
- Port of Google's language-detection library to Python. ☆1,884 · Mar 3, 2025 · Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆160 · Jun 18, 2024 · Updated last year
- Calculates Word Mover's Distance Insanely Fast ☆458 · Aug 17, 2023 · Updated 2 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023) ☆76 · Apr 1, 2025 · Updated last year
- A set of procedures to estimate the readability of a text ☆15 · Apr 30, 2018 · Updated 8 years ago
- Textpipe: clean and extract metadata from text ☆302 · Jun 9, 2021 · Updated 4 years ago
- Targeted language identifier, based on FastText and Hunspell. ☆38 · Sep 4, 2025 · Updated 8 months ago
- GoCD plugins to work with MLFlow as model repository in a CD flow ☆32 · Nov 1, 2023 · Updated 2 years ago
- The Arabic NLP Python Library (Archived in favor of Matn library) ☆11 · Apr 28, 2017 · Updated 9 years ago
- We use phonetics as a feature to create a joint semantic-phonetic embedding and improve the neural machine translation between Chinese an… ☆12 · Aug 3, 2021 · Updated 4 years ago
- A Python framework for exploring distributional semantic models. ☆85 · Dec 12, 2015 · Updated 10 years ago
- Keyphrase Extraction Review ☆14 · Dec 17, 2025 · Updated 4 months ago
- Natural language detection, Java bindings for CLD2 ☆17 · Feb 26, 2026 · Updated 2 months ago
- ☆32 · Jun 16, 2021 · Updated 4 years ago
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text ☆1,717 · Apr 23, 2026 · Updated 2 weeks ago
- Chennaipy's website at chennaipy.org ☆13 · May 2, 2026 · Updated last week
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API. ☆116 · Mar 5, 2020 · Updated 6 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://… ☆16 · Feb 17, 2022 · Updated 4 years ago
- Randomly sample lines from massive text files efficiently ☆17 · Apr 1, 2015 · Updated 11 years ago
- Information and resources related to the talks done at Chennaipy meetups. ☆11 · Jun 3, 2018 · Updated 7 years ago
- Language detection using Spacy and Fasttext ☆55 · Dec 17, 2023 · Updated 2 years ago
- Python bindings to the Compact Language Detector ☆33 · Apr 30, 2020 · Updated 6 years ago
- Fast Word Clustering Software ☆79 · Feb 8, 2025 · Updated last year
- Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset ☆21 · Dec 7, 2022 · Updated 3 years ago