Lightning Fast Language Prediction
☆167 · Aug 22, 2025 · Updated 6 months ago
Alternatives and similar repositories for whatthelang
Users who are interested in whatthelang are comparing it to the libraries listed below
- Language detection extension for spaCy 2.0+ ☆114 · Feb 12, 2019 · Updated 7 years ago
- Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, … ☆23 · May 12, 2021 · Updated 4 years ago
- Pacman & Ghosts drawn in Bash! ☆11 · Dec 3, 2020 · Updated 5 years ago
- A Cython interface to FLANN ☆24 · Nov 25, 2020 · Updated 5 years ago
- A web application for tagging and retrieval of arguments in text ☆30 · May 1, 2023 · Updated 2 years ago
- Hwyluso cyfieithu peirianyddol MosesSMT i'r Gymraeg // Making MosesSMT machine translation easier for Welsh (and other languages) ☆16 · Aug 25, 2021 · Updated 4 years ago
- ☆178 · Mar 28, 2025 · Updated 11 months ago
- A Python framework for exploring distributional semantic models. ☆85 · Dec 12, 2015 · Updated 10 years ago
- Rust python bindings for symspell ☆21 · Dec 25, 2023 · Updated 2 years ago
- Simple CORPORA list crawler ☆10 · Dec 2, 2016 · Updated 9 years ago
- AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training ☆43 · Aug 28, 2013 · Updated 12 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆29 · Nov 18, 2025 · Updated 3 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆160 · Jun 18, 2024 · Updated last year
- This repo contains the code used to generate the French Wikipedia sample used in the QA annotation project PIAF ☆11 · Jun 15, 2021 · Updated 4 years ago
- Chennaipy's website at chennaipy.org ☆13 · Feb 27, 2026 · Updated last week
- Lazy python recipes. ☆10 · Apr 17, 2021 · Updated 4 years ago
- countryinfo provides all kinds of data on countries. ☆12 · Jul 22, 2021 · Updated 4 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze ☆11 · Nov 16, 2021 · Updated 4 years ago
- a sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Models ☆22 · Jan 18, 2016 · Updated 10 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation ☆41 · Apr 8, 2016 · Updated 9 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023) ☆74 · Apr 1, 2025 · Updated 11 months ago
- Release history of Chatbot-Eliza ☆10 · Dec 4, 2023 · Updated 2 years ago
- Text-to-image conversion (OCR) for Pashto and Chinese, with a view towards comprehensive, multi-lingual OCR ☆18 · Jun 23, 2020 · Updated 5 years ago
- A style-transfer project using CycleGAN to render photos in the style of Studio Ghibli animations. ☆11 · Feb 20, 2019 · Updated 7 years ago
- Scala JSON-RPC server for Stanford CoreNLP ☆10 · Nov 9, 2021 · Updated 4 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German ☆12 · Feb 27, 2023 · Updated 3 years ago
- Named Entity Recognizer for Arabic ☆12 · Nov 22, 2017 · Updated 8 years ago
- GlotWeb: Web Indexing for Minority Languages (WWW 2026) ☆17 · Feb 27, 2026 · Updated last week
- code and data used to build a training dataset for dragnet models ☆10 · Nov 29, 2020 · Updated 5 years ago
- Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset ☆21 · Dec 7, 2022 · Updated 3 years ago
- Morphological analyzer and lemmatizer for Latin. ☆29 · Dec 10, 2025 · Updated 2 months ago
- Tool to fix bitexts and tag near-duplicates for removal ☆34 · Sep 4, 2025 · Updated 6 months ago
- Multi-Language Identification ☆28 · Jul 25, 2024 · Updated last year
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora. ☆33 · Sep 4, 2018 · Updated 7 years ago
- Calculates Word Mover's Distance Insanely Fast ☆462 · Aug 17, 2023 · Updated 2 years ago
- Textpipe: clean and extract metadata from text ☆302 · Jun 9, 2021 · Updated 4 years ago
- We use phonetics as a feature to create a joint semantic-phonetic embedding and improve the neural machine translation between Chinese an… ☆12 · Aug 3, 2021 · Updated 4 years ago
- Source code for the Apple reproduction ☆33 · Apr 23, 2021 · Updated 4 years ago
- ☆14 · Jun 10, 2020 · Updated 5 years ago