Faster, modernized fork of the language identification tool langid.py
☆62Nov 22, 2024Updated last year
Alternatives and similar repositories for py3langid
Users that are interested in py3langid are comparing it to the libraries listed below.
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop.☆13Feb 22, 2021Updated 5 years ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆175Updated this week
- Fast and robust date extraction from web pages, with Python or on the command-line☆153Jun 15, 2026Updated 2 weeks ago
- Pydantic data models for DCAT-AP v3 and the Health-RI metadata model.☆19Jun 22, 2026Updated last week
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Aug 13, 2019Updated 6 years ago
- A web UI to assess the quality of SKOS and SKOS-XL files. Frontend for qSKOS.☆15Apr 30, 2026Updated 2 months ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- Targeted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 9 months ago
- ☆13May 29, 2026Updated last month
- Alternative robots parser module for Python☆22Jun 19, 2026Updated last week
- ☆14Mar 9, 2023Updated 3 years ago
- Supports BananaPi BPI-M2 (Kernel 3.3)☆11Nov 3, 2016Updated 9 years ago
- Python notebooks analyzing campaign finance and lobbying activity data from California Secretary of State’s CAL-ACCESS database☆21Mar 3, 2018Updated 8 years ago
- ☆11Oct 28, 2022Updated 3 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 8 months ago
- Babel Street Analytics Client Library for Python☆38May 7, 2026Updated last month
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆15Jun 12, 2026Updated 2 weeks ago
- ☆13Sep 1, 2023Updated 2 years ago
- Terminal tool that converts files encoding to UTF-8☆10Oct 5, 2019Updated 6 years ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- All code and content for my blog.☆15Sep 23, 2018Updated 7 years ago
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated 2 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆39Feb 5, 2026Updated 4 months ago
- Synthetic Text Dataset Generation for LLM projects☆58Jun 16, 2026Updated 2 weeks ago
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,746Jun 18, 2026Updated last week
- Automagically ignore all notifications related to work when you are on vacation☆21Aug 21, 2020Updated 5 years ago
- Tools for speech recognition☆11Jun 24, 2017Updated 9 years ago
- Code from blog 'Searching by Music: Leveraging Vector Search for Music Information Retrieval'☆16Nov 16, 2023Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆31Nov 18, 2025Updated 7 months ago
- Tutorial on running keras model in C++ and python tensorflow☆11Oct 30, 2018Updated 7 years ago
- Tonto is a DSL created to make it easier to work with Ontologies based on OntoUML☆34Jun 23, 2026Updated last week
- Implementation of an Openset Recognition algorithm.☆12Sep 13, 2020Updated 5 years ago
- Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport (Findings-ACL 2023)☆13May 4, 2023Updated 3 years ago
- ACL style for Typst☆23Jan 27, 2026Updated 5 months ago
- Poetry Corpora Annotated on Aesthetic Emotions☆13Aug 2, 2022Updated 3 years ago
- Code for Detecting language from text in python using fasttext☆13May 25, 2020Updated 6 years ago
- A thin wrapper around the DBpedia Spotlight HTTP API☆25Dec 2, 2017Updated 8 years ago
- SFST/SMOR/DWDS-based German Morphology☆21Updated this week