adbar/py3langid

Faster, modernized fork of the language identification tool langid.py

☆63

Alternatives and similar repositories for py3langid

Users who are interested in py3langid are comparing it to the libraries listed below.

accelerated-text / reaction-acc-text-demo
Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop.
☆13 · Feb 22, 2021 · Updated 5 years ago
adbar / courlan
Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters
☆177 · Updated this week
Health-RI / health-ri-metadata
Health-RI metadata schemas
☆16 · Jul 13, 2026 · Updated last week
adbar / htmldate
Fast and robust date extraction from web pages, with Python or on the command-line
☆154 · Updated this week
darvid / biome
Provides painless access to namespaced environment variables.
☆13 · Apr 20, 2021 · Updated 5 years ago
maehr / github-template
GitHub template for small projects.
☆22 · Mar 6, 2026 · Updated 4 months ago
sparna-git / skos-testing-tool
A web UI to assess the quality of SKOS and SKOS-XL files. Frontend for qSKOS.
☆15 · Apr 30, 2026 · Updated 2 months ago
kpu / fasterText
Library for fast text representation and classification.
☆31 · Jan 9, 2024 · Updated 2 years ago
mbanon / fastspell
Targeted language identifier, based on FastText and Hunspell.
☆38 · Sep 4, 2025 · Updated 10 months ago
kailas-v / human-ai-interactions
☆11 · Oct 28, 2022 · Updated 3 years ago
UniversalDependencies / UD_German-HDT
☆14 · May 29, 2026 · Updated last month
andreburgaud / robotspy
Alternative robots parser module for Python
☆22 · Jun 19, 2026 · Updated last month
rasbt / try-lion-optimizer
☆14 · Mar 9, 2023 · Updated 3 years ago
BPI-SINOVOIP / BPI-M2-bsp
Supports BananaPi BPI-M2 (Kernel 3.3)
☆11 · Nov 3, 2016 · Updated 9 years ago
palewire / python-calaccess-notebooks
Python notebooks analyzing campaign finance and lobbying activity data from California Secretary of State’s CAL-ACCESS database
☆21 · Mar 3, 2018 · Updated 8 years ago
tokenmill / timewords
Multilingual library to easily parse date strings to java.util.Date objects.
☆32 · Sep 4, 2019 · Updated 6 years ago
kennethleungty / Simulated-Annealing-Feature-Selection
Feature Selection using Simulated Annealing
☆11 · Aug 10, 2022 · Updated 3 years ago
Pringled / agentcheck
Check what an AI agent can access before you run it
☆27 · Mar 8, 2026 · Updated 4 months ago
rosette-api / python
Babel Street Analytics Client Library for Python
☆38 · May 7, 2026 · Updated 2 months ago
tokenmill / crawling-framework
Easily crawl news portals or blog sites using Storm Crawler.
☆22 · Nov 15, 2022 · Updated 3 years ago
SAP / software-documentation-data-set-for-machine-translation
A parallel evaluation data set of SAP software documentation with document structure annotation
☆15 · Jun 12, 2026 · Updated last month
pjox / gutf
Terminal tool that converts file encodings to UTF-8
☆10 · Oct 5, 2019 · Updated 6 years ago
masakhane-io / africomet
COMET for African languages
☆11 · Jan 24, 2025 · Updated last year
ARBML / dar
A simple semi-supervised approach for creating Hugging Face data script loaders and uploading them to the hub.
☆11 · Jun 23, 2024 · Updated 2 years ago
AxelSorensenDev / Eevee
An Easy Annotation Tool for Natural Language Processing
☆12 · May 17, 2024 · Updated 2 years ago
UBC-NLP / afrolid
AfroLID, a powerful neural toolkit for African language identification that covers 517 African languages.
☆39 · Feb 5, 2026 · Updated 5 months ago
shyyhs / CourseraParallelCorpusMining
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
☆15 · Aug 27, 2024 · Updated last year
WatheqAlshowaiter / developer-portfolios
A list of developer portfolios for your inspiration
☆10 · Aug 3, 2020 · Updated 5 years ago
pemistahl / lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
☆1,759 · Updated this week
shayne-longpre / a-pretrainers-guide
☆71 · May 22, 2023 · Updated 3 years ago
qcri / ArabicSpellChecker
☆12 · May 21, 2020 · Updated 6 years ago
caarlos0-graveyard / github-vacations
Automagically ignore all notifications related to work when you are on vacation
☆21 · Aug 21, 2020 · Updated 5 years ago
jungwhank / transformer-pl
Transformer Implementation for NMT using PyTorch Lightning (Korean to English)
☆10 · Oct 19, 2020 · Updated 5 years ago
fsvbach / WassersteinTSNE
☆17 · May 19, 2025 · Updated last year
salgado / music-search
Code from blog 'Searching by Music: Leveraging Vector Search for Music Information Retrieval'
☆16 · Nov 16, 2023 · Updated 2 years ago
Aazhar / keras2tensorflow
Tutorial on running a Keras model in C++ and Python TensorFlow
☆11 · Oct 30, 2018 · Updated 7 years ago
kermitt2 / arxiv_harvester
Poor man's simple harvester for arXiv resources
☆14 · Jul 14, 2023 · Updated 3 years ago
SathvikEadla / W-SVM
Implementation of an open-set recognition algorithm.
☆12 · Sep 13, 2020 · Updated 5 years ago
slub / docsa
SLUB Document Classification and Similarity Analysis
☆10 · Aug 31, 2023 · Updated 2 years ago