Faster, modernized fork of the language identification tool langid.py
☆60 · Nov 22, 2024 · Updated last year
Alternatives and similar repositories for py3langid
Users who are interested in py3langid are comparing it to the libraries listed below.
- A neural dependency parser that does its best ☆16 · Dec 12, 2025 · Updated 2 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters ☆159 · Dec 19, 2025 · Updated 2 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 · Aug 13, 2019 · Updated 6 years ago
- Alternative robots parser module for Python ☆21 · Updated this week
- Python notebooks analyzing campaign finance and lobbying activity data from California Secretary of State’s CAL-ACCESS database ☆22 · Mar 3, 2018 · Updated 7 years ago
- A thin wrapper around the DBpedia Spotlight HTTP API ☆25 · Dec 2, 2017 · Updated 8 years ago
- A chord generating midi controller using RP2350 ☆49 · Jan 14, 2026 · Updated last month
- Next-generation Punkt sentence boundary detection with zero dependencies ☆29 · Nov 18, 2025 · Updated 3 months ago
- Library for fast text representation and classification. ☆31 · Jan 9, 2024 · Updated 2 years ago
- AudiosPlugin is a Godot iOS Audio Plugin that resolves the audio recording issue in iOS for Godot Engine. ☆10 · Jun 16, 2025 · Updated 8 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆186 · Jun 6, 2025 · Updated 8 months ago
- A python instagram scraper which uses BeautifulSoup and JSON to scrape public instagram accounts ☆28 · Aug 4, 2017 · Updated 8 years ago
- Respect generative AI opt-outs in your ML training pipeline. ☆39 · Oct 9, 2024 · Updated last year
- Synthetic Text Dataset Generation for LLM projects ☆56 · Updated this week
- Combine XPath, CSS Selectors and JSONPath for Web data extracting. ☆29 · Dec 25, 2024 · Updated last year
- Gibsonify: Collect nutritional data using Gibson's method! ☆11 · Oct 28, 2023 · Updated 2 years ago
- This repository contains content related to 2D and 3D lane detection, as well as video lane detection. There are not only papers here, bu… ☆13 · Sep 1, 2024 · Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages. ☆35 · Feb 5, 2026 · Updated 3 weeks ago
- ☆72 · May 22, 2023 · Updated 2 years ago
- Feste is a free and open-source framework allowing scalable composition of NLP tasks using a graph execution model that is optimized and … ☆43 · Mar 19, 2023 · Updated 2 years ago
- fMRIPrep for Rodents ☆13 · Feb 17, 2026 · Updated last week
- Camera cover / overlay / background app on Android ☆32 · Dec 24, 2016 · Updated 9 years ago
- A Chrome extension for quick and compact access to your bookmarks. ☆10 · Jun 3, 2017 · Updated 8 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation ☆14 · Jul 30, 2025 · Updated 7 months ago
- Browser extension to auto close blacklisted tabs ☆14 · Updated this week
- A Java Entity-Component-System game engine. ☆11 · Dec 24, 2019 · Updated 6 years ago
- Minimalist library for LLM usage ☆13 · Sep 7, 2025 · Updated 5 months ago
- ☆17 · Feb 23, 2026 · Updated last week
- Causality in Knowledge Graphs ☆11 · Oct 12, 2022 · Updated 3 years ago
- A python implementation of the sinaplot using matplotlib and seaborn ☆11 · Jun 5, 2018 · Updated 7 years ago
- Common Crawl fork of Apache Nutch ☆40 · Updated this week
- a free and secure peer to peer meeting application ☆19 · Sep 17, 2022 · Updated 3 years ago
- Peer-to-peer NATS message routing and S3 object sync solution ☆18 · Feb 5, 2026 · Updated 3 weeks ago
- FiberNavigator - Maxime Chamberland - ☆10 · Feb 9, 2022 · Updated 4 years ago
- COMET for African languages ☆10 · Jan 24, 2025 · Updated last year
- Canadian threat feeds updated every 12 hours. ☆20 · Updated this week
- Terminal tool that converts files encoding to UTF-8 ☆10 · Oct 5, 2019 · Updated 6 years ago
- A fast TUI application (with optional webui) to visually navigate and inspect JSON and JSONL data. Easily localize parse errors in large … ☆15 · Sep 30, 2024 · Updated last year
- Vtabs provide the vertical tabs for the chrome browser. ☆11 · Aug 12, 2024 · Updated last year