LlmKira/fast-langdetect

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LlmKira/fast-langdetect)

LlmKira / fast-langdetect

⚡️ 80x faster Fasttext language detection out of the box | Split text by language

☆318

Alternatives and similar repositories for fast-langdetect

Users that are interested in fast-langdetect are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zafercavdar / fasttext-langdetect
View on GitHub
80x faster and 95% accurate language identification with Fasttext
☆171May 26, 2026Updated 2 months ago
pemistahl / lingua-py
View on GitHub
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
☆1,769Jul 20, 2026Updated last week
tensorchord / qtext
View on GitHub
☆19Apr 11, 2024Updated 2 years ago
BeautyyuYanli / GPT-SoVITS-Infer
View on GitHub
The inference code of RVC-Boss/GPT-SoVITS that can be developer-friendly.
☆16Sep 29, 2024Updated last year
opendatalab / UniMERNet
View on GitHub
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
☆494Sep 28, 2025Updated 10 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
opendatalab / magic-html
View on GitHub
☆541May 13, 2026Updated 2 months ago
pypdfium2-team / pypdfium2
View on GitHub
Python bindings to PDFium, reasonably cross-platform.
☆803Updated this week
cisnlp / GlotLID
View on GitHub
[EMNLP 2023] 💬 Language Identification with Support for More Than 2000 Labels
☆214Apr 15, 2026Updated 3 months ago
yujunhuics / LayoutReader
View on GitHub
阅读顺序、Layoutreader
☆18May 8, 2025Updated last year
IBM / fastfit
View on GitHub
FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes
☆220Sep 18, 2025Updated 10 months ago
opendatalab / DocLayout-YOLO
View on GitHub
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,239Apr 14, 2025Updated last year
jimexist / surya-rs
View on GitHub
Rust implementation of Surya
☆66Mar 1, 2025Updated last year
BeautyyuYanli / tooluser
View on GitHub
Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)
☆58May 27, 2025Updated last year
superlinear-ai / wtpsplit-lite
View on GitHub
✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models
☆39May 2, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tensorchord / vechord
View on GitHub
Turn PostgreSQL into your search engine in a Pythonic way.
☆51Aug 29, 2025Updated 11 months ago
BeautyyuYanli / Prompt-Bottle
View on GitHub
A powerful prompt template engine built upon Jinja
☆12Oct 22, 2025Updated 9 months ago
InternScience / StructEqTable-Deploy
View on GitHub
A High-efficiency Open-source Toolkit for Table-to-Latex Task
☆276Dec 6, 2025Updated 7 months ago
Mimino666 / langdetect
View on GitHub
Port of Google's language-detection library to Python.
☆1,898Mar 3, 2025Updated last year
lilingxi01 / nougat-replication
View on GitHub
A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…
☆11Dec 11, 2023Updated 2 years ago
xhluca / bm25s
View on GitHub
Fast BM25 search in Python, powered by Numpy and Numba
☆1,751Jul 22, 2026Updated last week
bjoernpl / lm-evaluation-harness-de
View on GitHub
A framework for few-shot evaluation of autoregressive language models.
☆13Feb 14, 2024Updated 2 years ago
Shekswess / synthgenai
View on GitHub
SynthGenAI - Package for Generating Synthetic Datasets using LLMs.
☆56Nov 24, 2025Updated 8 months ago
wey-gu / chinese-graph
View on GitHub
中文成语图谱，一个可以用来每天解谜汉兜 https://handle.antfu.me 的中文成语、汉字、读音图谱构建工具。
☆28Jun 2, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
sieve-community / describe
View on GitHub
Incredibly descriptive audiovisual summaries for videos
☆40Aug 2, 2024Updated last year
Knowledgator / FlashDeBERTa
View on GitHub
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆90Feb 10, 2026Updated 5 months ago
abersheeran / mywxmp
View on GitHub
我的微信公众号 ”aber的个人号“
☆12May 9, 2024Updated 2 years ago
rewicks / ersatz
View on GitHub
☆51Jul 25, 2024Updated 2 years ago
datalab-to / pdftext
View on GitHub
Extract structured text from pdfs quickly
☆710Jul 8, 2026Updated 3 weeks ago
cofin / litestar-socketify
View on GitHub
Socketify plugin for Litestar
☆11Oct 8, 2023Updated 2 years ago
jackboyla / GLiREL
View on GitHub
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
☆289Mar 30, 2026Updated 3 months ago
opendatalab / PDF-Extract-Kit
View on GitHub
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,811Jan 3, 2025Updated last year
steinst / SentAlign
View on GitHub
☆38Mar 16, 2026Updated 4 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
thanhnghiadk / syntactic_HME_generation
View on GitHub
This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.
☆14Feb 24, 2022Updated 4 years ago
KRLabsOrg / rulechef
View on GitHub
Learn rule-based models from examples using LLM-powered synthesis. Replace expensive LLM calls with fast, deterministic, inspectable rege…
☆31Jul 10, 2026Updated 2 weeks ago
Sanster / xy-cut
View on GitHub
☆154Jul 12, 2022Updated 4 years ago
bxb100 / klingai
View on GitHub
☆10Sep 5, 2024Updated last year
datalab-to / surya
View on GitHub
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,176Jul 23, 2026Updated last week
noooop / wde
View on GitHub
Workflow Defined Engine
☆25Nov 4, 2025Updated 8 months ago
fakerybakery / simpletts
View on GitHub
A lightweight Python library for running TTS models with a unified API.
☆20Feb 18, 2025Updated last year