zafercavdar / fasttext-langdetect
80x faster and 95% accurate language identification with Fasttext
☆153 · Updated last year
Alternatives and similar repositories for fasttext-langdetect
Users who are interested in fasttext-langdetect are comparing it to the libraries listed below:
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆204 · Updated last week
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language ☆196 · Updated last month
- Simply, faster, sentence-transformers ☆142 · Updated 8 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023 ☆131 · Updated 5 months ago
- Efficient few-shot learning with cross-encoders. ☆51 · Updated last year
- Python API for https://vespa.ai, the open big data serving engine ☆123 · Updated last week
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆154 · Updated 11 months ago
- Generalist and Lightweight Model for Text Classification ☆128 · Updated 2 weeks ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆79 · Updated last year
- SpanMarker for Named Entity Recognition ☆429 · Updated 4 months ago
- Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER) ☆84 · Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆152 · Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- ☆360 · Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts. ☆80 · Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆93 · Updated last year
- multimodal document analysis ☆164 · Updated 11 months ago
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch ☆77 · Updated last week
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆246 · Updated 2 years ago
- Streamlit Named Entity Recognition (NER) annotation custom component ☆38 · Updated 2 years ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆132 · Updated 4 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆108 · Updated last year
- Logical structure analysis for visually structured documents ☆89 · Updated 2 years ago
- Few-shot Named Entity Recognition ☆123 · Updated 3 years ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆177 · Updated 8 months ago
- 📝An easy-to-use package to restore punctuation of the text. ☆115 · Updated 2 years ago
- A small seq2seq punctuator tool based on DistilBERT ☆51 · Updated 4 months ago
- ☆170 · Updated last month
- A Python library aimed at dissecting and augmenting NER training data. ☆58 · Updated 2 years ago
- A Python library for calculating a large variety of metrics from text ☆337 · Updated 5 months ago