zafercavdar / fasttext-langdetect
80x faster and 95% accurate language identification with Fasttext
☆152Updated last year
Alternatives and similar repositories for fasttext-langdetect:
Users that are interested in fasttext-langdetect are comparing it to the libraries listed below
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language☆189Updated 3 weeks ago
- Simply, faster, sentence-transformers☆141Updated 7 months ago
- Efficient few-shot learning with cross-encoders.☆51Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆197Updated 6 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆127Updated 4 months ago
- Python API for https://vespa.ai, the open big data serving engine☆121Updated this week
- Generalist and Lightweight Model for Text Classification☆121Updated 2 weeks ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆153Updated 11 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆51Updated 3 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆154Updated 5 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- A Python Search Engine for Humans 🥸☆216Updated last year
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆122Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆151Updated last year
- A multilingual version of MS MARCO passage ranking dataset☆144Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated 11 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆174Updated 7 months ago
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆77Updated this week
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated 11 months ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆106Updated last year
- RaKUn 2.0 - A fast keyword detection algorithm☆67Updated this week
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extraction…☆104Updated 10 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆130Updated 4 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆244Updated 2 years ago
- Targetted language identifier, based on FastText and Hunspell.☆34Updated 2 months ago
- Streamlit Named Entity Recognition (NER) annotation custom component☆38Updated 2 years ago
- ☆67Updated 4 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆124Updated 3 months ago