zafercavdar / fasttext-langdetect
80x faster and 95% accurate language identification with Fasttext
☆162 Updated last year
Alternatives and similar repositories for fasttext-langdetect
Users who are interested in fasttext-langdetect are comparing it to the libraries listed below.
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆212 Updated 4 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023 ☆160 Updated 3 months ago
- Simply, faster, sentence-transformers ☆143 Updated last year
- ⚡️ 80x faster Fasttext language detection out of the box | Split text by language ☆233 Updated 5 months ago
- A Python Search Engine for Humans 🥸 ☆232 Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆253 Updated 2 years ago
- Efficient few-shot learning with cross-encoders. ☆58 Updated last year
- Python API for https://vespa.ai, the open big data serving engine ☆141 Updated this week
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆154 Updated 2 years ago
- ☆173 Updated 5 months ago
- The pipeline for the OSCAR corpus ☆171 Updated last year
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch ☆78 Updated last month
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio… ☆44 Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling. ☆61 Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆80 Updated 2 years ago
- Completion After Prompt Probability. Make your LLM make a choice ☆80 Updated 10 months ago
- ☆367 Updated last year
- BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages ☆228 Updated last year
- multimodal document analysis ☆166 Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆156 Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆172 Updated 3 months ago
- ☆110 Updated 9 months ago
- 🔢 Work with static vector models ☆29 Updated 4 months ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆179 Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆126 Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- Guideline following Large Language Model for Information Extraction ☆395 Updated 10 months ago
- Generalist and Lightweight Model for Text Classification ☆157 Updated 3 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆189 Updated last year
- Few-shot Named Entity Recognition ☆123 Updated 3 years ago