raphaelsty / cherche
Neural Search
☆329 Updated 11 months ago
Alternatives and similar repositories for cherche:
Users who are interested in cherche are comparing it to the libraries listed below.
- Neural Search ☆355 Updated last month
- Blazing fast framework for fine-tuning similarity learning models ☆657 Updated 3 weeks ago
- Full text search that feels like a numpy array ☆236 Updated 2 weeks ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆311 Updated last week
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆333 Updated last year
- just a bunch of useful embeddings for scikit-learn pipelines ☆497 Updated last month
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 Updated 2 years ago
- 📄 ⚙️ ETL processes for medical and scientific papers ☆383 Updated 2 weeks ago
- Labelling platform for text using weak supervision. ☆262 Updated 2 years ago
- SpanMarker for Named Entity Recognition ☆428 Updated 3 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆245 Updated last year
- Gain clues from clustering! ☆313 Updated 9 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine ☆242 Updated last year
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆220 Updated 2 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components ☆885 Updated last year
- Late Interaction Models Training & Retrieval ☆306 Updated this week
- A python package for benchmarking interpretability techniques on Transformers. ☆212 Updated 7 months ago
- Information extraction from English and German texts based on predicate logic ☆135 Updated last year
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3 ☆322 Updated last year
- More interactive weak supervision with FlyingSquid ☆315 Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆153 Updated 11 months ago
- Few-shot Named Entity Recognition ☆123 Updated 3 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆122 Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆199 Updated last week
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆922 Updated 8 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆108 Updated 11 months ago
- Doubt your data, find bad labels. ☆511 Updated 9 months ago
- ⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍 ☆546 Updated 10 months ago
- A Simple Bulk Labelling Tool ☆576 Updated 4 months ago
- A Python library for calculating a large variety of metrics from text ☆337 Updated 4 months ago