raphaelsty / cherche
Neural Search
β327Updated 9 months ago
Alternatives and similar repositories for cherche:
Users that are interested in cherche are comparing it to the libraries listed below
- Neural Searchβ351Updated this week
- Labelling platform for text using weak supervision.β260Updated 2 years ago
- πΈ fastText + Bloom embeddings for compact, full-coverage vectors with spaCyβ309Updated last year
- π βοΈ ETL processes for medical and scientific papersβ376Updated 2 months ago
- just a bunch of useful embeddings for scikit-learn pipelinesβ483Updated last month
- Full text search in your Pandas dataframeβ220Updated 3 months ago
- β¨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3β322Updated last year
- Gain clues from clustering!β313Updated 7 months ago
- More interactive weak supervision with FlyingSquidβ315Updated 4 years ago
- Blazing fast framework for fine-tuning similarity learning modelsβ656Updated 2 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engineβ242Updated last year
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entiβ¦β244Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasksβ923Updated 6 months ago
- Active Learning for Text Classification in Pythonβ606Updated last week
- π Datasets and models for instruction-tuningβ234Updated last year
- Late Interaction Models Training & Retrievalβ254Updated this week
- SpikeX - SpaCy Pipes for Knowledge Extractionβ397Updated 3 years ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)β156Updated last year
- π€ A PyTorch library of curated Transformer models and their composable componentsβ882Updated 10 months ago
- SpanMarker for Named Entity Recognitionβ421Updated 2 months ago
- Doubt your data, find bad labels.β509Updated 7 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: β¦β330Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)β151Updated 2 years ago
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ189Updated 5 months ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.β219Updated 2 years ago
- A Python framework for performing information retrieval experiments, building on http://terrier.org/β438Updated last week
- Few-shot Named Entity Recognitionβ123Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further langβ¦β121Updated 10 months ago
- Information extraction from English and German texts based on predicate logicβ135Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.β151Updated 9 months ago