raphaelsty / chercheLinks
Neural Search
☆334Updated last year
Alternatives and similar repositories for cherche
Users that are interested in cherche are comparing it to the libraries listed below
Sorting:
- Neural Search☆365Updated 8 months ago
- Labelling platform for text using weak supervision.☆262Updated 3 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆326Updated 7 months ago
- Blazing fast framework for fine-tuning similarity learning models☆661Updated last month
- Full text search that feels like a numpy array☆265Updated last month
- More interactive weak supervision with FlyingSquid☆316Updated 5 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆244Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆520Updated 2 months ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆338Updated 2 years ago
- 📄 ⚙️ ETL processes for medical and scientific papers☆401Updated 4 months ago
- 📚 Datasets and models for instruction-tuning☆238Updated 2 years ago
- Gain clues from clustering!☆318Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- OCR, Archive, Index and Search: Implementation agnostic OCR framework.☆223Updated 2 years ago
- SpanMarker for Named Entity Recognition☆463Updated 10 months ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆222Updated 2 years ago
- Information extraction from English and German texts based on predicate logic☆139Updated 2 years ago
- Few-shot Named Entity Recognition☆123Updated 3 years ago
- A word2vec negative sampling implementation with correct CBOW update.☆261Updated 4 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆129Updated last year
- A Simple Bulk Labelling Tool☆599Updated 4 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- ⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍☆624Updated 3 months ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆893Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- Prompt programming with FMs.☆444Updated last year
- The world's largest social media toxicity dataset.☆187Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year