explosion / spacy-llm
🦙 Integrating LLMs into structured NLP pipelines
☆1,213 · Updated 2 months ago
Alternatives and similar repositories for spacy-llm:
Users who are interested in spacy-llm compare it to the libraries listed below.
- SpanMarker for Named Entity Recognition ☆422 · Updated 2 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy ☆1,063 · Updated last week
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024 ☆1,861 · Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,321 · Updated last month
- ☆357 · Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components ☆884 · Updated 11 months ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3 ☆322 · Updated last year
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,381 · Updated this week
- A tiny library for coding with large language models. ☆1,225 · Updated 8 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,338 · Updated last month
- Efficient few-shot learning with Sentence Transformers ☆2,415 · Updated 2 months ago
- ☆497 · Updated 7 months ago
- Efficient Retrieval Augmentation and Generation Framework ☆1,489 · Updated 2 months ago
- SGPT: GPT Sentence Embeddings for Semantic Search ☆864 · Updated last year
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings ☆1,921 · Updated 2 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆414 · Updated last year
- Easily embed, cluster and semantically label text datasets ☆516 · Updated 11 months ago
- Open-source tool to visualise your RAG 🔮 ☆1,115 · Updated 2 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆1,740 · Updated 3 weeks ago
- An LLM-powered advanced RAG pipeline built from scratch ☆830 · Updated last year
- ☆449 · Updated last year
- Fine-Tuning Embedding for RAG with Synthetic Data ☆487 · Updated last year
- Enforce the output format (JSON Schema, Regex etc) of a language model ☆1,736 · Updated 3 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,299 · Updated 2 weeks ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23) ☆3,271 · Updated 4 months ago
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤 ☆990 · Updated last month
- Guideline following Large Language Model for Information Extraction ☆354 · Updated 4 months ago
- ☆761 · Updated last year
- LLM(😽) ☆1,661 · Updated last month
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆1,882 · Updated this week