xlang-ai / instructor-embeddingLinks

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

☆1,990

Alternatives and similar repositories for instructor-embedding

Users that are interested in instructor-embedding are comparing it to the libraries listed below

Sorting:

Muennighoff / sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
☆869Updated last year
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆952Updated 9 months ago
IntelLabs / fastRAG
Efficient Retrieval Augmentation and Generation Framework
☆1,599Updated 6 months ago
MeetKai / functionary
Chat language model that can use tools and interpret the results
☆1,574Updated this week
gururise / AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
☆1,562Updated 2 years ago
jondurbin / airoboros
Customizable implementation of the self-instruct paper.
☆1,048Updated last year
noamgat / lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
☆1,858Updated 5 months ago
explosion / spacy-llm
🦙 Integrating LLMs into structured NLP pipelines
☆1,289Updated 6 months ago
stanford-futuredata / ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
☆3,511Updated last week
srush / MiniChain
A tiny library for coding with large language models.
☆1,235Updated last year
yaodongC / awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
☆1,126Updated last year
beir-cellar / beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆1,894Updated last month
jzbjyb / FLARE
Forward-Looking Active REtrieval-augmented generation (FLARE)
☆643Updated last year
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,536Updated last year
eyurtsev / kor
LLM(😽)
☆1,682Updated 5 months ago
embeddings-benchmark / mteb
MTEB: Massive Text Embedding Benchmark
☆2,727Updated this week
abertsch72 / unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
☆1,062Updated last year
texttron / hyde
HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels
☆539Updated 7 months ago
lucidrains / toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
☆2,041Updated last year
google-research / FLAN
☆1,529Updated 3 weeks ago
bigscience-workshop / promptsource
Toolkit for creating, sharing and using natural language prompts.
☆2,910Updated last year
AkariAsai / self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…
☆2,147Updated last year
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆821Updated 2 years ago
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year
lupantech / chameleon-llm
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
☆1,134Updated last year
keirp / automatic_prompt_engineer
☆1,285Updated last year
marella / ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library.
☆1,871Updated last year
FranxYao / chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,742Updated 11 months ago
young-geng / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆2,491Updated 11 months ago
naver / splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
☆877Updated last year