xlang-ai / instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
☆1,921 · Updated 2 months ago
Alternatives and similar repositories for instructor-embedding:
Users who are interested in instructor-embedding are comparing it to the libraries listed below:
- SGPT: GPT Sentence Embeddings for Semantic Search ☆864 · Updated last year
- Efficient Retrieval Augmentation and Generation Framework ☆1,489 · Updated 2 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆922 · Updated 4 months ago
- MTEB: Massive Text Embedding Benchmark ☆2,315 · Updated this week
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆820 · Updated last year
- Customizable implementation of the self-instruct paper. ☆1,039 · Updated last year
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,450 · Updated 11 months ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting ☆2,694 · Updated 7 months ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23) ☆3,271 · Updated 4 months ago
- A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca) ☆1,112 · Updated last year
- A tiny library for coding with large language models. ☆1,225 · Updated 8 months ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,852 · Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆690 · Updated 11 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆2,459 · Updated 7 months ago
- Forward-Looking Active REtrieval-augmented generation (FLARE) ☆617 · Updated last year
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆1,740 · Updated 3 weeks ago
- HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels ☆519 · Updated 3 months ago
- Alpaca dataset from Stanford, cleaned and curated ☆1,544 · Updated last year
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models". ☆1,122 · Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆709 · Updated last year
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input" ☆1,059 · Updated last year
- LLM(😽) ☆1,661 · Updated last month
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,… ☆2,005 · Updated 9 months ago
- Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard ☆524 · Updated this week
- Fine-Tuning Embedding for RAG with Synthetic Data ☆487 · Updated last year
- Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award] ☆581 · Updated last year
- SPLADE: sparse neural search (SIGIR21, SIGIR22) ☆831 · Updated 10 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆1,762 · Updated 7 months ago
- prompt2model - Generate Deployable Models from Natural Language Instructions ☆1,983 · Updated 2 months ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs ☆3,888 · Updated this week