run-llama / finetune-embedding
Fine-Tuning Embedding for RAG with Synthetic Data
☆477Updated last year
Alternatives and similar repositories for finetune-embedding:
Users that are interested in finetune-embedding are comparing it to the libraries listed below
- ☆440Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆313Updated 2 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆388Updated 2 weeks ago
- ☆754Updated last year
- Automated Evaluation of RAG Systems☆526Updated 2 months ago
- ☆306Updated last year
- ☆815Updated 2 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆401Updated 11 months ago
- ☆492Updated 4 months ago
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning☆298Updated 2 months ago
- Generate textbook-quality synthetic LLM pretraining data☆492Updated last year
- Easily embed, cluster and semantically label text datasets☆488Updated 9 months ago
- Automatically evaluate your LLMs in Google Colab☆575Updated 8 months ago
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…☆722Updated last month
- HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels☆471Updated last month
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆728Updated 4 months ago
- 🦜💯 Flex those feathers!☆236Updated 2 months ago
- Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]☆556Updated 10 months ago
- Forward-Looking Active REtrieval-augmented generation (FLARE)☆599Updated last year
- This repository implements the chain of verification paper by Meta AI☆160Updated last year
- Best practices for distilling large language models.☆424Updated 11 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,238Updated last month
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆684Updated 9 months ago
- ☆484Updated last month
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆145Updated 9 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆307Updated 3 weeks ago
- awesome synthetic (text) datasets☆253Updated 2 months ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆583Updated last year
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk☆284Updated this week