run-llama / finetune-embeddingLinks

Fine-Tuning Embedding for RAG with Synthetic Data

☆518

Alternatives and similar repositories for finetune-embedding

Users that are interested in finetune-embedding are comparing it to the libraries listed below

Sorting:

Raudaschl / rag-fusion
☆902Updated last year
h2oai / h2o-wizardlm
Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning
☆309Updated last year
philschmid / easyllm
☆468Updated last year
arcee-ai / DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
☆332Updated last year
run-llama / modal_finetune_sql
☆320Updated 2 years ago
langchain-ai / langchain-benchmarks
🦜💯 Flex those feathers!
☆255Updated last year
langchain-ai / auto-evaluator
☆778Updated 5 months ago
texttron / hyde
HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels
☆560Updated 11 months ago
KarelDO / xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
☆443Updated last year
stanford-futuredata / ARES
Automated Evaluation of RAG Systems
☆674Updated 8 months ago
finic-ai / doctran
☆508Updated last year
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆507Updated 2 years ago
wandb / wandbot
wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk
☆310Updated last month
ritun16 / chain-of-verification
This repository implements the chain of verification paper by Meta AI
☆181Updated 2 years ago
rajshah4 / LLM-Evaluation
Sample notebooks and prompts for LLM evaluation
☆156Updated last month
huggingface / text-clustering
Easily embed, cluster and semantically label text datasets
☆584Updated last year
deepset-ai / haystack-tutorials
Here you can find all the Tutorials for Haystack 📓
☆350Updated this week
BatsResearch / bonito
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
☆802Updated 4 months ago
castorini / rank_llm
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
☆555Updated this week
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆671Updated last year
langchain-ai / text-split-explorer
☆267Updated 2 years ago
gabrielchua / RAGxplorer
Open-source tool to visualise your RAG 🔮
☆1,199Updated 10 months ago
jzbjyb / FLARE
Forward-Looking Active REtrieval-augmented generation (FLARE)
☆658Updated 2 years ago
AymenKallala / RAG_Maestro
Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.
☆167Updated last year
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆179Updated last year
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆630Updated last year
LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆306Updated last year
CYQIQ / MultiCoT
Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph
☆147Updated last year
jina-ai / late-chunking
Code for explaining and evaluating late chunking (chunked pooling)
☆469Updated 11 months ago
cohere-ai / notebooks
Code examples and jupyter notebooks for the Cohere Platform
☆506Updated 10 months ago