run-llama / finetune-embedding
Fine-Tuning Embedding for RAG with Synthetic Data
☆494 Updated last year
Alternatives and similar repositories for finetune-embedding:
Users who are interested in finetune-embedding are comparing it to the libraries listed below.
- ☆766 Updated last year
- ☆451 Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆417 Updated last year
- Automated Evaluation of RAG Systems ☆576 Updated 2 weeks ago
- 🦜💯 Flex those feathers! ☆244 Updated 5 months ago
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk ☆293 Updated this week
- ☆868 Updated 5 months ago
- ☆496 Updated 7 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆319 Updated 5 months ago
- Forward-Looking Active REtrieval-augmented generation (FLARE) ☆622 Updated last year
- Generate textbook-quality synthetic LLM pretraining data ☆498 Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆432 Updated 2 weeks ago
- ☆310 Updated last year
- Efficient Retrieval Augmentation and Generation Framework ☆1,513 Updated 3 months ago
- This repository implements the chain of verification paper by Meta AI ☆168 Updated last year
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning ☆304 Updated 5 months ago
- Easily embed, cluster and semantically label text datasets ☆522 Updated last year
- ☆257 Updated last year
- Automatically evaluate your LLMs in Google Colab ☆614 Updated 11 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,375 Updated 2 weeks ago
- Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award] ☆589 Updated last year
- Open-source tool to visualise your RAG 🔮 ☆1,121 Updated 3 months ago
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro… ☆783 Updated 4 months ago
- Building a chatbot powered with a RAG pipeline to read, summarize and quote the most relevant papers related to the user query. ☆166 Updated 11 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆693 Updated last year
- Code for explaining and evaluating late chunking (chunked pooling) ☆366 Updated 3 months ago
- A joint community effort to create one central leaderboard for LLMs. ☆294 Updated 7 months ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT. ☆764 Updated last month
- data cleaning and curation for unstructured text ☆329 Updated 8 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆162 Updated 6 months ago