cavalierlulu / rag_surveyLinks
☆124Updated last year
Alternatives and similar repositories for rag_survey
Users that are interested in rag_survey are comparing it to the libraries listed below
Sorting:
- ☆144Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆244Updated 8 months ago
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆127Updated last year
- ☆178Updated 3 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆251Updated this week
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆197Updated 6 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆160Updated last year
- A curated list of resources dedicated to retrieval-augmented generation (RAG).☆116Updated 2 weeks ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- 怎么训练一个LLM分词器☆151Updated 2 years ago
- ☆142Updated last year
- A Toolkit for Table-based Question Answering☆112Updated last year
- an intro to retrieval augmented large language model☆297Updated last year
- ☆172Updated last year
- ☆48Updated last year
- Fantastic Data Engineering for Large Language Models☆89Updated 6 months ago
- A live reading list for LLM-synthetic-data.☆307Updated this week
- Collection of training data management explorations for large language models☆327Updated 11 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆179Updated 10 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆162Updated 2 weeks ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆85Updated 8 months ago
- ☆96Updated last year
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆44Updated last month
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆64Updated last year
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆81Updated 10 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆140Updated 2 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆263Updated last year
- ☆162Updated 2 years ago
- Counting-Stars (★)☆83Updated last month
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆89Updated last year