pengr / LLM-Synthetic-Data
Real-time, fine-grained reading list on LLM-synthetic-data.🔥
☆194Updated last week
Alternatives and similar repositories for LLM-Synthetic-Data:
Users that are interested in LLM-Synthetic-Data are comparing it to the libraries listed below
- ☆161Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆328Updated 4 months ago
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆283Updated 5 months ago
- A series of technical report on Slow Thinking with LLM☆297Updated last week
- ☆128Updated 9 months ago
- 大模型多维度中文对齐评测基准 (ACL 2024)☆353Updated 5 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆230Updated 2 months ago
- ☆221Updated 8 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆236Updated last year
- The related works and background techniques about Openai o1☆192Updated last week
- 怎么训练一个LLM分词器☆137Updated last year
- ☆432Updated 2 weeks ago
- ☆120Updated 11 months ago
- ☆97Updated 6 months ago
- Awesome papers for role-playing with language models☆146Updated 2 months ago
- ☆95Updated 2 months ago
- ☆247Updated 5 months ago
- LLaMA Factory Document☆84Updated last month
- 使用单个24G显卡,从0开始训练LLM☆50Updated 2 months ago
- This is the repository for the Tool Learning survey.☆290Updated 2 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆205Updated this week
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆208Updated 3 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆182Updated last week
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆237Updated last month
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models☆261Updated 2 months ago
- ☆317Updated 6 months ago
- ☆136Updated 6 months ago
- Train a 1B LLM with 1T tokens from scratch by personal☆469Updated this week
- ☆159Updated last year
- an intro to retrieval augmented large language model☆271Updated last year