swtheing / LLM-Performance-Improvement-Paper
☆17Updated last year
Related projects ⓘ
Alternatives and complementary repositories for LLM-Performance-Improvement-Paper
- ☆120Updated 7 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆218Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆67Updated last week
- NTK scaled version of ALiBi position encoding in Transformer.☆67Updated last year
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆107Updated last year
- ☆158Updated last year
- ☆62Updated last year
- 怎么训练一个LLM分词器☆130Updated last year
- ☆129Updated 4 months ago
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评 基准☆78Updated last year
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆53Updated 5 months ago
- ☆125Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆62Updated last year
- 中文 Instruction tuning datasets☆118Updated 7 months ago
- ☆30Updated this week
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 8 months ago
- ☆82Updated last year
- 文本去重☆67Updated 6 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆109Updated 5 months ago
- ☆40Updated 5 months ago
- ☆55Updated 4 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated 8 months ago
- Fantastic Data Engineering for Large Language Models☆51Updated 3 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆126Updated 2 months ago
- ☆89Updated last month
- ☆72Updated 10 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆133Updated 5 months ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆159Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆309Updated 2 months ago
- ☆93Updated 8 months ago