A-baoYang / instruction-finetune-datasets
Collect and maintain high-quality instruction fine-tuning datasets across different domains and languages.
☆19Updated last year
Alternatives and similar repositories for instruction-finetune-datasets:
Users that are interested in instruction-finetune-datasets are comparing it to the libraries listed below
- Finetune LLaMA-7B with Chinese instruction datasets☆137Updated last year
- Methods and examples for fine-tuning LLMs☆74Updated 8 months ago
- Collection of ChatGPT alternatives & LLM tuning methods☆12Updated 2 years ago
- Fine-Tuning LLM and embedding models☆27Updated last year
- just collections about Llama2☆44Updated 6 months ago
- ☆51Updated 8 months ago
- Evaluation for AI apps and agents☆36Updated last year
- A Traditional-Chinese instruction-following model with datasets based on Alpaca.☆136Updated 2 years ago
- Code implementation repository for the paper HiQA☆99Updated 3 weeks ago
- Tutorials from AutoGen Basics to Use Cases☆29Updated last year
- Fine-tune Llama 2 with a Traditional Chinese dataset☆38Updated last year
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆43Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆131Updated 9 months ago
- kimi-chat test data☆7Updated last year
- Code and data repo for "DiagGPT"☆47Updated 11 months ago
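- Embedding model evaluation using Traditional Chinese datasets☆45Updated 8 months ago
- Chinese version of CodeLLaMA - a code generation assistant, with 20,000+ cumulative downloads on Hugging Face☆45Updated last year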
- FuseAI Project☆84Updated 2 months ago
- Architecture diagram of the AGI module library☆75Updated last year
- Fine-tuning Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE☆85Updated last year
- ☆49Updated 8 months ago
- Langport is a language model inference service☆94Updated 6 months ago
- ☆83Updated last year
- Fine-Tune LLM Synthetic-Data application and "From Data to AGI: Unlocking the Secrets of Large Language Model"☆16Updated 8 months ago
- Deploy a lightweight, full OpenAI API for production with vLLM, supporting /v1/embeddings with all embedding models.☆41Updated 8 months ago
- ☆74Updated 11 months ago
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆53Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆84Updated 2 weeks ago
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, sample code base implemented on the Kaggle platform☆20Updated last month
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆38Updated last week