ziwang-com / zero-lora
zero零训练llm调参
☆30Updated last year
Related projects ⓘ
Alternatives and complementary repositories for zero-lora
- 基于baichuan-7b的开源多模态大语言模型☆72Updated 11 months ago
- SUS-Chat: Instruction tuning done right☆47Updated 9 months ago
- 全球首个StableVicuna中文优化版。☆65Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆132Updated 7 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- A dataset template for guiding chat-models to self-cognition, including information about the model’s identity, capabilities, usage, limi…☆25Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 6 months ago
- Light local website for displaying performances from different chat models.☆85Updated 11 months ago
- GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。☆89Updated last year
- A light proxy solution for HuggingFace hub.☆44Updated last year
- ☆28Updated 2 months ago
- 演示 vllm 对中文大语言模型的神奇效果☆31Updated last year
- Imitate OpenAI with Local Models☆85Updated 2 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆36Updated 6 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆25Updated 5 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆57Updated 4 months ago
- 旨在对当前主流LLM进行一个直观、具体、标准的评测☆92Updated last year
- 文本去重☆67Updated 5 months ago
- ☆72Updated 10 months ago
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆85Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆121Updated 10 months ago
- 中文原生检索增强生成测评基准☆98Updated 6 months ago
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆89Updated last year
- ☆173Updated last year
- large language model training-3-stages+deployment☆46Updated last year
- A more efficient GLM implementation!☆55Updated last year
- 骆驼QA,中文大语言阅读理解模型。☆72Updated last year
- Just for debug☆56Updated 8 months ago