01-ai / Yi
A series of large language models trained from scratch by developers @01-ai
☆7,598Updated last week
Related projects: ⓘ
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆13,305Updated 2 weeks ago
- Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)☆30,812Updated this week
- Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.☆7,468Updated this week
- Official release of InternLM2.5 base and chat models. 1M context support☆6,231Updated last week
- High-speed Large Language Model Serving on PCs with Consumer-grade GPUs☆7,877Updated last week
- Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory☆15,611Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆26,822Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆19,294Updated last month
- QLoRA: Efficient Finetuning of Quantized LLMs☆9,906Updated 3 months ago
- Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datase…☆11,582Updated last week
- Retrieval and Retrieval-augmented LLMs☆6,824Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆17,176Updated this week
- ModelScope: bring the notion of Model-as-a-Service to life.☆6,794Updated this week
- DeepSeek Coder: Let the Code Write Itself☆6,530Updated 3 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,194Updated last month
- Question and Answer based on Anything.☆11,376Updated this week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆3,415Updated last month
- Universal LLM Deployment Engine with ML Compilation☆18,642Updated this week
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆12,397Updated 2 weeks ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆17,179Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆15,839Updated this week
- GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)☆7,654Updated last year
- Large Language Model Text Generation Inference☆8,762Updated this week
- Open-Sora: Democratizing Efficient Video Production for All☆21,609Updated last month
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆4,793Updated this week
- AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents☆13,255Updated last week
- a state-of-the-art-level open visual language model | 多模态预训练模型☆5,871Updated 3 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆9,780Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆36,446Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆7,620Updated 4 months ago