THUDM / CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
☆8,244Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for CodeGeeX
- GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)☆7,659Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆37,015Updated this week
- 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)☆18,415Updated 6 months ago
- ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.☆9,431Updated 4 months ago
- Home of StarCoder: fine-tuning & inference!☆7,327Updated 8 months ago
- ModelScope: bring the notion of Model-as-a-Service to life.☆7,029Updated this week
- CodeGeeX2: A More Powerful Multilingual Code Generation Model☆7,637Updated 4 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,270Updated 3 months ago
- ☆9,008Updated 7 months ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆7,933Updated last month
- An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.☆8,277Updated this week
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆12,676Updated this week
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,170Updated 8 months ago
- Making large AI models cheaper, faster and more accessible☆38,825Updated this week
- Universal LLM Deployment Engine with ML Compilation☆19,231Updated this week
- Instruct-tune LLaMA on consumer hardware☆18,657Updated 3 months ago
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,432Updated 2 months ago
- High-performance In-browser LLM Inference Engine☆13,700Updated last week
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆29,567Updated 4 months ago
- The AI Code Editor☆25,429Updated last month
- Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca☆4,146Updated last week
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf☆23,720Updated last month
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,675Updated 4 months ago
- ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型☆40,717Updated 4 months ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆14,241Updated last week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆35,538Updated this week
- The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.☆21,073Updated 4 months ago
- ☆34,548Updated 10 months ago
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆9,839Updated this week