THUDM / GLM-Edge
GLM Series Edge Models
☆121 Updated last week
Alternatives and similar repositories for GLM-Edge:
Users who are interested in GLM-Edge are comparing it to the libraries listed below.
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data. ☆147 Updated last week
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought ☆188 Updated last week
- SUS-Chat: Instruction tuning done right ☆48 Updated 11 months ago
- ☆193 Updated 3 weeks ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP ☆58 Updated 6 months ago
- Mixture-of-Experts (MoE) Language Model ☆184 Updated 4 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation. ☆97 Updated this week
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆127 Updated 6 months ago
- Delta-CoMe can achieve near-lossless 1-bit compression; it has been accepted by NeurIPS 2024. ☆52 Updated last month
- We are the first fully commercially usable role-play large model. ☆37 Updated 4 months ago
- ☆160 Updated 3 weeks ago
- ☆78 Updated 8 months ago
- As the name suggests: a hand-rolled RAG ☆116 Updated 10 months ago
- It's an open-source LLM based on an MoE structure. ☆57 Updated 6 months ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2 ☆108 Updated last month
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train… ☆54 Updated 8 months ago
- ☆36 Updated 2 months ago
- A light proxy solution for HuggingFace hub. ☆46 Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM. ☆36 Updated 4 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆201 Updated last month
- ☆155 Updated last month
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” ("ModelBest's little steel cannon") focuses on achi… ☆168 Updated 2 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including … ☆185 Updated 2 weeks ago
- ☆84 Updated last month
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊 ☆256 Updated 2 months ago
- ☆219 Updated 8 months ago
- zero: zero-training LLM parameter tuning ☆31 Updated last year
- The official codes for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning" ☆259 Updated 8 months ago
- Repo for Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization" ☆38 Updated this week
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc. ☆132 Updated 9 months ago