THUDM / GLM-Edge
GLM Series Edge Models
☆137Updated 2 months ago
Alternatives and similar repositories for GLM-Edge:
Users that are interested in GLM-Edge are comparing it to the libraries listed below
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆72Updated 10 months ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆219Updated last week
- Mixture-of-Experts (MoE) Language Model☆186Updated 8 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 4 months ago
- SUS-Chat: Instruction tuning done right☆48Updated last year
- ☆225Updated 2 months ago
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆57Updated 5 months ago
- 我们是第一个完全可商用的角色大模型。☆40Updated 8 months ago
- Its an open source LLM based on MOE Structure.☆58Updated 10 months ago
- 使用langchain进行任务规划,构建子任务的会话场景资源,通过MCTS任务执行器,来让每个子任务通过在上下文中资源,通过自身反思探索来获取自身对问题的最优答案;这种方式依赖模型的对齐偏好,我们在每种偏好上设计了一个工程框架,来完成自我对不同答案的奖励进行采样策略☆29Updated this week
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 10 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆232Updated 2 months ago
- ☆38Updated 6 months ago
- ☆78Updated last year
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆51Updated this week
- ☆173Updated 3 months ago
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆119Updated this week
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆50Updated 2 weeks ago
- Imitate OpenAI with Local Models☆88Updated 8 months ago
- ☆227Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆36Updated last week
- ☆75Updated last month
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 8 months ago
- 顾名思义:手搓的RAG☆122Updated last year
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆34Updated 9 months ago
- ☆310Updated 4 months ago
- A light proxy solution for HuggingFace hub.☆47Updated last year
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆58Updated last month
- ☆29Updated last year