GLM Series Edge Models
☆162Jun 12, 2025Updated 10 months ago
Alternatives and similar repositories for GLM-Edge
Users that are interested in GLM-Edge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆82Jul 4, 2024Updated last year
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆279Apr 23, 2026Updated last week
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- ☆192Mar 13, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- GLM-4-Voice | 端到端中英语音对话模型☆3,178Dec 5, 2024Updated last year
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆273Aug 6, 2025Updated 8 months ago
- GPTQ inference TVM kernel☆40Apr 25, 2024Updated 2 years ago
- An open-sourced end-to-end VLM-based GUI Agent☆1,174Apr 4, 2025Updated last year
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Jul 19, 2025Updated 9 months ago
- Strong and Open Vision Language Assistant for Mobile Devices☆1,350Apr 15, 2024Updated 2 years ago
- Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…☆1,575Jun 14, 2025Updated 10 months ago
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆7,076Jul 4, 2025Updated 9 months ago
- ✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction☆2,508Mar 28, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆323Sep 18, 2024Updated last year
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 3 years ago
- a benckmark for evaluating logical reasoning of LLMs☆23Jan 25, 2024Updated 2 years ago
- ☆53Oct 29, 2024Updated last year
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆244Sep 30, 2024Updated last year
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities☆1,180Jul 15, 2025Updated 9 months ago
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,437Mar 3, 2025Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- 【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models☆2,316Jul 15, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆307Jul 1, 2025Updated 10 months ago
- MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks☆8,847Feb 11, 2026Updated 2 months ago
- ☆18Dec 7, 2023Updated 2 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- gradio bbox labeling tools☆11May 12, 2023Updated 2 years ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 11 months ago
- CogView4, CogView3-Plus and CogView3(ECCV 2024)☆1,101Mar 29, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Web app for makeup transfer using Stable Diffusion☆10Sep 11, 2023Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- GGUF parser in Python☆28Aug 15, 2024Updated last year
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Sep 22, 2024Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- ☆23Jan 29, 2026Updated 3 months ago
- ☆20Aug 13, 2024Updated last year