Another ChatGLM2 implementation for GPTQ quantization
☆55Oct 15, 2023Updated 2 years ago
Alternatives and similar repositories for chatglm-q
Users that are interested in chatglm-q are comparing it to the libraries listed below.
- ☆90Jun 30, 2023Updated 2 years ago
- Run ChatGLM2-6B on BM1684X☆49Mar 1, 2024Updated 2 years ago
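- An implementation of a new-word discovery algorithm based on degrees of freedom (entropy) and cohesion☆12Oct 7, 2018Updated 7 years ago
- Course-grabbing software written for the new course selection system that Southeast University adopted in 2018☆54Dec 20, 2019Updated 6 years ago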
- fastertransformer for codegeex model☆65Jun 6, 2023Updated 2 years ago
- An End-to-end Mutually Interactive Emotion-Cause Pair Extractor via Soft-sharing☆13Aug 11, 2022Updated 3 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- An implementation of LazyLLM token pruning for LLaMa 2 model family.☆13Jan 6, 2025Updated last year
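- (1) A rotary position embedding encoder with elastic interval normalization plus PEFT LoRA quantized training, improving performance and support for contexts on the order of ten thousand tokens. (2) Evidence-theory explanation learning, improving the model's complex logical reasoning ability. (3) Compatible with the alpaca data format.☆45Jul 19, 2023Updated 2 years ago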
- LLaMa/RWKV onnx models, quantization and testcase☆366Jul 6, 2023Updated 2 years ago
- ☆14Nov 20, 2022Updated 3 years ago
- ☆44Apr 7, 2026Updated last week
- Python library for adding visual effects to video streams☆11Dec 20, 2019Updated 6 years ago
- This project uses PyTorch and the transformers module to implement English sequence labeling, fine-tuning BERT in the process.☆19Feb 1, 2021Updated 5 years ago
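- chatglm-6b fine-tuning/LORA/PPO/inference; the samples are automatically generated integer/decimal addition, subtraction, multiplication, and division problems; runs on GPU or CPU☆165Aug 24, 2023Updated 2 years ago
- PyTorch-based text classification for imbalanced data☆12Dec 26, 2021Updated 4 years ago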
- LLM deployment project based on MNN. This project has been merged into MNN.☆1,615Jan 20, 2025Updated last year
- Asynchronous event I/O driven quantitative trading framework.☆12Aug 29, 2020Updated 5 years ago
- Echelon Blockchain Node - Cosmos SDK, IBC, and EVM compatible☆17Jan 19, 2026Updated 2 months ago
- use chatGLM to perform text embedding☆45Apr 9, 2023Updated 3 years ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- Uses the peft library to perform efficient 4-bit QLoRA fine-tuning of chatGLM-6B/chatGLM2-6B, merges the lora model with the base model, and applies 4-bit quantization (quantize).☆360Aug 22, 2023Updated 2 years ago
- A manually curated Chinese dialogue dataset and a piece of chatglm fine-tuning code☆1,194May 3, 2025Updated 11 months ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆21Apr 1, 2026Updated last week
- Firefly Chinese LLaMA-2 large model; supports incremental pre-training of large models such as Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, and Bloom☆416Oct 21, 2023Updated 2 years ago
- C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)☆2,962Jul 31, 2024Updated last year
- ☆15Aug 21, 2023Updated 2 years ago
- Simple implementation of using lora from the peft library to fine-tune the chatglm-6b☆84Apr 3, 2023Updated 3 years ago
- Asynchronous event I/O driven quantitative trading framework.☆20Mar 15, 2021Updated 5 years ago
- Adversarial learning by utilizing model interpretation☆10Oct 19, 2018Updated 7 years ago
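- fastllm is a high-performance large-model inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and hybrid-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. A dual-socket 9004/9005 server with a single GPU can deploy the original full-precision DeepSeek model at 20 tps for a single concurrent request; the INT4 quantized model, single concurrency, 30tp…☆4,187Updated this week
- DETR tensor: removes the auxiliary heads that are unused during inference, adds fp16 deployment for further acceleration, and offers a new method for fixing the all-zero output problem after converting to tensorrt.☆11Jan 9, 2024Updated 2 years ago
- A Markdown document tool designed for RAG systems, providing two main features: automatic heading-structure extraction and document splitting. It fully preserves the document hierarchy, solving the problem of traditional splitters losing heading levels and breaking table integrity. A hierarchy-preserving Markdown document splitter for RAG…☆13Jan 2, 2025Updated last year
- Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT☆3,723Oct 12, 2023Updated 2 years ago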
- ☆59Aug 1, 2023Updated 2 years ago
- This is Microsoft-Phi-3-NvidiaNIMWorkshop☆22Aug 16, 2024Updated last year
- ☆17Jun 1, 2022Updated 3 years ago
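- The Kanchil (mouse-deer) is the world's smallest even-toed ungulate; this open-source project aims to explore whether small models (under 6B) can also be aligned with human preferences.☆112Apr 1, 2023Updated 3 years ago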
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago