A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, training, evaluate and application!
☆47Oct 8, 2025Updated 7 months ago
Alternatives and similar repositories for quickllm
Users that are interested in quickllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Generative Dialogue State Tracking Model☆23Jun 24, 2021Updated 4 years ago
- Serverless LLM Inference: Deploy DeepSeek R1 & LLaMA Models on AWS Lambda with Ultra-Fast Cold Starts☆13Feb 3, 2026Updated 3 months ago
- ☆11Aug 29, 2022Updated 3 years ago
- Deepdive: Deep iterative thinking slash command for Claude Code - enables multi-round exploratory reasoning and non-linear problem-solvin…☆49Nov 9, 2025Updated 6 months ago
- (1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。☆45Jul 19, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 本项目基于modelscope-agent-v0.3和 api-for-open-llm 或 llamacpp 组件共同实现了一个AI Agent,能够利用本地的大模型(LLM)实现使用自定义工具功能。使用了Qwen1.5大模型。☆19Apr 10, 2024Updated 2 years ago
- 猛虎汽车故障云诊断系统☆13Dec 12, 2014Updated 11 years ago
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- ☆22Dec 18, 2024Updated last year
- WordMultiSenseDisambiguation, chinese multi-wordsense disambiguation based on online bake knowledge base and semantic embedding similarit…☆131Dec 15, 2018Updated 7 years ago
- Research project for task-oriented dialogue system with jointly training multi-intent classification and slot filling☆10Sep 11, 2023Updated 2 years ago
- ☆19Feb 18, 2025Updated last year
- ☆15Apr 4, 2025Updated last year
- [Ebook]从零到百万店铺:一个没有计算机学位的普通人的系统设计实战之旅☆27Nov 11, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 微信Ipad协议golang版本,基于grpc的实现策略。这套代码需要通过gprc服务端组包解包才可以正常使用☆13Jul 8, 2019Updated 6 years ago
- 爬取豆瓣上各个类型的电影信息(名称,时间,类型,评分,评论数,简介等)☆11Mar 30, 2019Updated 7 years ago
- ☆36Sep 6, 2024Updated last year
- 智能客服 基于springboot+swaggger+elasticsearch+mysql☆11Aug 22, 2018Updated 7 years ago
- Piece-wise CNN for relation extraction.☆13Oct 22, 2018Updated 7 years ago
- 介绍docker、docker compose的使用。☆21Sep 4, 2024Updated last year
- demos based on PSpider☆17Mar 1, 2019Updated 7 years ago
- Data and codes for BioBERT-MRC☆11Oct 5, 2021Updated 4 years ago
- 兼容 GPT2、Bloom 等 Pytorch 框架下的语言模型、人工智能标记语言 (AIML) 和任务型对话系统 (Task) 的深度中文智能对话机器人框架☆26Jun 12, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆29Feb 3, 2026Updated 3 months ago
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Aug 15, 2023Updated 2 years ago
- An elegent pytorch implement of transformers☆1,335May 1, 2026Updated last week
- Train Joint_NLU model using Chinese 中文意图和槽联合模型 tensorflow实现和pytorch实现☆15Feb 6, 2020Updated 6 years ago
- SUS-Chat: Instruction tuning done right☆49Jan 16, 2024Updated 2 years ago
- 主要是用python进行生存分析的步骤,包括生存分析(逐步和单因素),KM曲线、决策曲线,ROC曲线,训练测试样本分布比较☆11Dec 21, 2020Updated 5 years ago
- h5打开微信小程序/h5跳转微信小程序☆10Mar 21, 2022Updated 4 years ago
- 智枢多模态应急减灾智能平台,基于哈工大优势学科,深度融合卫星遥感、产业分布、物联网感知、社交媒体等多源异构数据,构建了包括洪水模型,气象模型,地震模型,野火模型等在内的智能体集群,精确识别灾情、量化评估灾损,实现灾害管理,填补我国巨灾模型多智能体平台的空白☆35Aug 15, 2025Updated 8 months ago
- 使用多轮对话数据集对deepseek进行lora微调教程☆60Dec 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated 2 months ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Oct 20, 2023Updated 2 years ago
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆89Jun 27, 2023Updated 2 years ago
- Threat hunting in social media☆12Feb 17, 2019Updated 7 years ago
- 一个集成jupyterlab编辑器的hanlp docker 镜像,并且使用github actions将镜像推送到自己的镜像仓库,便于快速体验hanlp☆11Dec 2, 2020Updated 5 years ago
- 使用BERT构建多标签标注模型☆42Feb 23, 2020Updated 6 years ago