A repo for updating and debugging Mixtral-8x7B, MoE, ChatGLM3, LLaMA 2, Baichuan, Qwen, and other LLMs, including the new Mixtral 8x7B models, covering training, evaluation, and application.
☆47Oct 8, 2025Updated 7 months ago
Alternatives and similar repositories for quickllm
Users that are interested in quickllm are comparing it to the libraries listed below.
- Run PyTorch models on Android GPUs with the Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- A Generative Dialogue State Tracking Model☆23Jun 24, 2021Updated 4 years ago
- ☆11Aug 29, 2022Updated 3 years ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
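- (1) A rotary position embedding encoder with elastic interval normalization plus PEFT LoRA quantized training, improving support for contexts of tens of thousands of tokens. (2) Evidence-theory-based explanation learning to improve the model's complex logical reasoning. (3) Compatible with the Alpaca data format.☆44Jul 19, 2023Updated 2 years ago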
- Reconstruction ICA☆10Aug 25, 2017Updated 8 years ago
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- ☆22Dec 18, 2024Updated last year
- WordMultiSenseDisambiguation, Chinese multi-sense word disambiguation based on an online Baike (encyclopedia) knowledge base and semantic embedding similarit…☆131Dec 15, 2018Updated 7 years ago
- Research project for task-oriented dialogue system with jointly training multi-intent classification and slot filling☆10Sep 11, 2023Updated 2 years ago
- #UAI2020 Codes for PAC-Bayesian Contrastive Unsupervised Representation Learning☆14May 23, 2022Updated 4 years ago
- ☆15Apr 4, 2025Updated last year
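- [Ebook] From Zero to a Million Stores: A Hands-On System Design Journey by an Ordinary Person Without a Computer Science Degree☆27Nov 11, 2025Updated 6 months ago
- Scrapes movie information across all genres on Douban (title, release date, genre, rating, number of reviews, synopsis, etc.)☆11Mar 30, 2019Updated 7 years ago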
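- Hello everyone! I am a feature-rich MCP service designed to break down the barriers between devices and services and give users a convenient experience. The weather tool links with meteorological platforms to quickly push real-time global weather to users and help them plan trips. The browser-control tool simulates manual operation to search and browse web pages automatically, saving a great deal of time. The camera tool calls the local camera to take photos and record video, enabling face recognition and safeguarding the home…☆14Apr 9, 2025Updated last year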
- ☆36Sep 6, 2024Updated last year
- Intelligent customer service based on Spring Boot + Swagger + Elasticsearch + MySQL☆11Aug 22, 2018Updated 7 years ago
- Piece-wise CNN for relation extraction.☆13Oct 22, 2018Updated 7 years ago
- An introduction to using Docker and Docker Compose.☆21Sep 4, 2024Updated last year
- demos based on PSpider☆17Mar 1, 2019Updated 7 years ago
- Open-source knowledge graph☆13May 26, 2022Updated 4 years ago
- Implements Baichuan-Chat fine-tuning with LoRA, QLoRA, and other fine-tuning methods; runs with one click.☆70Aug 15, 2023Updated 2 years ago
- An elegant PyTorch implementation of transformers☆1,332May 16, 2026Updated last week
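- Train a Joint_NLU model on Chinese data: a joint Chinese intent and slot model, with TensorFlow and PyTorch implementations☆15Feb 6, 2020Updated 6 years ago
- Zhishu, a multimodal intelligent platform for emergency response and disaster mitigation. Building on the strongest disciplines of Harbin Institute of Technology, it deeply fuses multi-source heterogeneous data such as satellite remote sensing, industrial distribution, IoT sensing, and social media, and builds a cluster of agents including flood, weather, earthquake, and wildfire models to precisely identify disasters, quantitatively assess losses, and support disaster management, filling the gap in multi-agent catastrophe-model platforms in China.☆35Aug 15, 2025Updated 9 months ago
- A tutorial on LoRA fine-tuning DeepSeek with a multi-turn dialogue dataset☆60Dec 26, 2024Updated last year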
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated 2 months ago
- NEXRAD Level 2 radar data visualization using Python and Panda3D☆18Jan 4, 2026Updated 4 months ago
- Same username on Bilibili, Xiaohongshu, and Zhihu☆16Feb 20, 2023Updated 3 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing Tongyi Qianwen Qwen-7B with TensorRT-LLM☆43Oct 20, 2023Updated 2 years ago
- Fine-tunes Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE☆88Jun 27, 2023Updated 2 years ago
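- Integrates advanced large language models such as Qwen and DeepSeek, supporting both a pure LLM + classification-layer mode and an LLM + LoRA + classification-layer mode; uses transformers for modular design and training, making it easy to adjust or replace components as needed.☆21Sep 1, 2025Updated 8 months ago
- A HanLP Docker image with the JupyterLab editor integrated, pushed to a personal image registry via GitHub Actions, for quickly trying out HanLP☆11Dec 2, 2020Updated 5 years ago
- Builds a multi-label tagging model with BERT☆41Feb 23, 2020Updated 6 years ago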
- ☆13Jun 3, 2020Updated 5 years ago
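- Processes neutral hydrogen spectral-line data of the Milky Way to produce a map of its spiral-arm structure and its rotation curve; the README.md gives recommended observing equipment and instructions for using the code☆11Oct 1, 2025Updated 7 months ago
- LLM-related stuff, including code and docs☆13Feb 25, 2025Updated last year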
- Making large AI models cheaper, faster and more accessible☆15Apr 20, 2023Updated 3 years ago