A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, training, evaluate and application!
☆47Oct 8, 2025Updated 6 months ago
Alternatives and similar repositories for quickllm
Users that are interested in quickllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Generative Dialogue State Tracking Model☆23Jun 24, 2021Updated 4 years ago
- Deepseek-r1复现科普与资源汇总☆22Mar 5, 2025Updated last year
- ☆11Aug 29, 2022Updated 3 years ago
- Deepdive: Deep iterative thinking slash command for Claude Code - enables multi-round exploratory reasoning and non-linear problem-solvin…☆49Nov 9, 2025Updated 5 months ago
- A one-page WebUI integrating VITS inference, training, and output in Sherpa-Onnx format.☆12Feb 2, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated 11 months ago
- 猛虎汽车故障云诊断系统☆13Dec 12, 2014Updated 11 years ago
- ☆15Jun 26, 2024Updated last year
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- WordMultiSenseDisambiguation, chinese multi-wordsense disambiguation based on online bake knowledge base and semantic embedding similarit…☆131Dec 15, 2018Updated 7 years ago
- Implementation of Hilbert beamforming for SNN-based audio source localisation☆16Oct 2, 2024Updated last year
- ☆19Feb 18, 2025Updated last year
- ☆15Apr 4, 2025Updated last year
- 爬取豆瓣上各个类型的电影信息(名称,时间,类型,评分,评论数,简介等)☆11Mar 30, 2019Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆36Sep 6, 2024Updated last year
- Piece-wise CNN for relation extraction.☆13Oct 22, 2018Updated 7 years ago
- 介绍docker、docker compose的使用。☆21Sep 4, 2024Updated last year
- Data and codes for BioBERT-MRC☆11Oct 5, 2021Updated 4 years ago
- 开源知识图谱☆13May 26, 2022Updated 3 years ago
- Fine-Tune LLM Synthetic-Data application and "From Data to AGI: Unlocking the Secrets of Large Language Model"☆16Jul 5, 2024Updated last year
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Aug 15, 2023Updated 2 years ago
- An elegent pytorch implement of transformers☆1,334Apr 10, 2026Updated last week
- Simulate Evidence Accumulation Models in Python☆23Nov 16, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Oct 20, 2023Updated 2 years ago
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆89Jun 27, 2023Updated 2 years ago
- Threat hunting in social media☆12Feb 17, 2019Updated 7 years ago
- 集成Qwen与DeepSeek等先进大语言模型,支持纯LLM+分类层模式及LLM+LoRA+分类层模式,使用transformers模块化设计和训练便于根据需要调整或替换组件。☆21Sep 1, 2025Updated 7 months ago
- Extending NERDA Library for Continual Learning☆11Mar 31, 2024Updated 2 years ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- ☆13Jun 3, 2020Updated 5 years ago
- llms related stuff , including code, docs☆13Feb 25, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023☆21Mar 12, 2026Updated last month
- CONditionals for Ordinal Regression and classification in PyTorch☆12Nov 5, 2022Updated 3 years ago
- Making large AI models cheaper, faster and more accessible☆15Apr 20, 2023Updated 2 years ago
- This is a Kaggle data mining contest, link: https://www.kaggle.com/c/avazu-ctr-prediction☆11Mar 12, 2015Updated 11 years ago
- Official code for infimm-hd☆16Sep 4, 2024Updated last year
- 语音合成VITS 纯中文微调☆12Mar 15, 2023Updated 3 years ago
- 由于BAAI/bge-large-zh 在Hugging Face Clone不下来,手动下载下来,便于使用☆11Sep 16, 2023Updated 2 years ago