ArtificialZeng / baichuan-speedup
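A pure C++ cross-platform LLM acceleration library, callable from Python; supports baichuan, glm, llama, and moss base models; runs chatglm-6B-class models smoothly on mobile devices; a single card can reach 10000+ tokens/s.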
☆43 Aug 16, 2023 Updated 2 years ago
Alternatives and similar repositories for baichuan-speedup
Users who are interested in baichuan-speedup are comparing it to the libraries listed below.
- Implements Baichuan-Chat fine-tuning with LoRA, QLoRA, and other fine-tuning methods; runs with one click. ☆70 Aug 15, 2023 Updated 2 years ago
- ggml implementation of the baichuan13b model (adapted from llama.cpp) ☆55 Jul 27, 2023 Updated 2 years ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces… ☆15 Jun 15, 2023 Updated 2 years ago
- PFCC community blog ☆14 Updated this week
- a lightweight deep learning framework for CSK60XX serial products ☆25 Feb 6, 2026 Updated last week
- A walkthrough of the official transformers source code. In the era of large AI models, pytorch and transformer are the new operating system, and everything else is software running on top of them. ☆16 Sep 25, 2023 Updated 2 years ago
- This project provides a face recognition system via opencv4 ☆18 Jan 16, 2019 Updated 7 years ago
- A line-by-line annotated version of the Baichuan2 code, suitable for beginners ☆212 Sep 20, 2023 Updated 2 years ago
- Transformer related optimization, including BERT, GPT ☆17 Jul 29, 2023 Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA ☆18 Jul 21, 2023 Updated 2 years ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models ☆23 Jul 27, 2024 Updated last year
- I used darknet compiled on windows10 to implement safety helmet detection, and wrote a simple mfc demo that uses our generated model for object detection. Safety Helmet Wearing Test ☆23 Nov 7, 2019 Updated 6 years ago
- Implement Learning Efficient Convolutional Networks Through Network Slimming on YOLOX ☆25 Jun 9, 2022 Updated 3 years ago
- A concise TensorRT tutorial ☆26 Aug 11, 2021 Updated 4 years ago
- Deploys M-LSD, a lightweight real-time line detection model, with ONNXRuntime; includes both C++ and Python versions ☆27 Jan 21, 2023 Updated 3 years ago
- convert paddleOCR to torchOCR, ppocr-v3, ppocr-v4, onnx, openvino ☆33 Aug 16, 2023 Updated 2 years ago
- Automatic greeting for boss直聘; windows and mac clients ☆27 Dec 5, 2023 Updated 2 years ago
- ChineseOcr Lite Mnn: an ultra-lightweight Chinese OCR PC demo that uses MNN for inference ☆27 Mar 26, 2021 Updated 4 years ago
- The official repo for the paper "Teacher Forcing Recovers Reward Functions for Text Generation" ☆31 May 27, 2023 Updated 2 years ago
- Demonstrates the remarkable effect of vllm on Chinese large language models ☆31 Nov 4, 2023 Updated 2 years ago
- [ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization ☆28 Dec 6, 2023 Updated 2 years ago
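- clip inference implemented in c++. The model has a few minor changes; the changes and the model export code can be found in the readme, and the model files, including the AX650 models, are all in Releases. Newly added support for ChineseCLIP ☆31 Jun 19, 2025 Updated 7 months ago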
- ☆31 Jun 18, 2021 Updated 4 years ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation ☆12 Feb 16, 2025 Updated last year
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025) ☆18 Jan 11, 2026 Updated last month
- Libraries, guides, blueprints, and sample code, to enable rapidly building 0-1 applications on iOS, Android and web. ☆11 May 12, 2023 Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆12 Nov 14, 2025 Updated 3 months ago
- A desktop pet program that now seems to have evolved into a desktop sticky-notes app. The sticky-notes program is on the develop-todolist branch. ☆11 Nov 17, 2024 Updated last year
- ChatGLM2-6B-Explained ☆36 Jul 28, 2023 Updated 2 years ago
- TensorRT 2022 second-round competition solution: TensorRT inference optimization of MST++, the first Transformer-based image reconstruction model ☆143 Jul 6, 2022 Updated 3 years ago
- ☆38 Mar 23, 2023 Updated 2 years ago
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation. ☆35 Nov 30, 2022 Updated 3 years ago
- Tracking Of Agent (actions and belief) and Spatio-TEmporal Reasoning ☆14 Feb 7, 2020 Updated 6 years ago
- Monte Carlo simulation of (4D) cone beam computed tomography ☆11 Dec 10, 2024 Updated last year
- Code for paper 'MulViMotion: Shape-aware 3D Myocardial Motion Tracking from Multi-View Cardiac MRI' ☆12 Sep 2, 2022 Updated 3 years ago
- A collection of alpha signals and settings submitted to WorldQuant BRAIN. ☆23 Jul 28, 2025 Updated 6 months ago
- Official Tensorflow implementation of ISCL (Under review) ☆10 Oct 29, 2021 Updated 4 years ago
- Official implementation of state-of-the-art multi-modal medical image deblurring (MedDeblur) ☆11 Jul 5, 2023 Updated 2 years ago
- ☆22 Dec 11, 2025 Updated 2 months ago