纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,
☆42Aug 16, 2023Updated 2 years ago
Alternatives and similar repositories for baichuan-speedup
Users that are interested in baichuan-speedup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。☆70Aug 15, 2023Updated 2 years ago
- ggml implementation of the baichuan13b model (adapted from llama.cpp)☆55Jul 27, 2023Updated 2 years ago
- PFCC 社区博客☆14Updated this week
- Baichuan2代码的逐行解析版本,适合小白☆212Sep 20, 2023Updated 2 years ago
- 基于baichuan-7b的开源多模态大语言模型☆72Dec 7, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆16Sep 25, 2023Updated 2 years ago
- ☆23Jul 17, 2023Updated 2 years ago
- 演示 vllm 对中文大语言模型的神奇效果☆31Nov 4, 2023Updated 2 years ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆10May 6, 2024Updated last year
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- ChatGLM2-6B-Explained☆36Jul 28, 2023Updated 2 years ago
- 飞桨模型加密库☆10Nov 13, 2021Updated 4 years ago
- 一个桌面宠物程序,现在似乎发展成为桌面便签了。桌面便签程序见develop-todolist分支。☆11Nov 17, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 对苏神的bert4keras的实现原理和矩阵运算进行详细的注释,方便学习;bert4keras链接:https://github.com/bojone/bert4keras☆42Dec 15, 2020Updated 5 years ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Jan 5, 2023Updated 3 years ago
- Resources for Large Language Model Inference☆17Dec 29, 2023Updated 2 years ago
- Android runtime permissions manager☆13Jul 8, 2019Updated 6 years ago
- an android sample using native activity and opengles and egl engine☆17Jul 8, 2017Updated 8 years ago
- CHATGPT-In-Jupyter☆11Jun 2, 2023Updated 2 years ago
- 聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)☆658Jun 30, 2023Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆416Jun 1, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 使用ONNXRuntime部署面向轻量实时的M-LSD直线检测,包含C++和Python两个版本的程序☆27Jan 21, 2023Updated 3 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- 多轮共情对话模型PICA☆97Sep 11, 2023Updated 2 years ago
- 智谱 glm realtime api python/golang/ts sdk, 包括 low level 的 websocket client 封装以及各个场景的调用样例☆25May 27, 2025Updated 10 months ago
- ☆15Jan 11, 2023Updated 3 years ago
- This project provides a face recoganization system via opencv4☆18Jan 16, 2019Updated 7 years ago
- AIxCC: automated vulnerability repair via LLMs, search, and static analysis☆12Jul 16, 2024Updated last year
- Examples of demo deployment using Gradio. Image Classification, Live Webcam Segmentation, APIs , Tunneling etc.☆17Oct 17, 2022Updated 3 years ago
- Deep-Learn model SSD_300x300 transplante to TensorRT(Nvidia Jetson Tx2)☆11Dec 8, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Markdown to LaTeX☆19Jul 29, 2022Updated 3 years ago
- Create RP training data from a VN, using GPT-4☆18Nov 2, 2023Updated 2 years ago
- 基于官方源码deepstream-test1修改,调用rtsp摄像头,并推理显示结果☆16Mar 11, 2020Updated 6 years ago
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation.☆35Nov 30, 2022Updated 3 years ago
- Apply prompt learning in Chinese NER tasks☆13Mar 24, 2022Updated 4 years ago
- 将MNN拆解的简易前向推理框架(for study!)☆24Feb 21, 2021Updated 5 years ago
- 百度QA100万数据集☆45Nov 30, 2023Updated 2 years ago