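A pure C++ cross-platform LLM acceleration library, callable from Python. Supports Baichuan, GLM, LLaMA, and MOSS base models; runs smoothly on mobile phones; ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU.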
☆42 Aug 16, 2023 Updated 2 years ago
Alternatives and similar repositories for baichuan-speedup
Users that are interested in baichuan-speedup are comparing it to the libraries listed below.
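- Implements Baichuan-Chat fine-tuning with LoRA, QLoRA, and other fine-tuning methods; runs with one click. ☆70 Aug 15, 2023 Updated 2 years ago
- ggml implementation of the baichuan13b model (adapted from llama.cpp) ☆55 Jul 27, 2023 Updated 2 years ago
- PFCC community blog ☆14 Updated this week
- Tianchi competition: Financial Brain, intelligent financial NLP services ☆16 Jul 9, 2018 Updated 7 years ago
- A line-by-line walkthrough of the Baichuan2 code, suitable for beginners ☆211 Sep 20, 2023 Updated 2 years ago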
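- An open-source multimodal large language model based on baichuan-7b ☆72 Dec 7, 2023 Updated 2 years ago
- Source-code analysis of the official transformers library. In the era of large AI models, PyTorch and transformer are the new operating system; everything else is software running on top of them. ☆16 Sep 25, 2023 Updated 2 years ago
- Evaluation benchmark for Chinese financial large language models: six categories, twenty-five tasks, graded ratings; a Chinese domestic model earned an A grade ☆10 May 6, 2024 Updated 2 years ago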
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces… ☆15 Jun 15, 2023 Updated 2 years ago
- Using darknet compiled on Windows 10, I implemented safety-helmet detection and wrote a simple MFC demo that uses the trained model for object detection. Safety Helmet Wearing Test ☆24 Nov 7, 2019 Updated 6 years ago
- Transformer related optimization, including BERT, GPT ☆17 Jul 29, 2023 Updated 2 years ago
- ChatGLM2-6B-Explained ☆36 Jul 28, 2023 Updated 2 years ago
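- A desktop pet program that now seems to have evolved into a desktop sticky-notes app. For the sticky-notes program, see the develop-todolist branch. ☆11 Nov 17, 2024 Updated last year
- Explains how to use pycrfsuite through examples ☆10 Nov 7, 2016 Updated 9 years ago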
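- Building Agentic-RAG with langraph ☆23 Jul 30, 2025 Updated 9 months ago
- Adds an RLHF implementation to ChatGLM-6B, with line-by-line explanations of some of the core code; the examples cover short news headline generation and an RLHF implementation for recommendations given a specified context ☆88 Aug 16, 2023 Updated 2 years ago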
- Detailed annotations of the implementation principles and matrix operations in Su Jianlin's bert4keras, intended as a study aid; bert4keras link: https://github.com/bojone/bert4keras ☆42 Dec 15, 2020 Updated 5 years ago
- Generates the large amounts of data needed by text-correction models such as Gector. ☆14 Jan 5, 2023 Updated 3 years ago
- Resources for Large Language Model Inference ☆17 Dec 29, 2023 Updated 2 years ago
- an android sample using native activity and opengles and egl engine ☆17 Jul 8, 2017 Updated 8 years ago
- Android runtime permissions manager ☆13 Jul 8, 2019 Updated 6 years ago
- Comparison of existing spell checking tools ☆11 Mar 28, 2023 Updated 3 years ago
- ☆10 Dec 20, 2022 Updated 3 years ago
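- Knowledge fusion annotation tool ☆15 Dec 19, 2018 Updated 7 years ago
- Deep Learning Library for R ☆12 May 6, 2018 Updated 8 years ago
- Deploys lightweight, real-time M-LSD line detection with ONNXRuntime; includes both C++ and Python versions of the program ☆26 Jan 21, 2023 Updated 3 years ago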
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex… ☆15 Nov 25, 2023 Updated 2 years ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models ☆23 Jul 27, 2024 Updated last year
- PICA, a multi-turn empathetic dialogue model ☆98 Sep 11, 2023 Updated 2 years ago
- Playground project acting as an example for a complex LangChain workflow ☆11 Jun 20, 2023 Updated 2 years ago
- ☆15 Jan 11, 2023 Updated 3 years ago
- Zhipu GLM realtime API SDK for Python/Golang/TS, including a low-level websocket client wrapper and usage examples for various scenarios ☆27 May 27, 2025 Updated 11 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc ☆159 Jul 25, 2025 Updated 9 months ago
- This project provides a face recognition system via opencv4 ☆18 Jan 16, 2019 Updated 7 years ago
- Code repository for learning OpenGL ☆14 Apr 25, 2026 Updated 2 weeks ago
- Examples of demo deployment using Gradio. Image Classification, Live Webcam Segmentation, APIs, Tunneling, etc. ☆17 Oct 17, 2022 Updated 3 years ago
- Deep learning model SSD_300x300 ported to TensorRT (Nvidia Jetson TX2) ☆11 Dec 8, 2018 Updated 7 years ago
- Markdown to LaTeX ☆19 Jul 29, 2022 Updated 3 years ago
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation. ☆34 Nov 30, 2022 Updated 3 years ago