zhjunqin / MachineLearning
Machine learning fundamentals
☆7 Updated 5 years ago
Related projects:
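- A beginner's introductory tutorial on model compression ☆23 Updated 2 months ago
- Blog information ☆22 Updated this week
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization Competition: third-place solution in the preliminary round ☆47 Updated last year
- Alias discovery system ☆11 Updated 2 years ago
- BERT model acceleration and deployment with TensorRT ☆9 Updated 2 years ago
- Baidu QA dataset with 1 million entries ☆48 Updated 9 months ago
- PyTorch automatic mixed precision training template ☆17 Updated 2 years ago
- Deploys a single model, multiple models, and multiple versions of the same model with tensorflow/serving, runs model predictions, and monitors the service with Prometheus. ☆11 Updated 3 years ago
- ☆12 Updated 9 months ago
- This plugin integrates faiss into the Greenplum database to provide vector recall capability. ☆20 Updated 2 years ago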
- The Python solutions of leetcode ☆12 Updated 4 years ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory ☆24 Updated 4 months ago
- Automatically Generated d2l-zh PyTorch Notebooks for SageMaker ☆9 Updated last year
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM ☆39 Updated 11 months ago
- This program shows how Bounding-Box-Regression works in a visual form. Intersection over Union (IOU), Non Maximum Suppression (NMS),… ☆17 Updated 4 years ago
- Learn some machine learning algorithms with Python ☆13 Updated 5 years ago
- ☢️ TensorRT 2023 second round: inference acceleration and optimization of the Llama model based on TensorRT-LLM ☆40 Updated 11 months ago
- Self-study - Deep learning network for CTR: FM, DeepFM, PNN, NFM, DCN, Wide&Deep, etc ☆16 Updated 3 years ago
- ☆15 Updated 7 months ago
- A Flask version of the model visualization tool netron ☆14 Updated 2 years ago
- Run ChatGLM2-6B in BM1684X ☆48 Updated 6 months ago
- Deep Interest Network for Click-Through Rate Prediction; Deep Interest Evolution Network for Click-Through Rate Prediction ☆10 Updated 3 years ago
- Decoding Attention is specially optimized for multi head attention (MHA) using CUDA core for the decoding stage of LLM inference. ☆14 Updated this week
- Large-scale exact string matching tool ☆15 Updated 11 months ago
- Deploy ChatGLM on Modelz ☆15 Updated last year
- Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen… ☆19 Updated last year
- Source project: https://github.com/imClumsyPanda/langchain-ChatGLM (this repository is used for debugging and for pulling into Colab) ☆9 Updated 11 months ago
- All code of tensorflow ☆7 Updated last year
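- AGM (阿格姆): AI Gene Map model, exploring the internal operating mechanisms of AI models and GPT/LLM large models from the perspective of token-weight particles. ☆24 Updated last year
- A pure C++ cross-platform LLM acceleration library, callable from Python; supports the baichuan, glm, llama, and moss base models; runs smoothly on mobile devices; chatglm-6B-class models can reach 10000+ tokens/s on a single card ☆44 Updated last year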