MobiSense / SpecOffload-public
☆29 Feb 3, 2026 Updated last week
Alternatives and similar repositories for SpecOffload-public
Users who are interested in SpecOffload-public are comparing it to the repositories listed below.
- ☆35 Nov 28, 2024 Updated last year
- Classical Chinese poetry generation based on pytorch_rnn ☆10 Oct 24, 2021 Updated 4 years ago
- ☆10 May 14, 2023 Updated 2 years ago
- PyTorch code for full quantization of DNN using BCGD ☆14 Jul 24, 2019 Updated 6 years ago
- NLP models and codes for BAAI-JD joint project. ☆10 May 27, 2020 Updated 5 years ago
- The code based on vLLM for the paper “Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”. ☆11 Sep 19, 2024 Updated last year
- AccelOpt: Self-improving Agents for AI Accelerator Kernel Optimization ☆20 Jan 28, 2026 Updated 2 weeks ago
- ☆22 Oct 7, 2025 Updated 4 months ago
- Convolutional Neural Network for Text Classification in Tensorflow ☆10 Apr 3, 2017 Updated 8 years ago
- Training models with ternary quantized weights using PyTorch ☆15 Jun 12, 2019 Updated 6 years ago
- Implemented transformer NN block for Machine translation, text classification, Natural language inference as well as Machine reading compr… ☆11 Dec 27, 2025 Updated last month
- ☆10 Mar 14, 2020 Updated 5 years ago
- Repo for my Master Thesis at ULiège in 2019 (Machine learning under resource constraints) ☆10 Jun 29, 2019 Updated 6 years ago
- ☆10 Dec 15, 2018 Updated 7 years ago
- ☆18 Oct 29, 2025 Updated 3 months ago
- ☆14 Mar 21, 2020 Updated 5 years ago
- Chinese short-text classification task based on BERT and PyTorch ☆13 Nov 2, 2022 Updated 3 years ago
- ☆13 Mar 6, 2023 Updated 2 years ago
- Discord crawler that scrapes and saves all content from a Discord channel ☆12 Jan 31, 2021 Updated 5 years ago
- ☆17 Feb 18, 2025 Updated 11 months ago
- ☆20 Jun 9, 2025 Updated 8 months ago
- [DAC'25] Official implementation of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference" ☆101 Dec 15, 2025 Updated 2 months ago
- Large-scale exact string matching tool ☆17 Mar 7, 2025 Updated 11 months ago
- Code for training binary and WTA SNNs ☆17 Mar 25, 2022 Updated 3 years ago
- ☆16 Jun 18, 2025 Updated 7 months ago
- CS294 AI Systems Class Website ☆17 Apr 25, 2022 Updated 3 years ago
- DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment ☆16 Dec 1, 2016 Updated 9 years ago
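- Stagehand-GLM is an AI browser automation framework deeply customized from stagehand-python and specifically adapted to Zhipu AI's GLM text and multimodal large models. It offers a progressive RPA operation strategy that lets developers find the best balance between intelligent convenience and cost-effectiveness. ☆27 Aug 18, 2025 Updated 5 months ago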
- This repo is an implementation of quantized CNN for both weights (1-bit compression) and feature maps (2-bit compression). ☆18 Aug 22, 2018 Updated 7 years ago
- Natural language processing: Chinese text classification (using spam SMS detection as an example) ☆23 Jun 4, 2020 Updated 5 years ago
- ☆18 May 25, 2018 Updated 7 years ago
- ☆26 Dec 29, 2019 Updated 6 years ago
- ☆19 Jun 15, 2020 Updated 5 years ago
- Compression of Deep Neural Networks LeNet-300-100 and LeNet-5 trained on MNIST and CIFAR-10 using Quantization, Knowledge Distillation & … ☆20 Aug 22, 2019 Updated 6 years ago
- Implementation of a Quantized Transformer Model ☆19 Mar 20, 2019 Updated 6 years ago
- Knowledge Distillation Algorithms implemented with PyTorch ☆17 Jul 23, 2019 Updated 6 years ago
- Demo video is available on the ModelScope community ☆22 Mar 6, 2025 Updated 11 months ago
- ☆21 Dec 6, 2025 Updated 2 months ago
- An implementation of several models (BiLSTM-CRF, BiLSTM-CNN, BiLSTM-BiLSTM) for Medical Named Entity Recognition (NER) ☆19 Dec 22, 2024 Updated last year