Multi-branch model for concurrent execution
☆18Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for multiBranchModel
Users that are interested in multiBranchModel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆78May 28, 2023Updated 2 years ago
- A simple yet high performance web server written with epoll and pure c.☆18Jun 7, 2019Updated 6 years ago
- ☆19Feb 28, 2022Updated 4 years ago
- A profiler to disclose and quantify hardware features on GPUs.☆176May 15, 2022Updated 3 years ago
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …☆37Aug 29, 2025Updated 7 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This is a list of awesome edgeAI inference related papers.☆99Dec 21, 2023Updated 2 years ago
- ☆33Mar 31, 2025Updated last year
- This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.☆88Oct 25, 2024Updated last year
- ☆135Feb 12, 2026Updated last month
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆25Jan 4, 2026Updated 3 months ago
- ☆13Mar 10, 2026Updated 3 weeks ago
- Coco datasets Visualization.☆10Aug 9, 2021Updated 4 years ago
- The notes of Java interview, we can visit https://cornprincess.github.io/Backend_Notes/ to read notes. The visitor in China can browser☆11Feb 23, 2021Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Implementations of different neural network pruning techniques☆14Aug 10, 2023Updated 2 years ago
- Source code (Python, Node.js and Java) for a demo we built which has been shown at a number of conferences, including IoT Solutions World…☆13Jan 9, 2023Updated 3 years ago
- LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks☆18Mar 25, 2022Updated 4 years ago
- ☆38Jun 27, 2025Updated 9 months ago
- 微信Ipad协议golang版本,基于grpc的实现策略。这套代码需要通过gprc服务端组包解包才可以正常使用☆13Jul 8, 2019Updated 6 years ago
- edge/mobile transformer based Vision DNN inference benchmark☆16Aug 29, 2025Updated 7 months ago
- ☆39Mar 14, 2024Updated 2 years ago
- ☆102Jan 17, 2024Updated 2 years ago
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"☆24Oct 25, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆10Jun 28, 2022Updated 3 years ago
- Evaluate Transformers from the Hub 🔥☆14Updated this week
- handwritten digits recognition by M5Stack☆16Aug 17, 2018Updated 7 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated 2 years ago
- survey and analysis of kv-stores in academia and industry☆10Aug 31, 2019Updated 6 years ago
- 🧑🚀 Professional translation and reading of English academic papers in PDF format.☆11Nov 2, 2023Updated 2 years ago
- A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…☆49Feb 13, 2026Updated last month
- AVPipe :-)☆12Jul 16, 2021Updated 4 years ago
- Residual vector quantization for KV cache compression in large language model☆12Oct 22, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Mar 30, 2020Updated 6 years ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆55Dec 11, 2022Updated 3 years ago
- ☆52Dec 13, 2022Updated 3 years ago
- ☆212Jan 17, 2024Updated 2 years ago
- ☆15Jun 26, 2024Updated last year
- Generative Models for Image Captioning☆10Jun 7, 2017Updated 8 years ago
- Process CCPD picture filenames, translate car plate into readable Chinese, draw bounding boxes and make corresponding txt for YOLO.☆11May 17, 2021Updated 4 years ago