This library lets users port pretrained models and checkpoints from the HuggingFace (HF) hub (developed using the HF transformers library) into inference-ready formats that run efficiently on Qualcomm Cloud AI 100 accelerators.
☆87Mar 18, 2026Updated last week
Alternatives and similar repositories for efficient-transformers
Users who are interested in efficient-transformers are comparing it to the libraries listed below.
- A virtual city environment for traffic simulation, drone simulation, computer vision, etc.☆14Mar 20, 2026Updated last week
- This repository contains the results and code for the MLPerf™ Inference v1.0 benchmark.☆33Jul 24, 2025Updated 8 months ago
- MobileLLM-R1☆77Sep 30, 2025Updated 5 months ago
- RBLN Model Zoo — Compile once. Deploy anywhere.☆31Mar 3, 2026Updated 3 weeks ago
- Self-implemented NN operators for Qualcomm's Hexagon NPU☆53Sep 30, 2025Updated 5 months ago
- Dev repo for power measurement for the MLPerf™ benchmarks☆28Sep 11, 2025Updated 6 months ago
- A planning interface based on CommonRoad which integrates into the Autoware.Universe software stack☆16Apr 29, 2025Updated 10 months ago
- Software kit for Qualcomm Cloud AI 100☆20Dec 15, 2025Updated 3 months ago
- This repository contains the results and code for the MLPerf™ Inference v2.1 benchmark.☆18Jul 24, 2025Updated 8 months ago
- Outdated. See https://github.com/saleae/jtag-analyzer for the official Saleae JTAG Analyzer☆12May 21, 2018Updated 7 years ago
- Bridge Autoware and Carla with Zenoh☆19Mar 2, 2026Updated 3 weeks ago
- Vector Bazel Rules and Toolchains☆15Mar 2, 2026Updated 3 weeks ago
- Code sample showing how to run and benchmark models on Qualcomm's Windows PCs☆105Oct 4, 2024Updated last year
- ☆14Dec 16, 2019Updated 6 years ago
- Wireshark QMI dissector for Qualcomm based modems☆14Oct 2, 2025Updated 5 months ago
- ☆13Jan 5, 2014Updated 12 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.7 benchmark.☆17Jul 24, 2025Updated 8 months ago
- ☆47May 20, 2025Updated 10 months ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- CK workflow, portable packages and other artifacts for the ReQuEST-ASPLOS'18 submission:☆15Oct 8, 2019Updated 6 years ago
- 3D-printed spare parts for SGI - Silicon Graphics Computer Systems☆18Feb 14, 2025Updated last year
- ☆30Nov 26, 2025Updated 4 months ago
- Guide by example of how to network boot IRIX from a raspberry pi.☆11Apr 1, 2022Updated 3 years ago
- K-Scale's library for programmatically interacting with OnShape☆45Sep 16, 2025Updated 6 months ago
- Anwsion is a simple ask-and-answer system written in PHP and MySQL.☆16May 30, 2012Updated 13 years ago
- ROS node to exchange data between ROS and medical image computing software using the OpenIGTLink protocol.☆24Oct 11, 2021Updated 4 years ago
- (Experimental) ROS packages for Blue + Gazebo☆15Aug 4, 2019Updated 6 years ago
- The official PyTorch implementation of IEEE Transactions on Image Processing 2021 paper "Rethinking the U-shape Structure for Salient Obj…☆20Dec 1, 2022Updated 3 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- ☆13Feb 5, 2025Updated last year
- X.25 PAD for XOT☆16Jun 2, 2024Updated last year
- Quantize yolov7 using pytorch_quantization.🚀🚀🚀☆12Oct 20, 2023Updated 2 years ago
- ☆42Jun 25, 2020Updated 5 years ago
- [AAAI'23] Memory-aided Contrastive Consensus Learning for Co-salient Object Detection.☆23Nov 12, 2024Updated last year
- AGX Dynamics for Unreal plugin.☆12Mar 20, 2026Updated last week
- Code for the paper "Semantic-Guided Inpainting Network for Complex Urban Scenes Manipulation"☆13Jul 7, 2021Updated 4 years ago
- Community maintained hardware plugin for vLLM on AWS Neuron☆24Mar 19, 2026Updated last week
- https://github.com/shouxieai/hard_decode_trt Windows build☆13Sep 8, 2022Updated 3 years ago
- This crate is now part of the vm-virtio workspace: https://github.com/rust-vmm/vm-virtio☆15Mar 2, 2022Updated 4 years ago