[MobiCom 24] Adaptive DNN inference under memory constraints
☆57Jan 22, 2025Updated last year
Alternatives and similar repositories for FlexNN
Users that are interested in FlexNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Stream-based LLM Agent Framework for Continuous Context Sensing and Sharing☆45Apr 25, 2026Updated last month
- The rknn2 API uses the secondary encapsulation of the process, which is easy for everyone to call. It is applicable to rk356x rk3588☆47Jun 18, 2022Updated 4 years ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆30Mar 5, 2024Updated 2 years ago
- ☆13May 11, 2023Updated 3 years ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11May 19, 2025Updated last year
- UIE(Universal Information Extraction) infer by ncnn☆15Sep 22, 2024Updated last year
- ☆17Oct 19, 2023Updated 2 years ago
- ☆33Jul 23, 2024Updated last year
- ☆16Jul 25, 2023Updated 2 years ago
- SGEMM optimization with cuda step by step☆22Mar 23, 2024Updated 2 years ago
- Paper list for Personal LLM Agents☆430May 8, 2024Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Lightning-fast LLM inference engine - Built with Rust (inspiration from https://github.com/GeeeekExplorer/nano-vllm)☆36Jun 24, 2025Updated 11 months ago
- Implementation for AutoIOT: LLM-Driven Automated Natural Language Programming for AIoT Applications☆38Mar 24, 2026Updated 2 months ago
- Simulator code of the paper "Dissecting and Modeling the Architecture of Modern GPU Cores"☆93Oct 15, 2025Updated 8 months ago
- [ACL 2021] IrEne: Interpretable Energy Prediction for Transformers☆11Sep 8, 2021Updated 4 years ago
- ppstructure deploy by ncnn☆36Jul 16, 2024Updated last year
- ☆14Aug 1, 2020Updated 5 years ago
- ☆12Apr 19, 2022Updated 4 years ago
- Detect CPU features with single-file☆458May 22, 2026Updated 3 weeks ago
- The Free Software Media System. 适用于Rockchip SoC 和 RTD1296 的 Jellyfin,请使用已编译的镜像 https://hub.docker.com/u/jjm2473☆16Jan 29, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆67Mar 25, 2025Updated last year
- ☆16Jun 8, 2021Updated 5 years ago
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5☆16Sep 19, 2024Updated last year
- Optimizing the Deployment of Tiny Transformers on Low-Power MCUs☆37Sep 2, 2024Updated last year
- ☆11Apr 12, 2022Updated 4 years ago
- Self-supervised Features Extraction for animal behavior discrimination☆26Jun 30, 2022Updated 3 years ago
- Efficient inference of large language models.☆152Sep 28, 2025Updated 8 months ago
- ☆11Mar 15, 2023Updated 3 years ago
- ☆67Dec 3, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Sep 11, 2020Updated 5 years ago
- CyclicSim: Different Variants of Cyclic Shapers in TSN☆13Jul 31, 2025Updated 10 months ago
- A DeepLearn Model to rec Math formula. 一个深度学习库用来识别数学公式 数式を識別するためのディープラーニング ライブラ リ☆27Mar 2, 2025Updated last year
- This code is a version of implement of the essay named Deep Inception Networks: A General End-to-End Framework for Multi-asset Quantitati…☆13Mar 15, 2024Updated 2 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- nanodet_rknn on rk3399pro platform☆17Apr 17, 2022Updated 4 years ago