InsNet Runs Instance-dependent Neural Networks with Padding-free Dynamic Batching.
☆67 · Nov 20, 2021 · Updated 4 years ago
Alternatives and similar repositories for InsNet
Users interested in InsNet are comparing it to the libraries listed below.
- a simple API to use CUPTI ☆11 · Aug 19, 2025 · Updated 6 months ago
- Implementation of Hyena Hierarchy in JAX ☆10 · Apr 30, 2023 · Updated 2 years ago
- ☆12 · Mar 13, 2023 · Updated 2 years ago
- ☆11 · Apr 5, 2021 · Updated 4 years ago
- Benchmark tests supporting the TiledCUDA library. ☆18 · Nov 19, 2024 · Updated last year
- Transformers components but in Triton ☆34 · May 9, 2025 · Updated 9 months ago
- a single-header math library ☆17 · Nov 7, 2025 · Updated 3 months ago
- ☆13 · Jul 6, 2018 · Updated 7 years ago
- A tracing JIT for PyTorch ☆17 · Aug 29, 2022 · Updated 3 years ago
- Official Repository for Efficient Linear-Time Attention Transformers. ☆18 · Jun 2, 2024 · Updated last year
- OneFlow Serving ☆21 · Apr 10, 2025 · Updated 10 months ago
- CUDA 12.2 HMM demos ☆20 · Jul 26, 2024 · Updated last year
- ☆24 · Jan 30, 2026 · Updated last month
- Surrogate-based Hyperparameter Tuning System ☆28 · Jun 29, 2023 · Updated 2 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines ☆2,016 · Oct 4, 2018 · Updated 7 years ago
- ☆61 · Nov 27, 2023 · Updated 2 years ago
- Gensis is a lightweight deep learning framework written from scratch in Python, with Triton as its backend for high-performance computing… ☆37 · Jan 15, 2026 · Updated last month
- ☆29 · Oct 3, 2022 · Updated 3 years ago
- TVMScript kernel for deformable attention ☆25 · Dec 15, 2021 · Updated 4 years ago
- Official code for Long Expressive Memory (ICLR 2022, Spotlight) ☆71 · Mar 11, 2022 · Updated 3 years ago
- FlexAttention w/ FlashAttention3 Support ☆27 · Oct 5, 2024 · Updated last year
- Do NLP without coding! Simple NLP framework. ☆22 · Sep 11, 2022 · Updated 3 years ago
- Asynchronous pipeline parallel optimization ☆19 · Feb 2, 2026 · Updated last month
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25] ☆41 · May 13, 2025 · Updated 9 months ago
- JAX/Flax implementation of the Hyena Hierarchy ☆34 · Apr 27, 2023 · Updated 2 years ago
- the implementation of yolov2 by tensorflow ☆26 · Aug 7, 2018 · Updated 7 years ago
- Yet Another Neural Network Library 🤔 ☆27 · Updated this week
- Rate My Plate, Twitter Bot to rate the environmental impact of your food. ☆10 · Mar 8, 2017 · Updated 8 years ago
- [Python] First experience: a Tetris game ☆11 · Apr 4, 2020 · Updated 5 years ago
- Symphony: A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi… ☆30 · Oct 30, 2025 · Updated 4 months ago
- Real Time OCR Web App (React, NodeJS, Python and AWS) ☆12 · Jan 5, 2023 · Updated 3 years ago
- Prefix-Aware Attention for LLM Decoding ☆29 · Jan 23, 2026 · Updated last month
- Support Vector Machine (SVM) library for Python with GPU ☆31 · Jun 7, 2018 · Updated 7 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆35 · Jan 9, 2023 · Updated 3 years ago
- A Keras inspired training utility for PyTorch ☆38 · Sep 13, 2018 · Updated 7 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971) ☆39 · Oct 12, 2019 · Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system ☆36 · Oct 28, 2019 · Updated 6 years ago
- Place for meetup slides ☆140 · Oct 11, 2020 · Updated 5 years ago
- Framework to reduce autotune overhead to zero for well known deployments. ☆97 · Sep 19, 2025 · Updated 5 months ago