InsNet Runs Instance-dependent Neural Networks with Padding-free Dynamic Batching.
☆67Nov 20, 2021Updated 4 years ago
Alternatives and similar repositories for InsNet
Users that are interested in InsNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 3 years ago
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- A Light CNN Framework!☆16Apr 8, 2019Updated 7 years ago
- Yet another Polyhedra Compiler for DeepLearning☆19Apr 14, 2023Updated 3 years ago
- ☆11Apr 5, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Mar 13, 2023Updated 3 years ago
- a single-header math library☆17Nov 7, 2025Updated 6 months ago
- Transformers components but in Triton☆34May 9, 2025Updated last year
- Fork of LLVM Project containing a Colossus IPU backend implementation☆14Mar 11, 2026Updated 2 months ago
- Parallel Self-Adjusting Computation☆16Jul 5, 2021Updated 4 years ago
- A tracing JIT for PyTorch☆17Aug 29, 2022Updated 3 years ago
- Do NLP without coding! Simple NLP framework.☆22Sep 11, 2022Updated 3 years ago
- ☆67Nov 27, 2023Updated 2 years ago
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- OneFlow Serving☆20Apr 10, 2025Updated last year
- CUDA 12.2 HMM demos☆21Jul 26, 2024Updated last year
- Runtimex package help to expose Go Runtime internals representation safely.☆12Feb 19, 2025Updated last year
- Chess engine in C++☆10Feb 9, 2026Updated 3 months ago
- ☆11Jul 10, 2022Updated 3 years ago
- Chrome extension for OA sites like arxiv, openreivew: 1. PDF back to abstract page, 2. Rename PDF page with paper title.☆18Oct 12, 2023Updated 2 years ago
- ☆16Dec 1, 2024Updated last year
- Place for meetup slides☆139Oct 11, 2020Updated 5 years ago
- [NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…☆10Feb 13, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Surrogate-based Hyperparameter Tuning System☆30Jun 29, 2023Updated 2 years ago
- Official Repository for Efficient Linear-Time Attention Transformers.☆18Jun 2, 2024Updated last year
- An implementation of AutoScale regression-based method☆12Oct 27, 2020Updated 5 years ago
- nnq_cnd_study stands for Neural Network Quantization & Compact Networks Design Study☆13Aug 31, 2020Updated 5 years ago
- SQL Optimizations using MLIR☆12Apr 5, 2020Updated 6 years ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- LOUDS-trie implementation example (C++)☆15Nov 27, 2019Updated 6 years ago
- A GPU performance profiling tool for PyTorch models☆22Jul 5, 2022Updated 3 years ago
- Digital Design Lab Spring 2019 Final Project☆13Jun 17, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A chrome extension to embrace your Dark Side!☆11Feb 14, 2021Updated 5 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- ☆30Oct 3, 2022Updated 3 years ago
- Simple Dynamic Batching Inference☆144Mar 8, 2022Updated 4 years ago
- Gensis is a lightweight deep learning framework written from scratch in Python, with Triton as its backend for high-performance computing…☆36Jan 15, 2026Updated 4 months ago
- How and why you want to make your pytorch CUDA/CPP extension with a Makefile☆172Jul 3, 2019Updated 6 years ago
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"☆10Oct 23, 2021Updated 4 years ago