seth-lu / Im2win
☆13Updated last year
Alternatives and similar repositories for Im2win:
Users that are interested in Im2win are comparing it to the libraries listed below
- MLPerf™ Mobile models☆24Updated 3 months ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆21Updated 3 weeks ago
- ☆9Updated last year
- The official repository of Quamba☆14Updated 2 months ago
- ☆20Updated 2 years ago
- Triton kernels for Flux☆17Updated 2 weeks ago
- Explore training for quantized models☆12Updated last week
- Benchmarks to capture important workloads.☆29Updated this week
- A Winograd Minimal Filter Implementation in CUDA☆23Updated 3 years ago
- FlexAttention w/ FlashAttention3 Support☆27Updated 3 months ago
- TensorRT LLM Benchmark Configuration☆12Updated 5 months ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆28Updated 3 months ago
- Dynamic Neural Architecture Search Toolkit☆29Updated last month
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆47Updated this week
- NASRec Weight Sharing Neural Architecture Search for Recommender Systems☆29Updated last year
- ☆16Updated last month
- ☆43Updated 7 months ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆31Updated last year
- Memory Optimizations for Deep Learning (ICML 2023)☆62Updated 10 months ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Updated 4 years ago
- GEMM and Winograd based convolutions using CUTLASS☆26Updated 4 years ago
- ☆11Updated 4 months ago
- TAO Toolkit deep learning networks with TensorFlow 1.x backend☆13Updated 11 months ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 3 years ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML'24)☆29Updated 5 months ago
- Open Source Projects from Pallas Lab☆20Updated 3 years ago
- ☆62Updated last month
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆57Updated this week
- Standalone Flash Attention v2 kernel without libtorch dependency☆99Updated 4 months ago
- ☆12Updated 3 years ago