OwLite is a low-code compression toolkit for AI models.
☆53Nov 14, 2025Updated 4 months ago
Alternatives and similar repositories for owlite
Users who are interested in owlite are comparing it to the libraries listed below.
- Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.☆55Jul 16, 2025Updated 8 months ago
- QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference☆120Mar 6, 2024Updated 2 years ago
- How to deploy CenterNet models using DeepStream☆12Feb 1, 2022Updated 4 years ago
- Provides the examples to write and build Habana custom kernels using the HabanaTools☆25Apr 15, 2025Updated 11 months ago
- [ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks☆39Feb 4, 2025Updated last year
- vLLM plugin for RBLN NPU☆44Updated this week
- ☆22Jun 10, 2025Updated 9 months ago
- ☆55Nov 22, 2022Updated 3 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration☆30Nov 22, 2025Updated 4 months ago
- A special birthday event project for my front-end developer girlfriend 🎉☆10Oct 23, 2023Updated 2 years ago
- ☆13Mar 5, 2024Updated 2 years ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆31Oct 20, 2025Updated 5 months ago
- ☆17Mar 28, 2024Updated last year
- [ICLR2025] Code and data for paper: Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasonin…☆40Mar 10, 2025Updated last year
- ☆14Aug 19, 2024Updated last year
- A terminal for JupyterLite.☆24Updated this week
- Updated fork of g2pk☆13Aug 18, 2023Updated 2 years ago
- This repository is the official implementation of DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Re…☆39Mar 27, 2022Updated 3 years ago
- Robust multimodal integration method implemented in PyTorch and TensorFlow☆87Mar 5, 2021Updated 5 years ago
- nnq_cnd_study stands for Neural Network Quantization & Compact Networks Design Study☆13Aug 31, 2020Updated 5 years ago
- Code for the ICLR 2020 paper "Training Binary Neural Networks with Real-to-Binary Convolutions"☆34Jun 16, 2020Updated 5 years ago
- Study parallel programming - CUDA, OpenMP, MPI, Pthread☆64Jul 3, 2022Updated 3 years ago
- ☆43Nov 15, 2022Updated 3 years ago
- Implementation of the paper 'Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance' (EMNLP 2025)☆28Dec 16, 2025Updated 3 months ago
- Website for CSE 234, Winter 2025☆13Mar 24, 2025Updated last year
- ☆56Nov 14, 2024Updated last year
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- ☆18Aug 23, 2024Updated last year
- Explainable vision transformer for automatic visual sleep staging on multimodal PSG signals☆18Dec 23, 2024Updated last year
- Smoothing video traffic to make it a friendlier internet neighbor☆14Apr 23, 2024Updated last year
- This is my internship project in NetEase game AI Lab: multi-modal virtual human interaction, through the text to predict the virtual huma…☆10Nov 5, 2018Updated 7 years ago
- ☆23Oct 26, 2019Updated 6 years ago
- Official MICCAI 2022 Federated Learning for Healthcare Tutorial Repo☆14Dec 2, 2024Updated last year
- Generic library for neural collapse and several derivative works on the phenomenon.☆18Apr 14, 2025Updated 11 months ago
- POSTECH: Compiler Construction (Spring 2022)☆11Mar 10, 2023Updated 3 years ago
- ☆37Nov 14, 2025Updated 4 months ago
- The official code for our CVPR 2022 paper: A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift☆45Dec 8, 2022Updated 3 years ago
- ☆13May 11, 2023Updated 2 years ago