OwLite is a low-code compression toolkit for AI models.
☆53Nov 14, 2025Updated 7 months ago
Alternatives and similar repositories for owlite
Users that are interested in owlite are comparing it to the libraries listed below.
- Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.☆57Jul 16, 2025Updated 11 months ago
- QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference☆123Mar 6, 2024Updated 2 years ago
- How to deploy CenterNet models using DeepStream☆12Feb 1, 2022Updated 4 years ago
- Provides the examples to write and build Habana custom kernels using the HabanaTools☆25Apr 15, 2025Updated last year
- vLLM plugin for RBLN NPU☆50Updated this week
- Yet Another JSON Parser in Python☆11Mar 1, 2023Updated 3 years ago
- ☆31Sep 18, 2017Updated 8 years ago
- ☆22Jun 10, 2025Updated last year
- [CVPR 2025] Efficient Personalization of Quantized Diffusion Model without Backpropagation☆17Mar 31, 2025Updated last year
- ☆55Nov 22, 2022Updated 3 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- ☆12Aug 6, 2024Updated last year
- ☆21May 14, 2026Updated last month
- Deep Learning Visualization Tools Using PyTorch☆11Feb 2, 2021Updated 5 years ago
- ☆35Nov 21, 2022Updated 3 years ago
- 1st ranked light-weight face verification model for AI Challenge 2020, hosted by MSIT Korea.☆12Jul 28, 2020Updated 5 years ago
- [ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…☆32Apr 14, 2026Updated 2 months ago
- Tensorflow implementation of MAMNet☆10Apr 2, 2020Updated 6 years ago
- How to create, train and quantize network, then integrate it into pre/post image processing and generate CUDA C++ code for targeting Jets…☆12May 7, 2025Updated last year
- 🇰🇷 Korean language skills for AI agents☆52May 5, 2026Updated last month
- A special birthday event project for my front-end developer girlfriend 🎉☆10Oct 23, 2023Updated 2 years ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆33Oct 20, 2025Updated 7 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆14Jan 8, 2026Updated 5 months ago
- Hallym University Open Source SW Education Center☆11Jan 30, 2020Updated 6 years ago
- [ICLR2025] Code and data for paper: Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasonin…☆43Mar 10, 2025Updated last year
- Progressive Growing of Points with Tree-structured Generators (BMVC 2021)☆11Nov 1, 2023Updated 2 years ago
- Parallel Self-Adjusting Computation☆16Jul 5, 2021Updated 4 years ago
- ☆15Jul 24, 2025Updated 10 months ago
- This repository is the official implementation of DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Re…☆39Mar 27, 2022Updated 4 years ago
- State-of-the-art 84.7% accuracy on SleepEDF-78 and 88.4% on the SHHS dataset☆10Apr 28, 2025Updated last year
- my submitted project to UoL Programming with Data module☆12Jan 5, 2023Updated 3 years ago
- Comparing sequential forecasters via confidence sequences & e-processes☆11Oct 24, 2023Updated 2 years ago
- Robust multimodal integration method implemented in PyTorch and TensorFlow☆88Mar 5, 2021Updated 5 years ago
- ☆17Jun 17, 2024Updated last year
- ☆15Oct 12, 2020Updated 5 years ago
- Vitis-AI 1.3 TensorFlow2 flow with a custom CNN model, targeted ZCU102 evaluation board.☆15Apr 6, 2021Updated 5 years ago
- nnq_cnd_study stands for Neural Network Quantization & Compact Networks Design Study☆13Aug 31, 2020Updated 5 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆88Jun 5, 2026Updated last week
- Korea Meteorological Administration☆15Jan 18, 2023Updated 3 years ago