OwLite is a low-code compression toolkit for AI models.
☆54Nov 14, 2025Updated 5 months ago
Alternatives and similar repositories for owlite
Users who are interested in owlite are comparing it to the libraries listed below.
- Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.☆56Jul 16, 2025Updated 9 months ago
- QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference☆122Mar 6, 2024Updated 2 years ago
- How to deploy CenterNet models using DeepStream☆12Feb 1, 2022Updated 4 years ago
- ☆90Mar 28, 2024Updated 2 years ago
- [ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks☆41Feb 4, 2025Updated last year
- Yet Another JSON Parser in Python☆11Mar 1, 2023Updated 3 years ago
- [NeurIPS 2025] Multipole Attention for Efficient Long Context Reasoning☆23Dec 5, 2025Updated 5 months ago
- ☆55Nov 22, 2022Updated 3 years ago
- Reading self-supervised learning backwards, part 1☆48Oct 30, 2022Updated 3 years ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated 2 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- ☆20May 3, 2025Updated last year
- Deep Learning Visualization Tools Using PyTorch☆11Feb 2, 2021Updated 5 years ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- 🇰🇷 Korean language skills for AI agents☆31Apr 26, 2026Updated last week
- 1st ranked light-weight face verification model for AI Challenge 2020, hosted by MSIT Korea.☆12Jul 28, 2020Updated 5 years ago
- Tensorflow implementation of MAMNet☆10Apr 2, 2020Updated 6 years ago
- How to create, train and quantize network, then integrate it into pre/post image processing and generate CUDA C++ code for targeting Jets…☆12May 7, 2025Updated 11 months ago
- ☆13Mar 5, 2024Updated 2 years ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆33Oct 20, 2025Updated 6 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆14Jan 8, 2026Updated 3 months ago
- [ICLR2025] Code and data for paper: Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasonin…☆42Mar 10, 2025Updated last year
- Hallym University Open Source SW Education Center☆11Jan 30, 2020Updated 6 years ago
- ☆13Oct 6, 2022Updated 3 years ago
- ☆14Aug 19, 2024Updated last year
- A terminal for JupyterLite.☆25Apr 15, 2026Updated 3 weeks ago
- Parallel Self-Adjusting Computation☆16Jul 5, 2021Updated 4 years ago
- Updated fork of g2pk☆13Aug 18, 2023Updated 2 years ago
- Zero-Shot Cross-Lingual Semantic Parsing (Sherborne & Lapata, ACL 2022)☆17May 16, 2022Updated 3 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- This repository includes code to test the speed of GPUs and CPUs☆17Apr 28, 2024Updated 2 years ago
- This repository is the official implementation of DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Re…☆39Mar 27, 2022Updated 4 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- my submitted project to UoL Programming with Data module☆12Jan 5, 2023Updated 3 years ago
- Comparing sequential forecasters via confidence sequences & e-processes☆10Oct 24, 2023Updated 2 years ago
- Robust multimodal integration method implemented in PyTorch and TensorFlow☆88Mar 5, 2021Updated 5 years ago
- ☆15Oct 12, 2020Updated 5 years ago
- Vitis-AI 1.3 TensorFlow2 flow with a custom CNN model, targeted ZCU102 evaluation board.☆15Apr 6, 2021Updated 5 years ago
- nnq_cnd_study stands for Neural Network Quantization & Compact Networks Design Study☆13Aug 31, 2020Updated 5 years ago