Stability-AI / facexlib
FaceXlib aims to provide ready-to-use face-related functions based on current state-of-the-art (SOTA) open-source methods.
☆8 Updated last year
Related projects:
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT ☆47 Updated 2 months ago
- Implementation of a Light Recurrent Unit in Pytorch ☆43 Updated 3 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆15 Updated 3 months ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning! ☆40 Updated last year
- ☆64 Updated 8 months ago
- ☆30 Updated 3 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP ☆25 Updated 3 months ago
- Export utility for unconstrained channel pruned models ☆66 Updated last year
- example of using CoreML from c++ ☆21 Updated last year
- imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; it supports both image and video… ☆19 Updated 2 months ago
- An object detection codebase based on MegEngine. ☆28 Updated last year
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her… ☆29 Updated last year
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments ☆13 Updated last year
- IntLLaMA: A fast and light quantization solution for LLaMA ☆18 Updated last year
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, … ☆65 Updated 11 months ago
- ☆34 Updated 3 months ago
- Stable Diffusion in TensorRT 8.5+ ☆14 Updated last year
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei… ☆77 Updated 11 months ago
- ☆56 Updated 6 months ago
- A dashboard for exploring timm learning rate schedulers ☆18 Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ☆19 Updated 6 months ago
- ☆9 Updated last year
- Patch convolution to avoid large GPU memory usage of Conv2D ☆73 Updated 3 months ago
- The Triton backend for TensorRT. ☆59 Updated last week
- EdgeSAM model for use with Autodistill. ☆24 Updated 3 months ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts ☆101 Updated last year
- [CVPR-2023] Towards Any Structural Pruning ☆17 Updated last year
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del… ☆24 Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆12 Updated last month
- ONNX and TensorRT implementation of Whisper ☆55 Updated last year