gcanat / video_reader-rsLinks
A library to fastly decode video with ffmpeg and rust
☆112Updated this week
Alternatives and similar repositories for video_reader-rs
Users that are interested in video_reader-rs are comparing it to the libraries listed below
Sorting:
- GPU based FFT written in Rust and CubeCL☆29Updated last month
- Blazingly fast inference of diffusion models.☆119Updated 10 months ago
- [ECCV 2024 & NeurIPS 2024 & ICLR 2026] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆270Updated last week
- A framework for building high-performance real-time multiple object trackers☆256Updated 10 months ago
- A diffusers API in Burn (Rust)☆25Updated last week
- 🦀 Low-level 3D Computer Vision library in Rust☆571Updated this week
- Rust standalone inference of Namo-500M series models. Extremly tiny, runing VLM on CPU.☆24Updated 10 months ago
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆40Updated last year
- Scaling Vision Pre-Training to 4K Resolution☆221Updated last month
- Asynchronous TensorRT for Rust.☆40Updated 4 months ago
- Savant Library with new generation primitives re-implemented in Rust☆19Updated this week
- Megvii FILE Library - Working with Files in Python same as the standard library☆168Updated 3 weeks ago
- ☆40Updated 3 months ago
- [ICCV 2025] The official implementation for EgoM2P: Egocentric Multimodal Multitask Pretraining.☆34Updated last month
- ☆33Updated 2 weeks ago
- Low rank adaptation (LoRA) for Candle.☆169Updated 9 months ago
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆11Updated last year
- A Rust library integrated with ONNXRuntime, providing a collection of Computer Vison and Vision-Language models such as YOLO, FastVLM, an…☆346Updated this week
- Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch☆153Updated last year
- ☆14Updated last year
- Official Implementation for our NeurIPS 2024 paper, "Don't Look Twice: Run-Length Tokenization for Faster Video Transformers".☆234Updated 10 months ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆40Updated last year
- ESRGAN implemented in rust with candle☆17Updated 2 years ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆134Updated last year
- [ICCV'2025 Highlight] MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation☆84Updated last month
- ObjCtrl-2.5D☆58Updated 10 months ago
- [ICLR 2026] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation☆260Updated last week
- Cosmos-Curate is a powerful video curation system that processes, analyzes, and organizes video content using advanced AI models and dist…☆143Updated 3 weeks ago
- ☆213Updated 11 months ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆77Updated 8 months ago