☆181Feb 27, 2026Updated 3 weeks ago
Alternatives and similar repositories for Capybara
Users that are interested in Capybara are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of "EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy", MICCAI 2…☆11Jan 29, 2026Updated last month
- Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Ro…☆24Jul 7, 2024Updated last year
- Official implementation of “LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusio…☆20Jul 7, 2024Updated last year
- Repository for paper Temporal consistency learning for video super-resolution.☆12Apr 27, 2022Updated 3 years ago
- FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)☆24Sep 24, 2023Updated 2 years ago
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆21Feb 23, 2025Updated last year
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆28Mar 17, 2026Updated last week
- PyTorch implements `Image Super-Resolution Using Very Deep Residual Channel Attention Networks` paper.☆15Dec 6, 2022Updated 3 years ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆52Updated this week
- ☆53Dec 10, 2025Updated 3 months ago
- S2ME: Spatial-Spectral Mutual Teaching and Ensemble Learning for Scribble-supervised Polyp Segmentation (MICCAI 2023)☆20Dec 1, 2023Updated 2 years ago
- The IP-Adapter training scripts and inference for Flux Model, which is implemented based on X-Lab☆17Oct 1, 2024Updated last year
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆34Nov 11, 2025Updated 4 months ago
- ☆22Apr 4, 2022Updated 3 years ago
- ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models☆18Aug 9, 2024Updated last year
- [AAAI'2025] Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting☆36Jan 4, 2026Updated 2 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆78Oct 15, 2024Updated last year
- The implementation of "Learning Single Image Defocus Deblurring with Misaligned Training Pairs".☆21Nov 27, 2022Updated 3 years ago
- ☆19Jul 14, 2024Updated last year
- [ECCV 2024] Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures☆33Oct 28, 2024Updated last year
- CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework" 🔥☆132Feb 22, 2026Updated last month
- Hand tracking with Mediapipe☆19Feb 16, 2024Updated 2 years ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆37Feb 26, 2025Updated last year
- Official Project Webpage for paper "DiffSRL: Learning Dynamic-aware State Representation for Control via Differentiable Simulation"☆12Apr 4, 2022Updated 3 years ago
- ☆13Jun 14, 2023Updated 2 years ago
- [CVPR 2021] High-Fidelity and Arbitrary Face Editing☆20Jun 27, 2021Updated 4 years ago
- [CVPR 2025] ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting☆33Dec 5, 2024Updated last year
- ☆34Jul 24, 2025Updated 8 months ago
- Official code repository of Laplacian Pyramid Pansharpening Network☆26May 28, 2024Updated last year
- ☆13Jul 10, 2024Updated last year
- a practicable Pytorch framework used in Deep Learning.☆25Feb 27, 2025Updated last year
- A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language.☆199Mar 11, 2026Updated last week
- [CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…☆78Feb 26, 2026Updated 3 weeks ago
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Jul 19, 2024Updated last year
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Apr 13, 2022Updated 3 years ago
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion☆45Aug 1, 2024Updated last year
- ☆11Feb 28, 2019Updated 7 years ago
- 自建生成二维码接口☆11Sep 27, 2023Updated 2 years ago
- [ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…☆36Jul 26, 2024Updated last year