☆143Feb 20, 2026Updated last week
Alternatives and similar repositories for Capybara
Users that are interested in Capybara are comparing it to the libraries listed below
Sorting:
- FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)☆24Sep 24, 2023Updated 2 years ago
- Official implementation of "EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy", MICCAI 2…☆11Jan 29, 2026Updated last month
- Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Ro…☆24Jul 7, 2024Updated last year
- Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg…☆17Jul 7, 2024Updated last year
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆20Feb 23, 2025Updated last year
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆19Dec 28, 2024Updated last year
- ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models☆18Aug 9, 2024Updated last year
- ☆22Apr 4, 2022Updated 3 years ago
- Official implementation of “LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusio…☆20Jul 7, 2024Updated last year
- ☆19Jul 14, 2024Updated last year
- The implementation of "Learning Single Image Defocus Deblurring with Misaligned Training Pairs".☆21Nov 27, 2022Updated 3 years ago
- ☆53Dec 10, 2025Updated 2 months ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Apr 15, 2025Updated 10 months ago
- [CVPR 2021] High-Fidelity and Arbitrary Face Editing☆20Jun 27, 2021Updated 4 years ago
- Project page of "GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors"☆23Jul 1, 2024Updated last year
- [CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…☆71Feb 22, 2026Updated last week
- Consistent Autoregressive Video Generation with Long Context☆67Feb 6, 2026Updated 3 weeks ago
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Jul 19, 2024Updated last year
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆78Oct 15, 2024Updated last year
- [AAAI'2025] Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting☆35Jan 4, 2026Updated last month
- Pytorch Implementation for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pr…☆28Jul 31, 2022Updated 3 years ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆37Feb 26, 2025Updated last year
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Apr 13, 2022Updated 3 years ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆16Nov 19, 2025Updated 3 months ago
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance☆41Feb 19, 2025Updated last year
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders☆208Feb 13, 2026Updated 2 weeks ago
- Bringing Events into Video Deblurring with Non consecutively Blurry Frames (ICCV2021)☆39Jan 14, 2022Updated 4 years ago
- [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation☆367Jul 4, 2025Updated 7 months ago
- Ofiicial GoodDrag implementation.☆97Sep 25, 2025Updated 5 months ago
- [arXiv 2023] Improving Image Restoration through Removing Degradations in Textual Representations☆94Apr 17, 2024Updated last year
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- ☆10Jun 28, 2023Updated 2 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 4 months ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆21Feb 11, 2026Updated 2 weeks ago
- a Video Quality Analysis Toolkit☆13May 16, 2025Updated 9 months ago
- Information Extraction related tools and models☆10Mar 16, 2023Updated 2 years ago
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion☆45Aug 1, 2024Updated last year
- A python tool help to interact with chatgpt.☆10Dec 11, 2022Updated 3 years ago