☆198Feb 27, 2026Updated 3 months ago
Alternatives and similar repositories for Capybara
Users that are interested in Capybara are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of "EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy", MICCAI 2…☆12Jan 29, 2026Updated 4 months ago
- Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Ro…☆27Jul 7, 2024Updated last year
- Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg…☆17Jul 7, 2024Updated last year
- A script for checking availability of GPUs and runs your scripts during peak times☆10Feb 24, 2023Updated 3 years ago
- Official implementation of “LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusio…☆21Jul 7, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Repository for paper Temporal consistency learning for video super-resolution.☆12Apr 27, 2022Updated 4 years ago
- FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)☆24Sep 24, 2023Updated 2 years ago
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆21Feb 23, 2025Updated last year
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆32Mar 17, 2026Updated 2 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆57Mar 28, 2026Updated 2 months ago
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆67Mar 27, 2023Updated 3 years ago
- S2ME: Spatial-Spectral Mutual Teaching and Ensemble Learning for Scribble-supervised Polyp Segmentation (MICCAI 2023)☆21Dec 1, 2023Updated 2 years ago
- Official Repository for the ICML 2023 paper "BiRT: Bio-inspired Replay in Vision Transformers for Continual Learning"☆16Oct 11, 2023Updated 2 years ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆19Dec 28, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICCV,2023]Sample-adaptive Augmentation for Point Cloud Recognition Against Real-world Corruptions☆19Dec 25, 2023Updated 2 years ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Apr 15, 2025Updated last year
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆39Nov 11, 2025Updated 7 months ago
- ☆22Apr 4, 2022Updated 4 years ago
- ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models☆18Aug 9, 2024Updated last year
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆40Dec 13, 2025Updated 6 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆79Oct 15, 2024Updated last year
- The implementation of "Learning Single Image Defocus Deblurring with Misaligned Training Pairs".☆21Nov 27, 2022Updated 3 years ago
- ☆20Jul 14, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ECCV 2024] Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures☆34Oct 28, 2024Updated last year
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance☆41Feb 19, 2025Updated last year
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆58May 3, 2026Updated last month
- ☆15Nov 3, 2022Updated 3 years ago
- CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"☆163Feb 22, 2026Updated 3 months ago
- The code for "Toward Accurate and Temporally Consistent Video Restoration from Raw Data"☆16Dec 25, 2023Updated 2 years ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆37Feb 26, 2025Updated last year
- Official Project Webpage for paper "DiffSRL: Learning Dynamic-aware State Representation for Control via Differentiable Simulation"☆12Apr 4, 2022Updated 4 years ago
- [CVPR 2021] High-Fidelity and Arbitrary Face Editing☆20Jun 27, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Pytorch Implementation for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pr…☆28Jul 31, 2022Updated 3 years ago
- ☆38Jul 24, 2025Updated 10 months ago
- [ICLR 2026] - One2Scene☆43May 25, 2026Updated 2 weeks ago
- ☆13Jul 10, 2024Updated last year
- a practicable Pytorch framework used in Deep Learning.☆25Feb 27, 2025Updated last year
- Project page of "GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors"☆23Jul 1, 2024Updated last year
- Consistent Autoregressive Video Generation with Long Context☆88Feb 6, 2026Updated 4 months ago