xgen-universe/Capybara

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xgen-universe/Capybara)

xgen-universe / Capybara

☆202

Alternatives and similar repositories for Capybara

Users that are interested in Capybara are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DuNGEOnmassster / VideoGen-of-Thought
View on GitHub
[Neurips 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minim…
☆63Sep 22, 2025Updated 10 months ago
Tencent-Hunyuan / SAGE-GRPO
View on GitHub
Official Implementation of SAGE-GRPO:Manifold-Aware Exploration for Reinforcement Learning in Video Generation
☆126Apr 2, 2026Updated 3 months ago
ZhuWenjie98 / DDE
View on GitHub
(ECCV2026) Dual Distribution Estimation for Zero-shot Noisy Test-Time Adaptation with VLMs
☆15Jul 2, 2026Updated 3 weeks ago
boogu-project / Boogu-Image
View on GitHub
Boogu-Image-0.1 is an Apache-2.0 open-source image generation and editing model family that delivers near-closed-source performance with …
☆833Updated this week
iGuoYanjun / Memorize-When-Needed
View on GitHub
☆23Jun 29, 2026Updated 3 weeks ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
leeruibin / hybrid-forcing
View on GitHub
☆32Apr 29, 2026Updated 2 months ago
KlingAIResearch / UniVideo
View on GitHub
[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos
☆541Jul 3, 2026Updated 3 weeks ago
SAIS-FUXI / Omni-Video
View on GitHub
☆157Feb 28, 2026Updated 4 months ago
PeiwenSun2000 / X-Stream
View on GitHub
Official Repo of "$X$-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding"
☆33Jun 18, 2026Updated last month
MC-E / InstructX
View on GitHub
☆86Oct 10, 2025Updated 9 months ago
showlab / Kiwi-Edit
View on GitHub
A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language.
☆306May 13, 2026Updated 2 months ago
GeekGuru123 / ProfilingDiT
View on GitHub
☆20Jan 1, 2026Updated 6 months ago
SOTAMak1r / VINO-code
View on GitHub
A Unified Visual Generator with Interleaved OmniModal Context
☆232Mar 5, 2026Updated 4 months ago
lslrh / SyncNoise
View on GitHub
SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing
☆19Dec 28, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
JaydenLyh / Reward-Forcing
View on GitHub
[CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
☆352Dec 15, 2025Updated 7 months ago
MeiGen-AI / Infinite-World
View on GitHub
[ICML 2026] | Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory
☆195May 4, 2026Updated 2 months ago
byhuang123 / PoCo
View on GitHub
[CVPR2026] Official implementation of our paper “Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot…
☆19Apr 8, 2026Updated 3 months ago
PolyU-VCLab / DepthMaster
View on GitHub
DepthMaster: Unified Monocular Depth Estimation for Perspective and Panoramic Images
☆25Jun 13, 2026Updated last month
Multimedia-Analytics-Laboratory / dpdmd
View on GitHub
[ICML 2026] The offical code of Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis
☆87Jun 2, 2026Updated last month
EnVision-Research / ScalingAR
View on GitHub
[ICML 2026] ScalingAR: Scaling Confidence for Autoregressive Image Generation
☆22May 5, 2026Updated 2 months ago
csslc / Self-Transcendence
View on GitHub
[ECCV 2026] Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Trans…
☆37Jul 3, 2026Updated 3 weeks ago
MinghanLi / FiVE-Bench
View on GitHub
[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
☆38Apr 2, 2026Updated 3 months ago
Advocate99 / AssetFormer
View on GitHub
[ICLR'2026] AssetFormer: Modular 3D Assets Generation with Autoregressive Transformer
☆37Feb 13, 2026Updated 5 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
KlingAIResearch / VideoCanvas
View on GitHub
Official Code of "VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning"
☆68Oct 10, 2025Updated 9 months ago
cz-5f / LoVoRA.github.io
View on GitHub
☆41Dec 18, 2025Updated 7 months ago
KlingAIResearch / VideoAlign
View on GitHub
[NeurIPS 2025] Improving Video Generation with Human Feedback
☆486Sep 24, 2025Updated 10 months ago
SupstarZh / VividDreamer
View on GitHub
[ECCV2024] VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
☆10Jul 4, 2024Updated 2 years ago
Yaofang-Liu / Pusa-VidGen
View on GitHub
Pusa: Thousands Timesteps Video Diffusion Model
☆686Feb 13, 2026Updated 5 months ago
DwanZhang-AI / SePPO
View on GitHub
Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."
☆18Oct 7, 2024Updated last year
Yubo-Shankui / Bind-Your-Avatar-Implementation
View on GitHub
(CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…
☆34Apr 7, 2026Updated 3 months ago
matrix-agent / awesome-agentic-world-modeling
View on GitHub
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond
☆286Jun 27, 2026Updated 3 weeks ago
laulampaul / text-animator
View on GitHub
☆20Jun 26, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
XianfengWu01 / LightGen
View on GitHub
An Efficient Text-to-Image Generation Pretrain Pipeline
☆132Apr 18, 2025Updated last year
ShawnChenn / FlexibleReflectionRemoval
View on GitHub
AAAI 25' Flexible Image Reflection Removal with Sparse Human Guidance
☆12Jul 7, 2025Updated last year
xiechenxi99 / DNAEdit_code
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation for DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing
☆32Jan 23, 2026Updated 6 months ago
Liangsanzhu / Photo3D
View on GitHub
Photo3D: Advancing Photorealistic 3D Generation through Structure‑Aligned Detail Enhancement
☆22Mar 18, 2026Updated 4 months ago
feizc / Ingredients
View on GitHub
Blending Custom Photos with Video Diffusion Transformers
☆50Jan 21, 2025Updated last year
MrPeterJin / gpu_grabber
View on GitHub
A script for checking availability of GPUs and runs your scripts during peak times
☆10Feb 24, 2023Updated 3 years ago
LAW1223 / AlignVid
View on GitHub
☆23May 29, 2026Updated last month