☆56Jan 30, 2026Updated 5 months ago
Alternatives and similar repositories for World-Craft
Users that are interested in World-Craft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆21Nov 1, 2025Updated 8 months ago
- Implementation of few-shot baseline for MedFMC☆18Nov 27, 2023Updated 2 years ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆64Jul 5, 2025Updated 11 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]☆90May 8, 2026Updated last month
- [ICLR 2026🔥] MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head☆149May 19, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- ☆37Sep 5, 2024Updated last year
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- Official Code Release of NeurIPS 2025 Paper: HoloScene: Simulation‑Ready Interactive 3D Worlds from a Single Video☆106Oct 8, 2025Updated 8 months ago
- ☆24May 16, 2026Updated last month
- [ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible☆127Aug 10, 2025Updated 10 months ago
- ☆11Jan 18, 2024Updated 2 years ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆51Mar 25, 2025Updated last year
- ☆34Apr 1, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12Sep 19, 2021Updated 4 years ago
- Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)☆31Jan 8, 2025Updated last year
- Offical implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)☆88Jul 26, 2025Updated 11 months ago
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up…☆26Dec 9, 2024Updated last year
- All code for FlairGPT: Repurposing LLMs for Interior Designs, Eurographics 2025☆21Mar 6, 2025Updated last year
- Adapting LLaMA Decoder to Vision Transformer☆30May 20, 2024Updated 2 years ago
- A small library of 3D related utilities used in my research.☆10Mar 5, 2022Updated 4 years ago
- The official implementation of 《MLLMs-Augmented Visual-Language Representation Learning》☆31Mar 12, 2024Updated 2 years ago
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆23Sep 29, 2022Updated 3 years ago
- The official implementation of Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation.☆17Sep 19, 2022Updated 3 years ago
- Make Your Training Flexible: Towards Deployment-Efficient Video Models☆40Jun 11, 2025Updated last year
- Our CVPR 2022 paper, iPLAN: Interactive and Procedural Layout Planning