DorothyDUUU / SWE-Dev
Official code space for "SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development"
☆29Updated 3 weeks ago
Alternatives and similar repositories for SWE-Dev
Users who are interested in SWE-Dev are comparing it to the repositories listed below
- MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆13Updated 4 months ago
- A paper list for spatial reasoning☆94Updated 2 weeks ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆119Updated last month
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆17Updated last month
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆64Updated 3 weeks ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆34Updated 7 months ago
- VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆51Updated 2 weeks ago
- Implementation of the MATRIX framework (ICML 2024)☆54Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆22Updated 10 months ago
- ICLR2024 statistics☆47Updated last year
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆35Updated last month
- Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"☆18Updated 7 months ago
- Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing'☆33Updated this week
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆22Updated 4 months ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆29Updated 7 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠☆63Updated 3 weeks ago
- [NeurIPS 2024] The official implementation of the research paper "FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆45Updated 4 months ago
- A collection of vision foundation models unifying understanding and generation.☆55Updated 5 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆124Updated 5 months ago
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆41Updated 2 months ago
- ☆15Updated last year
- TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆51Updated last week
- Code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆55Updated 10 months ago
- CVPR25☆22Updated 3 months ago
- This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"☆49Updated 8 months ago
- [CIKM-2024] Official code for work "ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance"☆18Updated 10 months ago
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025☆17Updated 2 months ago
- Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness☆43Updated last year
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?☆65Updated 2 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆49Updated 2 months ago