mulingcloud / mlcbase
The base module of all MuLingCloud modules and applications.
☆10Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for mlcbase
- A collection of awesome text-to-image generation studies.☆431Updated this week
- A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC☆447Updated last week
- The paper collections for the autoregressive models in vision.☆233Updated this week
- Diffusion Model-Based Image Editing: A Survey (arXiv)☆487Updated last week
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆302Updated last month
- a brief repo about paper research☆13Updated 2 months ago
- A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems☆202Updated 2 months ago
- A collection of awesome video generation studies.☆350Updated this week
- A collection of awesome image inpainting studies.☆173Updated this week
- [ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"☆683Updated 3 months ago
- ☆38Updated 3 months ago
- 📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).☆466Updated last month
- Official implementation for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way☆18Updated last month
- 🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).☆364Updated last week
- A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..☆461Updated this week
- ☆838Updated 4 months ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆239Updated last month
- [Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation☆213Updated this week
- Papers and codes collection for customized, personalized and editable generative models☆23Updated last month
- A paper list of some recent works about Token Compress for Vit and VLM☆149Updated this week
- ☆11Updated 2 months ago
- Collection of recent methods on 3D Scene Generation from Text Description.☆11Updated 3 weeks ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆290Updated 3 months ago
- Some basic topics in the field of deep learning, including papers, notes and codes, etc., hope to be helpful to later people.☆20Updated 5 months ago
- 中科大数字图像分析(周文罡、李厚强等)2022秋学期复习资料☆17Updated last year
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆115Updated last month
- ☆223Updated 8 months ago
- [CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want☆711Updated 3 months ago
- ☆62Updated this week
- Diffusion Feedback Helps CLIP See Better☆216Updated 3 months ago