Aaronhuang-778 / MC-MoE
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More
☆20Updated last month
Related projects ⓘ
Alternatives and complementary repositories for MC-MoE
- Denoising Diffusion Step-aware Models (ICLR2024)☆52Updated 9 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆76Updated last month
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆57Updated last month
- Empowering Unified MLLM with Multi-granular Visual Generation☆104Updated 3 weeks ago
- 😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D☆24Updated 4 months ago
- VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation☆129Updated 2 weeks ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆78Updated 9 months ago
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detecti…☆110Updated last month
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆43Updated 5 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆40Updated 4 months ago
- Can 3D Vision-Language Models Truly Understand Natural Language?☆21Updated 7 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆44Updated 2 weeks ago
- This is the official implementation for ControlVAR.☆52Updated last month
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆74Updated 7 months ago
- [CVPR 2023] Vote2Cap-DETR and [T-PAMI 2024] Vote2Cap-DETR++; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D De…☆84Updated 2 months ago
- Implements VAR+CLIP for image generation☆78Updated 3 months ago
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆51Updated 7 months ago
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆40Updated last month
- The paper collections for the autoregressive models in vision.☆101Updated this week
- DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆113Updated 5 months ago
- ☆45Updated 5 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆43Updated last month
- [CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds☆53Updated last year
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆96Updated 6 months ago
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens☆53Updated 3 weeks ago
- ☆32Updated 3 weeks ago
- Code of 3DMIT: 3D MULTI-MODAL INSTRUCTION TUNING FOR SCENE UNDERSTANDING☆24Updated 3 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆52Updated last month
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆64Updated 6 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆35Updated 3 weeks ago