mllm-npu: training multimodal large language models on Ascend NPUs
☆95Aug 29, 2024Updated last year
Alternatives and similar repositories for mllm-npu
Users that are interested in mllm-npu are comparing it to the libraries listed below
Sorting:
- ☆14Nov 19, 2024Updated last year
- ☆17Nov 17, 2023Updated 2 years ago
- [NeurIPS 2023] CircuitFormer: Circuit as Set of Points☆38Nov 22, 2023Updated 2 years ago
- Official codes for ConMIM (ICLR 2023)☆58Feb 8, 2023Updated 3 years ago
- [IJCV 2024]☆21Nov 11, 2024Updated last year
- ☆59May 13, 2025Updated 9 months ago
- ☆12Sep 24, 2024Updated last year
- [ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition☆58Apr 8, 2025Updated 10 months ago
- Official github repo of G-LLaVA☆148Feb 20, 2025Updated last year
- Caffe++: assemble new features to enhance Caffe☕️☆11Dec 24, 2018Updated 7 years ago
- Multimodal Models in Real World☆556Feb 24, 2025Updated last year
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆34Nov 19, 2025Updated 3 months ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- Source code of ICLR2020 submisstion: Zeno++: Robust Fully Asynchronous SGD☆14Feb 2, 2020Updated 6 years ago
- EVE Series: Encoder-Free Vision-Language Models from BAAI☆368Jul 24, 2025Updated 7 months ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆33Jul 21, 2023Updated 2 years ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆13Jan 16, 2026Updated last month
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Aug 11, 2022Updated 3 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes"☆109Jun 17, 2025Updated 8 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Sep 12, 2023Updated 2 years ago
- Official implementation of SEED-LLaMA (ICLR 2024).☆642Sep 21, 2024Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers☆997Nov 25, 2025Updated 3 months ago
- [ECCV 2024] Occupancy as Set of Points☆91Jul 8, 2024Updated last year
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆65Sep 28, 2023Updated 2 years ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆37Nov 27, 2024Updated last year
- the C++ version of thundernet with ncnn☆14Feb 20, 2021Updated 5 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆34Apr 9, 2022Updated 3 years ago
- [IJCV 2025] MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning☆77May 30, 2025Updated 9 months ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆159Dec 6, 2024Updated last year
- The project is based on pytorch and integrates the current mainstream network architecture, including VGGnet, ResNet, Densenet, MobileNet…☆16Dec 24, 2022Updated 3 years ago
- Knowledge Distillation Toolbox for Semantic Segmentation☆17Nov 20, 2022Updated 3 years ago
- Code for "Bezier Everywhere All at Once: Learning Drivable Lanes as Bezier Graphs".☆24Mar 29, 2024Updated last year
- ☆21Jan 17, 2025Updated last year
- ☆21Feb 29, 2024Updated 2 years ago
- ☆28Aug 13, 2025Updated 6 months ago
- 使用OpenCV部署CoupledTPS,包含了肖像矫正,不规则边界的图像矩形化,旋转图像矫正,三个模型。依然是包含C++和Python两个版本的程序☆20Jul 4, 2024Updated last year
- Strong and Open Vision Language Assistant for Mobile Devices☆1,338Apr 15, 2024Updated last year
- WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation☆138Nov 12, 2023Updated 2 years ago