Ascend/MindSpeed-MM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ascend/MindSpeed-MM)

Ascend / MindSpeed-MM

☆43

Alternatives and similar repositories for MindSpeed-MM

Users that are interested in MindSpeed-MM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Ascend / MindSpeed-RL
View on GitHub
☆58May 20, 2026Updated last month
amazon-science / PIXELS
View on GitHub
Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"
☆11Dec 17, 2024Updated last year
mashijie1028 / GenHancer
View on GitHub
(ICCV 2025) Enhance CLIP and MLLM's fine-grained visual representations with generative models.
☆78Jun 25, 2025Updated last year
open-nlplab / fastchatgpt
View on GitHub
A python tool help to interact with chatgpt.
☆10Dec 11, 2022Updated 3 years ago
Tang1705 / Video-Super-Resolution-Rankings
View on GitHub
☆25Jul 31, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
WangRongsheng / IvyGPT
View on GitHub
[CICAI 2023] The official codes for "Ivygpt: Interactive chinese pathway language model in medical domain"
☆57Sep 29, 2024Updated last year
cyr19 / MENLI
View on GitHub
☆17Nov 20, 2023Updated 2 years ago
JennyXieJiayi / UnifiedSSR
View on GitHub
[WWW '24] UnifiedSSR: A Unified Framework of Sequential Search and Recommendation
☆12Feb 16, 2024Updated 2 years ago
personalizedretrieval / xpert
View on GitHub
Code for XPERT algorithm from Personalized Retrieval over Millions of Items
☆13Sep 14, 2023Updated 2 years ago
MetabrainAGI / Awaker2.5-VL
View on GitHub
☆35Jan 21, 2025Updated last year
shawn0728 / Unify-Agent
View on GitHub
🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.
☆83May 2, 2026Updated 2 months ago
jerry4h / Face_Xray
View on GitHub
A 3rd-party implemented Face-Xray for deepfake detection.
☆13Jun 2, 2020Updated 6 years ago
qiuyue1993 / Notes
View on GitHub
Research Notes
☆11Sep 13, 2020Updated 5 years ago
Rintarooo / PSPNet
View on GitHub
Semantic Segmentation for CityScapes dataset, Pyramid Scene Parsing Network
☆10Nov 7, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
guoshangwei / Research-Handbook-CS
View on GitHub
给科研小白的一些资源与工具推荐
☆18Jul 6, 2020Updated 6 years ago
zhengzhi-1997 / LLM-TRSR
View on GitHub
☆16May 22, 2025Updated last year
cheryyunl / ROVER
View on GitHub
Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
☆26Dec 12, 2025Updated 6 months ago
Zheng222 / VideoHDR
View on GitHub
☆17Jun 17, 2020Updated 6 years ago
QinRui-k / GCN-CIS
View on GitHub
☆14May 27, 2024Updated 2 years ago
alexsusu / video-diff
View on GitHub
Video Alignment for Change Detection (Video-Based Change Detection) computer vision application
☆12Jan 20, 2020Updated 6 years ago
QinRui-k / MVC-Net
View on GitHub
☆16Sep 26, 2024Updated last year
comeonyang / Depth-Estimation-DCNF
View on GitHub
Depth estimation of a single RGB image based on deep convolutional neural fields.
☆19May 26, 2017Updated 9 years ago
jiazhihao / attention_superoptimizer
View on GitHub
An Attention Superoptimizer
☆22Jan 20, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
QMME / T2VQA
View on GitHub
☆27Nov 27, 2024Updated last year
boyazeng / weight_memorization
View on GitHub
Code release for "Generative Modeling of Weights: Generalization or Memorization?"
☆22Apr 9, 2026Updated 2 months ago
micky-li-hd / CoCo
View on GitHub
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation
☆54Apr 9, 2026Updated 2 months ago
jbender / docker-kaldi
View on GitHub
☆21Feb 5, 2018Updated 8 years ago
TencentARC / ARC-Chapter
View on GitHub
Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
☆43Nov 19, 2025Updated 7 months ago
LaVi-Lab / Visual-Table
View on GitHub
[EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"
☆20Oct 17, 2024Updated last year
DerrickWang005 / LaVin-DiT
View on GitHub
Official implementation of LaVin-DiT
☆53Jan 27, 2025Updated last year
jayusxp / UECA-Prompt
View on GitHub
UECA-Prompt: Universal Prompt for Emotion Cause Analysis（COLING 2022）
☆16Jun 6, 2023Updated 3 years ago
weitianxin / UniMP
View on GitHub
[ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond
☆23Apr 29, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
AI-Hub-Admin / SEMINAR
View on GitHub
☆16Jun 14, 2024Updated 2 years ago
LuoweiZhou / detectron-vlp
View on GitHub
Detectron for image/video region feature extraction, inspired by Xinlei's repo
☆22Nov 21, 2020Updated 5 years ago
daeunni / VideoRepair
View on GitHub
Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"
☆51Apr 7, 2026Updated 2 months ago
mtyiu / memec
View on GitHub
MemEC: An Erasure-Coding-Based Distributed In-Memory Key-Value Store
☆11Mar 30, 2017Updated 9 years ago
coldmanck / RVL-BERT
View on GitHub
The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…
☆18Oct 21, 2022Updated 3 years ago
bruceyo / V-PETL
View on GitHub
Towards a Unified View on Visual Parameter-Efficient Transfer Learning
☆26Oct 13, 2022Updated 3 years ago