OpenDCAI/DataFlow-MM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenDCAI/DataFlow-MM)

OpenDCAI / DataFlow-MM

Dataflow-MM, multi-media operators for Dataflow. We aim to prepare data for Multimodal Large Language Models.

☆46

Alternatives and similar repositories for DataFlow-MM

Users that are interested in DataFlow-MM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenDCAI / DataMind
View on GitHub
All-in-one intelligent assistant powered by LlamaIndex — RAG, GraphRAG, NL2SQL, Skills & Memory with multimodal support.
☆22Jul 8, 2026Updated last week
OpenDCAI / Dataflow-LoopAI
View on GitHub
Dataflow-LoopAI is an intelligent system with self-optimization capabilities that automatically detects and evaluates generation deficien…
☆17Updated this week
OpenDCAI / DataFlow-KG
View on GitHub
DataFlow Knowledge Graph -- Knowledge graph data preparation with DataFlow style operators and pipelines
☆27Jun 30, 2026Updated 3 weeks ago
OpenDCAI / Flash-MinerU
View on GitHub
Ray-powered accelerator for MinerU, turning PDF → Markdown into a scalable, cluster-ready data infrastructure. 基于 Ray 的 MinerU 加速层，将 PDF …
☆63Apr 20, 2026Updated 3 months ago
arctanxarc / UniCTokens
View on GitHub
A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…
☆130Jun 15, 2026Updated last month
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zwt233 / GAMLP
View on GitHub
☆19Mar 21, 2022Updated 4 years ago
tanABCC / VABench
View on GitHub
☆16Jul 8, 2026Updated last week
Aurora-slz / MM-Verify
View on GitHub
☆19Oct 28, 2025Updated 8 months ago
HKUST-KnowComp / NAACL
View on GitHub
The official codebase for our paper "NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems"
☆24Feb 28, 2026Updated 4 months ago
lijunxian111 / PlanViz
View on GitHub
Official repo for PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks
☆17Feb 17, 2026Updated 5 months ago
dada-qin / Data-Centric_LLM_Studies
View on GitHub
A list of papers about data quality in Large Language Models (LLMs)
☆27Dec 14, 2023Updated 2 years ago
SooLab / Part2Object
View on GitHub
[ECCV 2024] The official PyTorch implementation of the "Part2Object: Hierarchical Unsupervised 3D Instance Segmentation".
☆26Sep 12, 2024Updated last year
beccabai / Data-centric_multimodal_LLM
View on GitHub
Survey on Data-centric Large Language Models
☆94Jul 8, 2024Updated 2 years ago
OpenDCAI / OpenWorldLib
View on GitHub
Unified Codebase for Advanced World Models.
☆843Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
QuantaAlpha / chain-of-mindset
View on GitHub
☆65Feb 11, 2026Updated 5 months ago
ChenShawn / MultiModal-Jupyter-Sandbox
View on GitHub
Simple code sandbox supporting jupyter notebook style code execution. Used for agent training
☆24Dec 5, 2025Updated 7 months ago
LiuHengyu321 / IR3D-Bench
View on GitHub
[NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering
☆46Oct 15, 2025Updated 9 months ago
dsdanielpark / arxiv2text
View on GitHub
Converting PDF files to text, mainly with a focus on arXiv papers.
☆25Feb 19, 2024Updated 2 years ago
VisionXLab / AdapTok
View on GitHub
[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆28Mar 15, 2026Updated 4 months ago
cbenge509 / arxiv-ai-analysis
View on GitHub
A visualization experience of AI/ML academic papers hosted on ArXiV - for project work at the University of California, Berkeley MIDS pro…
☆10Feb 10, 2023Updated 3 years ago
agents-x-project / PyVision
View on GitHub
[MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."
☆162Jul 22, 2025Updated 11 months ago
Hesse73 / RLVR-Directions
View on GitHub
Source Code for our ICLR'26 paper
☆17Feb 22, 2026Updated 4 months ago
WesLee88524 / C-Drag-Official-Repo
View on GitHub
☆14Feb 28, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Advocate99 / AssetFormer
View on GitHub
[ICLR'2026] AssetFormer: Modular 3D Assets Generation with Autoregressive Transformer
☆37Feb 13, 2026Updated 5 months ago
NevaMind-AI / open-personal-agent
View on GitHub
The first open-sourced personalized agent
☆31Dec 1, 2025Updated 7 months ago
Fzkuji / swat-attention
View on GitHub
🚀 Sliding Window Attention Training for Efficient Large Language Models
☆19Jun 7, 2026Updated last month
SooLab / MVTokenFlow
View on GitHub
[ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow
☆27Apr 9, 2025Updated last year
stau-7001 / S3AI
View on GitHub
☆15Feb 18, 2025Updated last year
hwanyu112 / VIBE-Benchmark
View on GitHub
☆27Feb 3, 2026Updated 5 months ago
OpenDCAI / Data-Preparation-Bench
View on GitHub
☆216Jun 20, 2026Updated last month
LianjiaTech / astra
View on GitHub
ASTRA is an end-to-end system for synthesizing agentic trajectories and rule-verifiable environments for SFT and RL training, developed b…
☆148Jan 30, 2026Updated 5 months ago
ICTMCG / GRE
View on GitHub
Generative Regional Editing (GRE) Benchmark
☆20Sep 10, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
NUS-HPC-AI-Lab / InfoGrowth
View on GitHub
Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data
☆20Aug 6, 2024Updated last year
lxtGH / DenseWorld-1M
View on GitHub
Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"
☆129Oct 2, 2025Updated 9 months ago
XDU-419Hub / ai-final-exam
View on GitHub
西电18级智科期末考试整理
☆16Jan 16, 2021Updated 5 years ago
Open-Model-Initiative / imagegen-speedrun
View on GitHub
We bring the spirit of nanogpt-speedrun into the omni-modal world
☆15Jan 31, 2026Updated 5 months ago
ypwang61 / StoryEval
View on GitHub
[CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation
☆20May 2, 2025Updated last year
tadpole / AutoNE
View on GitHub
The Implementation of "AutoNE: Hyperparameter Optimization for Massive Network Embedding"(KDD 2019)
☆17Jul 6, 2023Updated 3 years ago
opendatalab / TRivia
View on GitHub
(CVPR 2026) TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition
☆35Jul 14, 2026Updated last week