opendatalab/opendatalab-datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/opendatalab/opendatalab-datasets)

opendatalab / opendatalab-datasets

datasets resource

☆150

Alternatives and similar repositories for opendatalab-datasets

Users that are interested in opendatalab-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

opendatalab / labelU-Kit
View on GitHub
Data annotation component library --provided as NPM packages
☆157Updated this week
opendatalab / VIGC
View on GitHub
AAAI 2024: Visual Instruction Generation and Correction
☆97Feb 4, 2024Updated 2 years ago
opendatalab / WanJuan2.0-WanJuan-CC
View on GitHub
WanJuan-CC是以CommonCrawl为基础，经过数据抽取，规则清洗，去重，安全过滤，质量清洗等步骤得到的高质量数据。
☆14Apr 18, 2024Updated 2 years ago
opendatalab / labelU
View on GitHub
Open-source multimodal data annotation platform with AI auto-annotation support.
☆1,629Updated this week
opendatalab / Vis3
View on GitHub
Data browser based on s3. 一个基于 S3 的数据（json / jsonl / parquet / html / md等）可视化工具。👇 Try online.
☆89Apr 14, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
opendatalab / MLLM-DataEngine
View on GitHub
MLLM-DataEngine: An Iterative Refinement Approach for MLLM
☆49May 24, 2024Updated 2 years ago
open-mmlab / labelbee
View on GitHub
LabelBee is an annotation Library
☆303Updated this week
opendatalab / Meta-rater
View on GitHub
[ACL 2025 Best Theme Paper] This is the official implementation for the paper: "Meta-rater: A Multi-dimensional Data Selection Method for…
☆196Aug 29, 2025Updated 10 months ago
opendatalab / CLIP-Parrot-Bias
View on GitHub
ECCV2024_Parrot Captions Teach CLIP to Spot Text
☆66Sep 6, 2024Updated last year
opendatalab / LabelLLM
View on GitHub
The Open-Source Data Annotation Platform
☆1,262Jul 2, 2026Updated 2 weeks ago
open-mmlab / labelbee-client
View on GitHub
Out-of-the-box Annotation Toolbox
☆395Apr 19, 2024Updated 2 years ago
opendatalab / OHR-Bench
View on GitHub
(ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
☆104Dec 3, 2025Updated 7 months ago
opendatalab / mineru-vl-utils
View on GitHub
A Python package for interacting with the MinerU Vision-Language Model.
☆136Jun 11, 2026Updated last month
opendatalab / UniMERNet
View on GitHub
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
☆492Sep 28, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
EluvK / yaaa
View on GitHub
Flutter Client APP for LLMs(OpenAI, DeepSeek...)
☆15Jul 3, 2025Updated last year
opendatalab / laion5b-downloader
View on GitHub
☆121Jan 15, 2026Updated 6 months ago
iscyy / RTDETR
View on GitHub
专注于改进RT-DETR模型，🚀 in PyTorch >, Support to improve backbone, neck, head, loss, IoU and other modules🚀based on Ultralytics
☆20May 8, 2024Updated 2 years ago
xrose3159 / PaperPub
View on GitHub
PaperPub is an academic arena where diverse AI Agents read papers daily, pick apart each other's arguments, and fiercely debate.
☆43Jun 12, 2026Updated last month
LivingSkyTechnologies / Dense_Article_Dataset_DAD
View on GitHub
Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis
☆16Jan 13, 2022Updated 4 years ago
opendatalab / Miner-PDF-Benchmark
View on GitHub
MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
☆24Dec 11, 2024Updated last year
chineseocr / ai-medical
View on GitHub
陆续开源医疗行业的深度学习模型及数据集
☆13Dec 30, 2021Updated 4 years ago
Focusshang / Tutorial
View on GitHub
☆14Apr 19, 2024Updated 2 years ago
opendatalab / CrossViewDiff
View on GitHub
The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"
☆16Sep 2, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
opendatalab / DocLayout-YOLO
View on GitHub
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,232Apr 14, 2025Updated last year
MigoXLab / dingo
View on GitHub
Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool
☆728Jul 13, 2026Updated last week
chenxli / CS-note
View on GitHub
C++开发\机器学习\深度学习\推荐算法基础知识及面试题总结
☆22Mar 4, 2021Updated 5 years ago
opendatalab / LOKI
View on GitHub
[ICLR 2025 Spotlight] The official implementation of the paper “LOKI：A Comprehensive Synthetic Data Detection Benchmark using Large Multi…
☆180Feb 7, 2026Updated 5 months ago
ffreemt / deepl-tr-async
View on GitHub
deepl translate via pyppeteer
☆11Oct 17, 2023Updated 2 years ago
snssll / poe2openai
View on GitHub
A tool that converts the POE developer API to the standard OpenAI API format.
☆19Sep 22, 2024Updated last year
InternLM / InternLM
View on GitHub
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
☆7,244Oct 30, 2025Updated 8 months ago
InternLM / InternLM-XComposer
View on GitHub
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
☆2,921May 26, 2025Updated last year
open-compass / T-Eval
View on GitHub
[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
☆312Apr 3, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
opendatalab / HA-DPO
View on GitHub
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
☆104Jan 30, 2024Updated 2 years ago
kuaileBenbi / RK3588-yolov5-sort
View on GitHub
在RK3588上实现的yolov5+sort目标检测与跟踪（c++版本）
☆13May 28, 2024Updated 2 years ago
cqu20160901 / yolov8pose_dfl_rknn_Cplusplus
View on GitHub
yolov8pose 部署版本，将DFL放在后处理中，部署rk3588.
☆11Aug 21, 2024Updated last year
ygcinar / SmoothI
View on GitHub
☆10May 9, 2021Updated 5 years ago
StephenDHYang / UGBS-pytorch
View on GitHub
The official pytorch implementation of Exploring the User Guidance for More Accurate Building Segmentation from High-Resolution Remote Se…
☆18May 27, 2024Updated 2 years ago
liuchengwucn / FIMO
View on GitHub
☆38Jun 30, 2026Updated 3 weeks ago
sagoo-cloud / iotgateway
View on GitHub
SagooIOT 网关基础库
☆19Apr 3, 2026Updated 3 months ago