🏭 Mega-Scale Multimodal Data Pipeline for SOTA Foundation Models
☆358Mar 25, 2026Updated 2 weeks ago
Alternatives and similar repositories for mega-data-factory
Users that are interested in mega-data-factory are comparing it to the libraries listed below.
- Explore PyTorch projects with ease☆163Jan 4, 2025Updated last year
- A high-performance machine learning library in pure Rust, offering statistical utilities, ML algorithms and neural networks, and future s…☆338Mar 17, 2026Updated 3 weeks ago
- 🐹 An Intelligent Phone That Never Sleeps.☆825Mar 23, 2026Updated 2 weeks ago
- Vanus is a serverless event streaming system with processing capabilities. It easily connects SaaS, Cloud Services, and Databases to he…☆1,697Mar 11, 2024Updated 2 years ago
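- Digital Base (数字底座) is a unified, secure management support platform for the digital transformation of large government bodies and enterprises, built on identity authentication, organizational structure, positions and duties, application systems, resource roles, data catalogs, security controls, and related functions. It follows the three-administrator management model, offers microservices, multi-tenancy, containerization, and support for domestic Chinese technology stacks, lets users quickly build their own business applications with a code generator, and can also link to…☆2,580Apr 1, 2026Updated last week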
- 🔬🦞 A self-evolving AI research colleague for scientists. 285 skills, zero hallucination, persistent memory.☆561Apr 2, 2026Updated last week
- An OpenClaw-native knowledge retention skill that turns raw inputs into structured practice so you can use what you know, not just store …☆415Mar 10, 2026Updated last month
- Configurable Multi-layered AI Agentic Safety Framework☆330Feb 15, 2026Updated last month
- 💰The only official version💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy mining pool fee skimming, mining pool proxy, mining pool relay, mining pool fee…☆3,866Mar 22, 2026Updated 2 weeks ago
- MLEvolve is an open-source autonomous system for end-to-end machine learning algorithm design and optimization powered by progressive sea…☆255Mar 31, 2026Updated last week
- The first open autoregressive foundational video AI model.☆2,892Oct 14, 2024Updated last year
- 🔥minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, minerproxy, mining pool fee skimming, mining pool relay, built for mining farm operations☆3,412Apr 2, 2026Updated last week
- The next-generation deep reinforcement learning toolkit☆3,462Jun 16, 2023Updated 2 years ago
- Official code repository for the research paper IDFuzz: Intelligent Directed Grey-box Fuzzing (USENIX Security 2025)☆87Jan 31, 2026Updated 2 months ago
- InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery☆1,269Mar 17, 2026Updated 3 weeks ago
- [AAAI 2026] NeuralOM: Neural Ocean Model for Subseasonal-to-Seasonal Simulation☆260Mar 20, 2026Updated 3 weeks ago
- Spring Boot Starter: Auto-convert existing REST APIs (@RestController) to MCP Server with zero/low-code. Expose controllers as MCP Tools …☆100Mar 12, 2026Updated 3 weeks ago
- Making ENS domains Google-visible - Open-source architecture for Web3 identity SEO and Knowledge Panel optimization☆91Oct 22, 2025Updated 5 months ago
- TVM Documentation in Simplified Chinese (TVM 中文文档)☆3,635Mar 12, 2026Updated 3 weeks ago
- Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale☆5,699Updated this week
- Wukong CRM (悟空CRM): a CRM system with a decoupled front end and back end, built on the Spring Cloud Alibaba microservice architecture with Vue and ElementUI☆2,408Aug 27, 2021Updated 4 years ago
- Two languages, one purpose: turning words into geometry.☆160Dec 31, 2025Updated 3 months ago
- Personal Website with Obsidian-like knowledge graph☆221Feb 8, 2026Updated 2 months ago
- Open-source ad fraud detection for small businesses using machine learning. Detect click fraud and bot traffic from Google Ads, Facebook …☆224Dec 17, 2025Updated 3 months ago
- Bitalostored is a high-performance distributed storage system with a core engine based on the self-developed bitalosdb, compatible with Redis prot…☆2,161Apr 3, 2026Updated last week
- Align Anything: Training All-modality Model with Feedback☆4,643Nov 27, 2025Updated 4 months ago
- A Doctor for your data☆3,489Jan 14, 2025Updated last year
- A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, wh…☆1,496Jul 21, 2023Updated 2 years ago
- PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆3,657Jan 26, 2026Updated 2 months ago
- LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data…☆3,227Apr 1, 2026Updated last week
- AI-powered tool for efficient abstract and PDF screening in systematic reviews.☆1,306Apr 1, 2026Updated last week
- Powerful Python-based tool for scraping Tweets, user data, and trends from Twitter without needing API access or authentication, offering…☆130Jan 4, 2025Updated last year
- Run AI models end-to-end encrypted.☆3,075Feb 10, 2025Updated last year
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.☆3,167Dec 15, 2025Updated 3 months ago
- ☆31Feb 3, 2025Updated last year
- ☆82Jan 28, 2026Updated 2 months ago
- ☆83Jan 24, 2026Updated 2 months ago
- UFO³: Weaving the Digital Agent Galaxy☆8,378Apr 3, 2026Updated last week
- A high-performance IM server.☆4,002Mar 29, 2026Updated last week