ZhihaoZhu/cap-vlm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZhihaoZhu/cap-vlm)

ZhihaoZhu / cap-vlm

Perceive, Predict, Verify: Continual Pre-training for Multimodal Agentic Foundation Models

☆82

Alternatives and similar repositories for cap-vlm

Users that are interested in cap-vlm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZhihaoZhu / Auto-GUI-Code-Generation
View on GitHub
☆100Apr 4, 2026Updated 3 months ago
Hunyuan-PromptEnhancer / PromptEnhancer
View on GitHub
[CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.
☆3,729Jun 10, 2026Updated last month
hyperai / tvm-cn
View on GitHub
TVM Documentation in Chinese Simplified / TVM 中文文档
☆3,855May 20, 2026Updated 2 months ago
limix-ldm-ai / LimiX
View on GitHub
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence https://arxiv.org/abs/2509.03505
☆3,823Jun 16, 2026Updated last month
zhouxr6066 / Res-SAM
View on GitHub
Res-SAM Framework for GPR Underground Hazard Detection
☆1,620Jun 15, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ScorpioLea / AiCE
View on GitHub
Predicting high-fitness mutations based on protein inverse folding models
☆1,142Sep 30, 2025Updated 9 months ago
ModelEngine-Group / nexent
View on GitHub
Nexent is a zero-code platform for auto-generating production-grade AI agents using Harness Engineering principles — unified tools, skill…
☆5,761Updated this week
ZJU4HealthCare / Foundations-of-Medical-LLMs
View on GitHub
Foundations of Medical Large Language Model Learning
☆1,696May 27, 2026Updated last month
Klavis-AI / klavis
View on GitHub
Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale
☆5,771Jun 1, 2026Updated last month
open-gigaai / giga-brain-0
View on GitHub
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
☆2,554Mar 10, 2026Updated 4 months ago
DataArcTech / DataArc-SynData-Toolkit
View on GitHub
Synthetic Data Generation Platform By DataArcTech
☆1,759Jun 30, 2026Updated 3 weeks ago
Kail-Fu / InterviewOS
View on GitHub
Replace coding puzzles with real-work simulations.
☆1,906Jul 10, 2026Updated last week
yizhang7210 / liang
View on GitHub
Liang - Non functional requirements should be part of function interfaces
☆1,010Nov 8, 2021Updated 4 years ago
Lucas0623z / NoteLite
View on GitHub
☆856Jul 9, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dindin0497 / HearIt
View on GitHub
☆1,843Feb 14, 2026Updated 5 months ago
siyuanchen0214 / Scam-AI-Multi-modal-Evaluation-System
View on GitHub
☆1,001Nov 5, 2025Updated 8 months ago
WildDataX / suppr-zotero-plugin
View on GitHub
Translate PDF, Word, PowerPoint, etc. | zotero翻译插件，微信扫码注册，新用户可免费翻译25万汉字或100万个英文字母。超能文献官网:suppr.wilddata.cn；
☆2,007Jun 24, 2026Updated 3 weeks ago
Devin-AXIS / iPolloWork
View on GitHub
A source-available, local-first alternative to Codex and Claude Code: an AI workspace for code, files, docs, websites, presentations, des…
☆1,559Updated this week
FrankChen021 / datastoria
View on GitHub
AI-native ClickHouse console for your cluster diagnostics and query generation, optimization and data visualization.
☆325Updated this week
YeQing17-2026 / OmniAgent
View on GitHub
An agent capable of self-evolving and dynamically hardening security
☆2,485May 25, 2026Updated last month
Team-Commonly / commonly
View on GitHub
Open-source workspace where your agents and team share one memory. Any runtime, your infra — self-host in one command, no per-agent fees.
☆1,263Updated this week
anysearch-ai / anysearch-mcp-server
View on GitHub
Unified real-time search engine skill for AI agents.
☆1,558Jul 10, 2026Updated last week
Emiyaaaaa / HiveMind
View on GitHub
☆1,077Jul 1, 2026Updated 2 weeks ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
fim-ai / fim-one
View on GitHub
Open-source agent platform for Global × China enterprises — wire every system through one agent core. Self-hosted, any LLM.
☆1,355Updated this week
SuanmoSuanyangTechnology / MemoryBear
View on GitHub
MemoryBear Equip AI with human-like memory capability
☆4,836Updated this week
dindin0497 / SeeIt
View on GitHub
☆1,541Sep 18, 2025Updated 10 months ago
FxPool / FXMinerProxy
View on GitHub
🔥minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,矿池抽水,矿池中转,矿场运维专用
☆3,709Updated this week
lattegou / airtiz
View on GitHub
☆1,203Apr 25, 2026Updated 2 months ago
open-gigaai / giga-models
View on GitHub
GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models
☆1,054Dec 8, 2025Updated 7 months ago
DeepWism / DeepWism-R2
View on GitHub
DeepWism R2 is a next-generation AGI system built on the T3CEDS framework (Thin-Thick-Thin Crowd Entropy Dynamics System), which redefine…
☆1,016Jun 27, 2025Updated last year
OpenDCAI / DataFlow
View on GitHub
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
☆6,693Updated this week
LazyAGI / LazyLLM
View on GitHub
Easiest and laziest way for building multi-agent LLMs applications.
☆3,855Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
kklt92 / awesome-ai-extensions
View on GitHub
A curated list of AI-powered browser extensions, plugins, and add-ons.
☆528Apr 8, 2026Updated 3 months ago
qualcomm / GenieX
View on GitHub
Run frontier LLMs and VLMs locally on Qualcomm devices across NPU, GPU, and CPU with a few lines of code
☆8,234Updated this week
LightningRAG / LightningRAG
View on GitHub
LightningRAG is a full-stack Vue + Gin starter with a decoupled frontend and backend, plus built-in, extensible RAG (retrieval-augmented …
☆458Apr 17, 2026Updated 3 months ago
kevinluosl / deepbot
View on GitHub
DeepBot is a system-level AI assistant built for both personal productivity and enterprise workflows — one-click setup, seamless experien…
☆2,361May 23, 2026Updated last month
wang-rui / phishguard-scaffold
View on GitHub
Joint Semantic Detection and Dissemination Control of Phishing Attacks on Social Media via LLama- Based Modeling
☆1,008Feb 15, 2026Updated 5 months ago
paean-ai / deeptide
View on GitHub
Built by DeepSeek, for DeepSeek — a Swift-native macOS coding agent
☆1,000Jul 8, 2026Updated last week
evelyyyyynnnnn / 3.0-Financial-Ai-Systems
View on GitHub
Machine learning and optimization models designed to enhance financial system stability through portfolio optimization, risk forecasting,…
☆283Mar 20, 2026Updated 4 months ago