Wakals/CoVT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Wakals/CoVT)

Wakals / CoVT

Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"

☆306

Alternatives and similar repositories for CoVT

Users that are interested in CoVT are comparing it to the libraries listed below

Sorting:

soong-prog / instGallery
View on GitHub
🎨An interactive multimedia virtual art gallery一个互动多媒体虚拟画廊
☆30Jun 21, 2025Updated 8 months ago
13906334209 / gis
View on GitHub
☆30Nov 29, 2025Updated 3 months ago
clinton1116 / scope-st
View on GitHub
Self-supervised graph diffusion encoder for spatial transcriptomics data (SCOPE-ST).
☆32Nov 26, 2025Updated 3 months ago
aiot-lab-yin / Dockerized-wifi-iq-preamble-capture
View on GitHub
Wi-Fi IQ capture system using USRP B210 (Dockerized)
☆51Nov 13, 2025Updated 3 months ago
mire403 / VisionForge
View on GitHub
VisionForge是一个轻量级、高扩展性的大模型图片训练&描述工具生成器，支持多家大模型API（Google、OpenAI 兼容、DeepSeek、Qwen、GLM、Claude、Doubao、自定义模型）。它提供多图片上传、提示词优化、自动生成JSONL训练数据、多…
☆47Jan 22, 2026Updated last month
ylouis83 / Podcastllm
View on GitHub
☆18Dec 8, 2025Updated 2 months ago
Brotherc / openplatform
View on GitHub
企业级开放平台，包括文档中心、API中心等
☆128Sep 20, 2025Updated 5 months ago
pzcddm / FastSketchLSH
View on GitHub
This is the repo to implement the fastsketch or even better minhash-based jaccard estimator with Locality sensitive hashing to deduplicat…
☆41Feb 19, 2026Updated last week
WyRainBow / Resume-Agent
View on GitHub
一句话生成在线可修改的PDF
☆129Updated this week
ordylan / FenScribe
View on GitHub
FenScribe - A Smart PDF Layout Optimizer
☆114Jul 22, 2025Updated 7 months ago
gqylpy / exceptionx
View on GitHub
The `exceptionx` is a flexible and convenient Python exception handling library that allows you to dynamically create exception classes a…
☆129May 14, 2025Updated 9 months ago
zhensherlock / vue-devtools-unlocker
View on GitHub
Enable Vue DevTools in production environments
☆126Updated this week
Alanosy / AgilePMB
View on GitHub
基于SpringBoot的项目管理系统-后端
☆124Jul 12, 2025Updated 7 months ago
Shengxiang-Lin / QAQ-QQ-AI-QUEST
View on GitHub
This project develops an intelligent chatbot system in Rust language, builds a secure and stable server to store user information and cha…
☆147Nov 8, 2025Updated 3 months ago
zlsgo / zdb
View on GitHub
小巧的 Golang 数据库操作库
☆101Dec 24, 2025Updated 2 months ago
haowei-freesky / HERMES
View on GitHub
Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding"
☆57Jan 23, 2026Updated last month
wzqvip / jetson-pytorch-builder
View on GitHub
build PyTorch with CUDA for Jetson Orin and Thor.
☆32Dec 4, 2025Updated 2 months ago
styufo / Causal-Story
View on GitHub
Local nonlinear causal attention latent diffusion models for visual story synthesizing
☆29Apr 3, 2025Updated 11 months ago
rlqja1107 / NL-VSGG
View on GitHub
Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…
☆23Jun 13, 2025Updated 8 months ago
AIFrontierLab / UniGame
View on GitHub
official code for unigame
☆19Nov 26, 2025Updated 3 months ago
Sakuric / git-finance-dashboard
View on GitHub
智能金融投资平台 (Finance Dashboard)
☆35Feb 23, 2026Updated last week
Ali2500 / ViCaS
View on GitHub
ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)
☆18Apr 2, 2025Updated 11 months ago
mlvlab / DeepVideoR1
View on GitHub
[NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"
☆31Feb 22, 2026Updated last week
eric-ai-lab / DMLR
View on GitHub
Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"
☆63Dec 17, 2025Updated 2 months ago
visinf / veto
View on GitHub
Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)
☆22Sep 27, 2023Updated 2 years ago
zhensherlock / intellij-platform-git-stats-plugin
View on GitHub
git statistics plugins for IntelliJ Platform
☆184May 14, 2025Updated 9 months ago
Lrunlin / game-mall
View on GitHub
基于Vue+Node的游戏装备交易商城分为用户管理员 Vue+Koa+MySQL+scoket.io
☆72Apr 15, 2025Updated 10 months ago
kkkzheli / Water-UI-NEXT
View on GitHub
A MCBE JSON UI Resource Pack
☆223Jul 15, 2025Updated 7 months ago
OpenGithubs / china-Internet-opensource
View on GitHub
整理了各大厂的 GitHub 地址及热门开源项目，帮助大家更高效地了解国产开源生态
☆114Jul 3, 2025Updated 8 months ago
footuser / yolo_wechat_qrcode
View on GitHub
提高微信二维码识别精确率的小工具
☆102Apr 21, 2025Updated 10 months ago
johnson111788 / SpatialReasoner
View on GitHub
Training recipe for SpatialReasoner
☆38Sep 21, 2025Updated 5 months ago
Lrunlin / car-mall
View on GitHub
基于Vue+Node的汽车销售商城分为用户门店管理员 Vue+Koa+MySQL+scoket.io
☆54Feb 1, 2025Updated last year
Shengxiang-Lin / COMPSCI-288
View on GitHub
UCB CS 288. Natural Language Processing
☆98Feb 23, 2025Updated last year
mbzuai-oryx / TrackingMeetsLMM
View on GitHub
☆10Apr 7, 2025Updated 10 months ago
mislav / contacts
View on GitHub
Ruby library for consuming Google, Yahoo!, Flickr and Windows Live contact APIs
☆421Mar 19, 2010Updated 15 years ago
ls-kelvin / REVPT
View on GitHub
Code for paper: Reinforced Vision Perception with Tools
☆71Oct 3, 2025Updated 5 months ago
sohaha / zzz
View on GitHub
Go程序热编译、压力测试等，日常开发辅助工具,提升开发效率 - Daily development aids
☆234Jan 20, 2026Updated last month
LWL-cpu / Question-Free-Fine-Tuning
View on GitHub
[NeurIPS 2025 spotlight] QFFT, Question-Free Fine-Tuning for Adaptive Reasoning
☆91Nov 4, 2025Updated 3 months ago
alvations / sacremoses
View on GitHub
Python port of Moses tokenizer, truecaser and normalizer
☆124Apr 27, 2023Updated 2 years ago