sungatetop/deepseek-r1-vision

sungatetop / deepseek-r1-vision

A method to make a VLM think like R1

☆21

Alternatives and similar repositories for deepseek-r1-vision

Users that are interested in deepseek-r1-vision are comparing it to the libraries listed below.

meganndare / cantonese-nlp
cantonese-mandarin unsupervised neural translation for sw project
☆29 · May 2, 2023 · Updated 3 years ago
minujeong / hdri-to-cubemap
Generate 6 cubemap textures from panorama image
☆11 · Jul 27, 2018 · Updated 8 years ago
JustinTebbe / Dynamic-noise-AD
☆13 · Jul 11, 2025 · Updated last year
HighCWu / stylegan2-pytorch2paddle
Convert StyleGAN2 PyTorch to PaddlePaddle
☆12 · Aug 18, 2021 · Updated 4 years ago
Fancy-MLLM / R1-Onevision
R1-onevision, a visual language model capable of deep CoT reasoning.
☆581 · Apr 13, 2025 · Updated last year
360CVGroup / Bridge_Diffusion_Model
Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025
☆13 · Jun 25, 2024 · Updated 2 years ago
Hectormxy / OP-SAM
The official implementation of ICCV 25 OP-SAM "One Polyp Identifies All: One-Shot Polyp Segmentation with SAM via Cascaded Priors and Ite…
☆15 · Jul 9, 2025 · Updated last year
mbshbn / Lidar-Radar-sensor-fusion-for-pedestrian-prediction-using-EKF
EKF for Radar and Lidar measurements to estimate the position and velocity of an object, for example a pedestrian
☆12 · Jun 18, 2020 · Updated 6 years ago
JiehangXie / Landscape-Heath-Score
A tool for evaluating landscape health benefits and supporting evidence-based design
☆16 · Jan 6, 2022 · Updated 4 years ago
harpreetsahota204 / CRADIOv4
Implementing C-RADIOv4 as a Remote Source Zoo Model for FiftyOne
☆18 · Feb 4, 2026 · Updated 5 months ago
revospeech / audio-generation-papers
Recent audio generation papers (including speech, music and general audio)
☆13 · Mar 14, 2023 · Updated 3 years ago
microsoft / gestureBotDesignKit
Build and program your own desktop companion robot that performs gestures while it renders text phrases as speech.
☆16 · Jul 6, 2023 · Updated 3 years ago
percent4 / yi_vl_experiment
This project covers experiments and applications with Yi's multimodal model series, such as Yi-VL-6B/34B.
☆14 · Jan 25, 2024 · Updated 2 years ago
DataXujing / Co-DETR-TensorRT
First public release: end-to-end TensorRT inference acceleration for mmdetection Co-DETR
☆39 · Nov 27, 2024 · Updated last year
NarcissusEx / HETMM
[IJCV 2024] Hard-normal Example-aware Template Mutual Matching for Industrial Anomaly Detection
☆26 · Jan 1, 2025 · Updated last year
zhwesky2010 / PaConvert-Origin
paddle code convert toolkit
☆22 · Mar 19, 2023 · Updated 3 years ago
domingomery / Xdefects
Automatic defect recognition in X-ray testing using computer vision
☆13 · Dec 8, 2018 · Updated 7 years ago
yzhangchuck / awesome-llm-reasoning-long2short-papers
☆17 · Apr 11, 2025 · Updated last year
botbahlul / Live-Subtitle-V2
ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free Android Developers Speech Recognition API) then TRANSLATE (usin…
☆14 · May 5, 2024 · Updated 2 years ago
milgra / mmfm
MultiMedia File Manager
☆14 · Sep 24, 2024 · Updated last year
Asassong / nahida-AI-dub
Synthesizes the voice of the Genshin Impact character Nahida using PaddleSpeech
☆12 · Dec 6, 2022 · Updated 3 years ago
felixkreuk / audiogen
☆23 · Oct 3, 2022 · Updated 3 years ago
Plumess / yolov5-qnn
A project for QNN quantization and CPU inference of YOLOv5 in the Qualcomm AI Engine Direct environment
☆17 · Sep 10, 2024 · Updated last year
kkaiwwana / MVPbev
[ACM MM24 Poster] Official implementation of paper "MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllabili…
☆20 · Sep 6, 2025 · Updated 10 months ago
kongdebug / Paddle-DSR-Lab
Depth maps Super Resolution using PaddlePaddle
☆24 · Nov 20, 2022 · Updated 3 years ago
jart / sqlite-vec
Work-in-progress vector search SQLite extension that runs anywhere.
☆10 · Jul 27, 2024 · Updated 2 years ago
tqh / TrigramJS
A JavaScript library for trigram indexing and finding. If you want to know more about trigrams and how to use them try the example, and r…
☆12 · Dec 1, 2019 · Updated 6 years ago
yazone / g2pE_mobile
g2p for English TTS
☆19 · Nov 10, 2022 · Updated 3 years ago
huzongxiang / sowa
☆29 · Dec 12, 2024 · Updated last year
ShuaiLYU / REB
REB: Reducing Biases in Representation for Industrial Anomaly Detection
☆26 · Jan 22, 2024 · Updated 2 years ago
p12tic / libdesktop-agnostic
A fork of libdesktop-agnostic
☆15 · May 2, 2016 · Updated 10 years ago
UrbanArchitect / UrbanArchitect
The official repository of our paper: "Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior"
☆113 · Apr 26, 2024 · Updated 2 years ago
b0kch01 / llama-cpu
🦙 Inference code for LLaMA models (modified for CPU)
☆12 · Mar 4, 2023 · Updated 3 years ago
hclarke / triangle-sine
☆11 · Apr 28, 2024 · Updated 2 years ago
P403n1x87 / blighty
Desktop Widget Manager. Think of conky, but with Python instead of Lua.
☆13 · Jul 21, 2026 · Updated last week
bupticybee / icytranslate_offline
The offline part of icytranslate (an English-Chinese translation platform); the output of this project should be a translation model
☆19 · Aug 19, 2017 · Updated 8 years ago
LeijieZhang001 / leijiezhang001.github.io
☆12 · Mar 1, 2023 · Updated 3 years ago
yaodongC / DriveLLM
☆16 · Sep 11, 2023 · Updated 2 years ago
alibaba / alimama-video-narrator
Research code for ACL2024 paper: "Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline"
☆42 · Dec 27, 2024 · Updated last year