A method to make a VLM think like R1
☆21May 28, 2025Updated 11 months ago
Alternatives and similar repositories for deepseek-r1-vision
Users that are interested in deepseek-r1-vision are comparing it to the libraries listed below.
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆15Sep 3, 2025Updated 8 months ago
- cantonese-mandarin unsupervised neural translation for sw project☆29May 2, 2023Updated 3 years ago
- Thinker☆40Feb 26, 2026Updated 2 months ago
- Galaxea's first diffusion policy release☆37Aug 18, 2025Updated 8 months ago
- RESAnything: Attribute Prompting for Arbitrary Referring Segmentation☆18Nov 28, 2025Updated 5 months ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆148Oct 10, 2025Updated 6 months ago
- Generative Motion Latent Flow Matching for Audio-driven Talking Portrait☆33Sep 10, 2025Updated 7 months ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆579Apr 13, 2025Updated last year
- ☆23Jan 3, 2024Updated 2 years ago
- A tool for evaluating landscape health benefits and supporting evidence-based design☆16Jan 6, 2022Updated 4 years ago
- Utilizes ONNX Runtime for TTS model.☆61Mar 19, 2026Updated last month
- ☆18Jan 17, 2022Updated 4 years ago
- [IJCV 2024] Hard-normal Example-aware Template Mutual Matching for Industrial Anomaly Detection☆24Jan 1, 2025Updated last year
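- A dynamic data dashboard built with django + pyecharts + PP-Human, with practical features such as collecting and storing foot-traffic data, alerts for events like fights and falls, and mask detection. The edge version uses onnx inference for better efficiency; the server version supports pushing and pulling video streams.☆33May 3, 2023Updated 3 years ago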
- First public release: end-to-end TensorRT inference acceleration for mmdetection Co-DETR☆39Nov 27, 2024Updated last year
- [SIGGRAPH Asia 2023] ReShader: View-Dependent Highlights for Single Image View-Synthesis☆18Mar 9, 2024Updated 2 years ago
- ☆11Sep 22, 2025Updated 7 months ago
- Automatic defect recognition in X-ray testing using computer vision☆13Dec 8, 2018Updated 7 years ago
- A real-world autonomous driving simulator based on 3D Gaussian Splatting for scene augmentation☆16Jun 10, 2024Updated last year
- Code and website for "GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation"☆40Oct 9, 2025Updated 7 months ago
- paddle code convert toolkit☆22Mar 19, 2023Updated 3 years ago
- Experiments and applications with Yi's multimodal model series, such as Yi-VL-6B/34B.☆14Jan 25, 2024Updated 2 years ago
- ☆18Oct 24, 2024Updated last year
- ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free Android Developers Speech Recognition API) then TRANSLATE (usin…☆13May 5, 2024Updated 2 years ago
- Synthesizing the voice of the Genshin Impact character Nahida with PaddleSpeech☆12Dec 6, 2022Updated 3 years ago
- ☆17Apr 11, 2025Updated last year
- ☆23Oct 3, 2022Updated 3 years ago
- Image style transfer using Convolutional Neural Networks☆13May 1, 2017Updated 9 years ago
- Repo for Polyphone Disambiguation in Mandarin Chinese with Semi-Supervised Learning☆15Feb 26, 2022Updated 4 years ago
- Deploying GroundingDINO open-world object detection with onnxruntime, with both C++ and Python versions of the program☆83Feb 2, 2024Updated 2 years ago
- [ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that…☆975Mar 20, 2026Updated last month
- A list of various eye- and head-tracking software, products, etc. ℹ️ This is just a push-mirror. We develop here: https://codeberg.org/ey…☆22Apr 24, 2026Updated 2 weeks ago
- Given an input RGB image, we generate novel viewpoints that simulate a 3D interactive experience.☆23Apr 26, 2023Updated 3 years ago
- ☆29Dec 12, 2024Updated last year
- gcc+newlib and gcc+glibc toolchains☆17Apr 12, 2019Updated 7 years ago
- REB: Reducing Biases in Representation for Industrial Anomaly Detection☆26Jan 22, 2024Updated 2 years ago
- Build and program your own desktop companion robot that performs gestures while it renders text phrases as speech.☆16Jul 6, 2023Updated 2 years ago
- Depth maps Super Resolution using PaddlePaddle☆24Nov 20, 2022Updated 3 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated last year