A method to make a VLM think like R1
☆21May 28, 2025Updated 10 months ago
Alternatives and similar repositories for deepseek-r1-vision
Users that are interested in deepseek-r1-vision are comparing it to the libraries listed below.
- Cantonese TTS frontend☆16Oct 14, 2019Updated 6 years ago
- Thinker☆40Feb 26, 2026Updated last month
- 🙌 OpenHands: Code Less, Make More☆11Jan 8, 2025Updated last year
- ☆13Jul 11, 2025Updated 9 months ago
- RESAnything: Attribute Prompting for Arbitrary Referring Segmentation☆17Nov 28, 2025Updated 4 months ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆148Oct 10, 2025Updated 6 months ago
- Convert StyleGAN2 PyTorch to PaddlePaddle☆12Aug 18, 2021Updated 4 years ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆578Apr 13, 2025Updated last year
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- ☆23Jan 3, 2024Updated 2 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with PyTorch.☆14Jun 17, 2021Updated 4 years ago
- Utilizes ONNX Runtime for TTS model.☆57Mar 19, 2026Updated last month
- ☆18Jan 17, 2022Updated 4 years ago
- First public release: end-to-end TensorRT inference acceleration for mmdetection Co-DETR☆39Nov 27, 2024Updated last year
- ☆11Sep 22, 2025Updated 6 months ago
- Automatic defect recognition in X-ray testing using computer vision☆13Dec 8, 2018Updated 7 years ago
- This project covers experiments and applications with Yi's multimodal model series, such as Yi-VL-6B/34B.☆14Jan 25, 2024Updated 2 years ago
- EKF for Radar and Lidar measurements to estimate the position and velocity of an object, for example a pedestrian☆12Jun 18, 2020Updated 5 years ago
- ☆17Apr 11, 2025Updated last year
- ☆23Oct 3, 2022Updated 3 years ago
- Image style transfer using Convolutional Neural Networks☆13May 1, 2017Updated 8 years ago
- Repo for Polyphone Disambiguation in Mandarin Chinese with Semi-Supervised Learning☆15Feb 26, 2022Updated 4 years ago
- Deploy GroundingDINO open-world object detection with onnxruntime, including both C++ and Python versions☆82Feb 2, 2024Updated 2 years ago
- [ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that…☆1,065Mar 20, 2026Updated 3 weeks ago
- convert 3D point cloud map (.pcd) to 3D occupancy grid map (.pgm)☆12Mar 15, 2024Updated 2 years ago
- ARPABET transcription syllabifier module☆16Aug 25, 2022Updated 3 years ago
- ☆29Dec 12, 2024Updated last year
- REB:Reducing Biases in Representation for Industrial Anomaly Detection☆26Jan 22, 2024Updated 2 years ago
- Depth maps Super Resolution using PaddlePaddle☆24Nov 20, 2022Updated 3 years ago
- Reproducing lerobot with koch: teleoperation data collection, ACT reproduction, diffusion model reproduction, Pi model reproduction, large vision models☆26May 16, 2025Updated 11 months ago
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago
- Neural Homomorphic Vocoder optimized for singing voice synthesis☆28Mar 20, 2026Updated 3 weeks ago
- Desktop Widget Manager. Think of conky, but with Python instead of Lua.☆13Jun 10, 2020Updated 5 years ago
- A chat UI for Llama.cpp☆16Apr 10, 2026Updated last week
- A project for QNN quantization and CPU inference of YOLOv5 in the Qualcomm AI Engine Direct environment☆16Sep 10, 2024Updated last year
- [ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models☆69Feb 13, 2024Updated 2 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 8 years ago
- Data Dialogue enables natural language querying of databases by integrating LLMs with SQL databases.☆14May 3, 2025Updated 11 months ago
- [ACM MM24 Poster] Official implementation of paper "MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllabili…☆20Sep 6, 2025Updated 7 months ago