an method to make vlm think like r1
☆21May 28, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-vision
Users that are interested in deepseek-r1-vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆16Sep 3, 2025Updated 9 months ago
- A multi-frame-inpainting script for stable diffusion webui☆11Apr 7, 2023Updated 3 years ago
- cantonese-mandarin unsupervised neural translation for sw project☆29May 2, 2023Updated 3 years ago
- A face-detect and landmark-detect lib☆13Aug 17, 2021Updated 4 years ago
- ☆14Aug 6, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pytorch version of the CVPR 2020 paper: Blindly Assess Image Quality in the Wild Guided by A Self-Adaptive Hyper Network☆12Jul 5, 2020Updated 5 years ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆149Oct 10, 2025Updated 8 months ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆581Apr 13, 2025Updated last year
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- ☆23Jan 3, 2024Updated 2 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 5 years ago
- recent audio generation papers (including speech, music and general audios)☆13Mar 14, 2023Updated 3 years ago
- Utilizes ONNX Runtime for TTS model.☆64Jun 8, 2026Updated last week
- [IJCV 2024] Hard-normal Example-aware Template Mutual Matching for Industrial Anomaly Detection☆25Jan 1, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 全网首发,mmdetection Co-DETR TensorRT端到端推理加速☆39Nov 27, 2024Updated last year
- ☆11Sep 22, 2025Updated 8 months ago
- Automatic defect recognition in X-ray testing using computer vision☆13Dec 8, 2018Updated 7 years ago
- A real-world autonomous driving simulator based on 3D Gaussian Splatting for scene augmentation☆16Jun 10, 2024Updated 2 years ago
- paddle code convert toolkit☆22Mar 19, 2023Updated 3 years ago
- MultiMedia File Manager☆14Sep 24, 2024Updated last year
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆14Jan 25, 2024Updated 2 years ago
- ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free Android Developers Speech Recognition API) then TRANSLATE (usin…☆13May 5, 2024Updated 2 years ago
- EKF for Radar and Lidar measurements to estimate the position and velocity an object, for example a pedestrian☆12Jun 18, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆23Oct 3, 2022Updated 3 years ago
- Image style transfer using Convolutional Neural Networks☆13May 1, 2017Updated 9 years ago
- Repo for Polyphone Disambiguation in Mandarin Chinese with Semi-Supervised Learning☆15Feb 26, 2022Updated 4 years ago
- A list of various eye- and head-tracking software, products, etc. ℹ️ This is just a push-mirror. We develop here: https://codeberg.org/ey…☆23Apr 24, 2026Updated last month
- convert 3D point cloud map (.pcd) to 3D occupancy grid map (.pgm)☆12Mar 15, 2024Updated 2 years ago
- 使用onnxruntime部署GroundingDINO开放世界目标检测,包含C++和Python两个版本的程序☆85Feb 2, 2024Updated 2 years ago
- ARPABET transcription syllabifier module☆16Aug 25, 2022Updated 3 years ago
- ☆29Dec 12, 2024Updated last year
- REB:Reducing Biases in Representation for Industrial Anomaly Detection☆26Jan 22, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Work-in-progress vector search SQLite extension that runs anywhere.☆10Jul 27, 2024Updated last year
- Depth maps Super Resolution using PaddlePaddle☆24Nov 20, 2022Updated 3 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆16Mar 15, 2025Updated last year
- 用koch复现lerobot—遥操作数据采集—act复现—diffusion model复现—Pi模型复现—视觉大模型☆30May 16, 2025Updated last year
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago
- ☆11Apr 28, 2024Updated 2 years ago
- A chat UI for Llama.cpp☆16Jun 4, 2026Updated 2 weeks ago