cognitedata/Qwen-VL-finetune

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

☆18

Alternatives and similar repositories for Qwen-VL-finetune

Users who are interested in Qwen-VL-finetune are comparing it to the repositories listed below.

csguoh / KD-LTR
[MM2023] An official implementation of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"
☆16 • Nov 3, 2023 • Updated 2 years ago
ln-12 / UnrealOSM
Unreal Engine plugin to load (precomputed) OpenStreetMap tiles
☆14 • Jun 25, 2022 • Updated 4 years ago
Pavansomisetty21 / Text-to-Images-Leveraging-Flux-AI-for-Text-to-Image-Generation
We explore the domain of text-to-image generation using the capabilities of the Flux API. The objective is to trans…
☆12 • Aug 14, 2024 • Updated last year
arneuro / cppCNN
A C++ implementation of a convolutional neural network, with an MNIST handwritten digit recognition application
☆10 • Oct 20, 2020 • Updated 5 years ago
daemyung / practice-triton
The triangle in practice! Triton
☆16 • Feb 15, 2024 • Updated 2 years ago
BARUDA-AI / Awesome-Medical-LLM
Large language model of Medical AI, General Medical AI (GMAI)
☆17 • Jan 30, 2024 • Updated 2 years ago
pashanitw / llama3-and-friends-from-scratch
☆11 • Oct 3, 2024 • Updated last year
ugorsahin / Generative-Negative-Mining
[WACV 2024] Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024
☆13 • Jan 3, 2024 • Updated 2 years ago
jimysancho / graphrag-psql
Implementation based on lightrag and nano-graphrag to connect with psql
☆15 • Oct 28, 2024 • Updated last year
rbfx / sample-project
Simple example project that uses the Framework as a submodule
☆14 • Updated this week
tsinghua-fib-lab / RoboScape
☆26 • Jun 29, 2025 • Updated last year
zulov / artofwar
RTS game inspired by AOE, using a herd algorithm and crowd dynamics, genetic algorithms, and a neural network. Powered by Urho3D.
☆13 • Updated this week
zhjohnchan / bert-clip-synesthesia
[Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.
☆14 • Jun 7, 2023 • Updated 3 years ago
bdsp-core / IIIC-SPaRCNet
☆17 • Jul 6, 2026 • Updated 3 weeks ago
toolworks-dev / trusty-notes
A secure cross-platform note-taking application. Features end-to-end encryption for cloud sync and a modern React frontend.
☆14 • Jul 11, 2025 • Updated last year
guuguo / flutter_eliminating
A tile-matching (elimination) mini-game built with Flutter, intended as an exercise in learning how to implement one
☆12 • Apr 18, 2022 • Updated 4 years ago
MrDOS / freebsd-cross-build
amd64 Linux docker container for cross-compilation to FreeBSD.
☆12 • May 22, 2024 • Updated 2 years ago
roahmlab / DEFORM
DEFORM
☆20 • Mar 3, 2025 • Updated last year
lost22git / egui_code
A code editor
☆13 • Apr 21, 2023 • Updated 3 years ago
lntzm / HICom
[CVPR2025] Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
☆21 • Apr 30, 2025 • Updated last year
MathGenie / MathGenie
☆14 • Mar 11, 2024 • Updated 2 years ago
Yaser-wyx / SCANet
init
☆12 • May 25, 2025 • Updated last year
MCLAB-OCR / KnowledgeMiningWithSceneText
☆38 • Feb 4, 2023 • Updated 3 years ago
zju3dv / PhysSkin
[CVPR 2026 Highlight] PhysSkin: Real-Time and Generalizable Physics-Based Animation via Self-Supervised Neural Skinning
☆34 • Apr 9, 2026 • Updated 3 months ago
2U1 / Pixtral-Finetune
An open-source implementation for fine-tuning Pixtral by MistralAI.
☆23 • Feb 5, 2025 • Updated last year
Arking1995 / COHO
[ECCV 2024 Oral] The official implementation of the paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation
☆13 • Aug 13, 2024 • Updated last year
dblanm / benchmarking_cloth
Code of the article "Benchmarking the Sim-to-Real Gap in Cloth Manipulation"
☆24 • May 26, 2025 • Updated last year
Khalil-Rehman9 / CaptionAI
A powerful and user-friendly tool that generates detailed captions for your images
☆21 • Nov 11, 2024 • Updated last year
DAMO-NLP-SG / Multipurpose-Chatbot
A chatbot UI for RAG, multimodal, and text completion. (Supports Transformers, llama.cpp, MLX, vLLM)
☆20 • Apr 18, 2024 • Updated 2 years ago
jingjing-you / GRCNN_OCR.pytorch
Gated Recurrent Convolution Neural Network in PyTorch
☆25 • Nov 12, 2018 • Updated 7 years ago
ShawnTan86 / TokenCarve
This is the open-source code for TokenCarve.
☆25 • Jan 23, 2026 • Updated 6 months ago
kfish / impop
ImPop: Useful utils for Dear ImGui. Includes compile-time palette generation, ConfigMenu, DatePicker, OutlineText, PerfFooter
☆18 • Sep 9, 2024 • Updated last year
N8python / binary-vectors-mlx
MLX binary vectors and associated algorithms.
☆14 • Mar 13, 2025 • Updated last year
agentsea / taskara
Task management for AI agents
☆17 • Jun 25, 2025 • Updated last year
Canjie-Luo / Real-300K
The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…
☆34 • Jun 21, 2022 • Updated 4 years ago
BruceDLong / Proteus
Proteus 2.0
☆10 • Updated this week
diogohmcruz / DeepDip
DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA
☆13 • Jul 22, 2019 • Updated 7 years ago
rust-cv / hgg
Hierarchical Greedy Graph
☆16 • Jul 3, 2022 • Updated 4 years ago
liuting20 / MustDrop
Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model
☆36 • Jan 8, 2025 • Updated last year