A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the performance of the open-source model Qwen-VL-7B-Chat.
☆14Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for catvision
Users that are interested in catvision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fine-tune of Florence-2 for shot categorization.☆26Mar 6, 2025Updated last year
- 包含目标检测前处理与后处理☆20Aug 24, 2021Updated 4 years ago
- ☆12Mar 7, 2019Updated 7 years ago
- ☆12Dec 26, 2021Updated 4 years ago
- ☆17Sep 2, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆27Nov 7, 2023Updated 2 years ago
- Omni inference in C/C++☆102Mar 23, 2026Updated last week
- ☆12Aug 25, 2017Updated 8 years ago
- TaskingAI Python Client☆21Jan 28, 2025Updated last year
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Sep 25, 2024Updated last year
- Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.☆13Mar 8, 2016Updated 10 years ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year
- A Fast PyTorch implementation for ICCV 19 paper "BMN: Boundary-Matching Network for Temporal Action Proposal Generation"☆10Jul 29, 2019Updated 6 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆18Dec 25, 2021Updated 4 years ago
- You are welcomed to join us!☆50Sep 27, 2020Updated 5 years ago
- CS663 course project☆13Nov 22, 2016Updated 9 years ago
- R files containing the code used to predict rugby world cup matches☆10Sep 18, 2015Updated 10 years ago
- High-performance control stack for Embodied AI powered by the OpenClaw ecosystem. Designed for high-dynamic platforms including Humanoids…☆28Feb 16, 2026Updated last month
- profile tools for pytorch nn models☆42Jan 11, 2021Updated 5 years ago
- A image processing project that produces face morphing videos☆11Jul 9, 2015Updated 10 years ago
- Convert scans of handwritten notes to beautiful, compact Images☆15Jun 21, 2022Updated 3 years ago
- Implementation Code for Paper: K. Zagoris and I. Pratikakis, Bio-Inspired Modeling for the Enhancement of Historical Handwritten Document…☆15Nov 24, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Progressive Attention Networks☆12Oct 25, 2016Updated 9 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆29Oct 16, 2025Updated 5 months ago
- interactive shader with tensorflowjs facemesh☆15Dec 7, 2022Updated 3 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- Single Shot Scene Text Retrieval, ECCV 2018. L. Gomez*, A. Mafla*, M. Rusiñol, D. Karatzas.☆68May 13, 2019Updated 6 years ago
- Neurocomputing "Deep Multi-Center Learning for Face Alignment"☆12Mar 28, 2020Updated 6 years ago
- The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"☆24Feb 4, 2026Updated last month
- Split ELAN Annotation Files and corresponding speech files into a corpus format for common ASR and Forced Aligners☆11Oct 15, 2018Updated 7 years ago
- ☆14May 26, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- HCCR中文手写汉字识别 (网站在线实时推理)☆13Feb 2, 2023Updated 3 years ago
- A papers list in computer vision fields.☆13Jul 20, 2020Updated 5 years ago
- 内网穿透及端口转发工具☆10Apr 7, 2022Updated 3 years ago
- assembles images in a grid☆34Jan 30, 2020Updated 6 years ago
- A SOTA vision model built on top of llama3 8B.☆14May 28, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆15Jan 22, 2025Updated last year
- Converting docx files to pdf using libreoffice engine, flask and docker☆18Feb 2, 2026Updated last month