ssppp/Click4Caption

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ssppp/Click4Caption)

ssppp / Click4Caption

A visual LLM for image region description or QA.

☆16

Alternatives and similar repositories for Click4Caption

Users that are interested in Click4Caption are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Pzoom522 / xANLG
View on GitHub
Data and code for "Understanding Linearity of Cross-Lingual Word Embedding Mappings" (TMLR 2022)
☆12Jun 8, 2022Updated 4 years ago
tcapelle / mistral_wandb
View on GitHub
A full fledged mistral+wandb
☆13Aug 16, 2024Updated last year
juliawilkins / ambisonics2binaural_simple
View on GitHub
A simple Python script to convert FOA audio to binaural.
☆17Nov 29, 2022Updated 3 years ago
wandb / llm-workshop-fc2024
View on GitHub
Resources for the FC 2024 LLM workshop
☆17Jul 31, 2024Updated last year
lixiaotong97 / mc-BEiT
View on GitHub
[ECCV 2022] Official pytorch implementation of "mc-BEiT: Multi-choice Discretization for Image BERT Pre-training" in European Conference …
☆22Sep 13, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
palchenli / VL-Instruction-Tuning
View on GitHub
☆90Nov 25, 2023Updated 2 years ago
searxng / fasttext-predict
View on GitHub
fasttext with wheels and no external dependency, but only the predict method (<1MB)
☆20Nov 23, 2024Updated last year
TencentARC / ConMIM
View on GitHub
Official codes for ConMIM (ICLR 2023)
☆58Feb 8, 2023Updated 3 years ago
folterj / BioImageOperation
View on GitHub
Image processing tool focusing on biological applications
☆14Jul 18, 2025Updated last year
scaleton-co / token-contract
View on GitHub
Fungible, Non-Fungible, Semi-Fungible Tokens Smart Contracts
☆20Apr 23, 2022Updated 4 years ago
xt4d / SparseGNV
View on GitHub
SparseGNV: Generating Novel Views of Indoor Scenes with Sparse Input Views
☆19Feb 27, 2024Updated 2 years ago
lecoursen / lecoursen
View on GitHub
☆27Mar 28, 2023Updated 3 years ago
hahehi / placepedia
View on GitHub
A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.
☆10Jul 15, 2020Updated 6 years ago
microsoft / ExtreMA
View on GitHub
A self-supervised learning approach based on extremely large masking
☆31Dec 19, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ivan-alles / blender-dataset
View on GitHub
A python toolkit to create photorealistic image datasets for machine learning with Blender.
☆12Jun 14, 2021Updated 5 years ago
Skrrytch / map2mc
View on GitHub
Tool to generate a complete minecraft world out of a bitmap image (a map for example)
☆12Apr 26, 2021Updated 5 years ago
Skywola / anim
View on GitHub
Blender-Python Animation files
☆16Mar 5, 2021Updated 5 years ago
yekeren / Story-Video_ads_understanding
View on GitHub
LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".
☆15Oct 30, 2020Updated 5 years ago
detoxio-ai / hacktor
View on GitHub
☆25Jan 27, 2025Updated last year
kariander1 / visual-geo-solver
View on GitHub
☆16Apr 14, 2026Updated 3 months ago
TencentARC / TVTS
View on GitHub
Turning to Video for Transcript Sorting
☆49Aug 27, 2023Updated 2 years ago
TencentARC / TaCA
View on GitHub
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Jun 20, 2023Updated 3 years ago
eigen-value / rigify
View on GitHub
Auto-rigging addon for Blender
☆19Dec 21, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
janinethoma / learning1M
View on GitHub
This repository contains the code for our papers "Learning Condition Invariant Features for Retrieval-Based Localization from 1M Images" …
☆11Oct 25, 2020Updated 5 years ago
mr-abramenko / subpixel-corner-edge-detector
View on GitHub
Subpixel corner and edge detector
☆19Jul 6, 2023Updated 3 years ago
sijeh / Sticker820K
View on GitHub
☆11Jun 12, 2023Updated 3 years ago
JosephPai / FashionAI-Attributes
View on GitHub
Attributes Recognition of Apparel
☆10Jan 8, 2019Updated 7 years ago
vguzov / hps_dataset_scripts
View on GitHub
Demo scripts for HPS Dataset (http://virtualhumans.mpi-inf.mpg.de/hps/)
☆11Mar 10, 2025Updated last year
showlab / Region_Learner
View on GitHub
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆43Jul 15, 2022Updated 4 years ago
ljjcoder / EHTDI
View on GitHub
Exploring High-quality Target Domain Information for Unsupervised Domain Adaptive Semantic Segmentation
☆20Nov 22, 2022Updated 3 years ago
TencentARC-QQ / TagGPT
View on GitHub
TagGPT: Large Language Models are Zero-shot Multimodal Taggers
☆67May 12, 2023Updated 3 years ago
lpzjerry / Pedestrian-Attribute-LGNet
View on GitHub
Source code for paper "Localization Guided Learning for Pedestrian Attribute Recognition" (BMVC 2018)
☆13Mar 1, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jasonyzhang / relpose
View on GitHub
Code for RelPose (ECCV 2022)
☆115Jun 1, 2023Updated 3 years ago
idiap / deepfocus
View on GitHub
Pytorch implementation of "DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function"
☆25Jul 5, 2023Updated 3 years ago
amazon-science / object-centric-vol
View on GitHub
☆13Apr 3, 2024Updated 2 years ago
jmhb0 / view_neti
View on GitHub
[ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models
☆111Dec 3, 2024Updated last year
showlab / DemoVLP
View on GitHub
[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training
☆22Mar 19, 2022Updated 4 years ago
yxgeee / SDA
View on GitHub
Structured Domain Adaptation with Online Relation Regularization for Unsupervised Person Re-ID
☆18Jun 9, 2020Updated 6 years ago
tgilewicz / uniformaugment
View on GitHub
Unofficial PyTorch Reimplementation of UniformAugment.
☆15Sep 7, 2020Updated 5 years ago