llp1992 / Kanva

☆11

Related projects:

ziplab / SN-Netv2
[ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".
☆22 Updated 7 months ago
liaoning97 / REVO-LION
REVO-LION: Evaluating and Refining Vision-Language Instruction Tuning Datasets
☆11 Updated 11 months ago
OpenGVLab / DiffAgent
[CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
☆15 Updated 5 months ago
mightyzau / InfMLLM
☆20 Updated 9 months ago
Vchitect / LiteGen
A lightweight, highly efficient training framework for accelerating diffusion tasks.
☆13 Updated last week
983632847 / SAM-for-Videos
This repository is for the first survey on SAM for videos.
☆11 Updated last month
HubHop / vit-attention-benchmark
Benchmarking Attention Mechanism in Vision Transformers.
☆16 Updated last year
HDETR / H-PETR-Pose
[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".
☆14 Updated 2 years ago
zhangjiewu / awesome-t2i-eval
A curated list of papers and resources for text-to-image evaluation.
☆26 Updated last year
donglixp / ICL_PaperList
Paper List for In-context Learning 🌷
☆20 Updated last year
mti-lab / SVGEditBench
A benchmark dataset for evaluating LLM's SVG editing capabilities
☆13 Updated 4 months ago
buptlihang / CVLM
☆22 Updated 8 months ago
V3Det / mmdetection-V3Det
OpenMMLab Detection Toolbox and Benchmark for V3Det
☆15 Updated 5 months ago
luminolx / ScaleNet
ScaleNet: Searching for the Model to Scale (ECCV 2022)
☆12 Updated last year
TencentARC / TaCA
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆15 Updated last year
ggjy / vision_weak_to_strong
☆37 Updated 7 months ago
WeihuangLin / INF-LLaVA
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
☆36 Updated last month
MengLcool / DeepStack-VL
☆31 Updated 3 months ago
jiyt17 / IDA-VLM
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
☆18 Updated last week
IDSIA / fpainter
Official repository for the paper "Images as Weight Matrices: Sequential Image Generation Through Synaptic Learning Rules" (ICLR 2023)
☆12 Updated last year
simonsanvil / DALL-E-Explained
Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…
☆29 Updated 2 years ago
DefengXie / Edit_Everything
☆19 Updated last year
eric-ai-lab / Discffusion
Official repo for the paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆26 Updated 4 months ago
LaVi-Lab / Visual-Table
Stay tuned!
☆11 Updated 5 months ago
jialuli-luka / SELMA
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆30 Updated 6 months ago
bytedance / DQ-Det
Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation
☆35 Updated last year
NVlabs / DICOD
Official PyTorch implementation for Distilling Image Classifiers in Object detection (NeurIPS2021)
☆30 Updated 2 years ago
neuralchen / Bivolution
Accepted by AAAI2022
☆21 Updated 2 years ago
csqiangwen / LDM-ISP-Enhancing-Neural-ISP-for-Low-Light-with-Latent-Diffusion-Models
☆13 Updated this week