198808xc/Vision-AGI-Survey

A temporary webpage for our survey in AGI for computer vision

☆119

Alternatives and similar repositories for Vision-AGI-Survey

Users who are interested in Vision-AGI-Survey are comparing it to the repositories listed below.

wdrink / OpenTokenizer
☆21 · Jan 17, 2025 · Updated last year
Corleone-Huang / RealCustomProject
☆19 · Apr 16, 2025 · Updated last year
ICCV-5-EENA / EENA
☆10 · Jul 5, 2019 · Updated 7 years ago
callsys / FlowText
[ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
☆13 · May 13, 2023 · Updated 3 years ago
jianzongwu / Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
☆999 · May 12, 2026 · Updated 2 months ago
caopulan / CVPR24_Listener
☆12 · Feb 2, 2024 · Updated 2 years ago
sunsmarterjie / ChatterBox
[AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues
☆61 · May 2, 2025 · Updated last year
MengLcool / SEGIC
[ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".
☆27 · Oct 13, 2024 · Updated last year
zipengxuc / StylerDALLE
Code for ICCV 2023 paper ✨ "StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Mo…
☆18 · Jan 25, 2024 · Updated 2 years ago
GGQ1996 / action_co_localization
☆21 · Sep 12, 2020 · Updated 5 years ago
davzha / MESH
Code for the paper "Unlocking Slot Attention by Changing Optimal Transport Costs"
☆13 · Sep 19, 2023 · Updated 2 years ago
laisimiao / LoRAT_pytracking
LoRAT_pytracking: reproduction of [ECCV2024] LoRAT
☆47 · Dec 9, 2024 · Updated last year
cmusmashlab / SAMoSA
Code for the paper, SAMoSA - Sensing Activities with Motion and Sub-sampled Audio
☆19 · Jan 24, 2023 · Updated 3 years ago
HelloJianHan / msrlOB
Pytorch reproduction of paper "Hierarchical Object Detection with Deep Reinforcement Learning"
☆10 · Oct 3, 2023 · Updated 2 years ago
awaisrauf / Awesome-CV-Foundational-Models
☆550 · Nov 7, 2024 · Updated last year
open-mmlab / mmstyles
Latex style file to facilitate writing of technical papers
☆37 · Apr 4, 2016 · Updated 10 years ago
song2yu / SIBench-VSR
This is a project on visual spatial reasoning tasks-SIBench
☆27 · Jan 12, 2026 · Updated 6 months ago
WeiZhang1988 / BEVFormerReimplementation
☆16 · May 14, 2024 · Updated 2 years ago
ExplainableML / ACVC
Official PyTorch implementation of CVPRW 2022 paper "Attention Consistency on Visual Corruptions for Single-Source Domain Generalization"
☆29 · Feb 22, 2023 · Updated 3 years ago
opendatalab / image-downloader
☆31 · May 13, 2024 · Updated 2 years ago
mira-space / MiraData
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
☆527 · Sep 2, 2024 · Updated last year
VainF / Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
☆1,852 · Nov 15, 2023 · Updated 2 years ago
yoxu515 / MITS
☆21 · Jul 25, 2024 · Updated 2 years ago
Exploring-Embodied-Emotion-official / E3
☆25 · Jul 1, 2025 · Updated last year
baaivision / Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
☆2,593 · Dec 6, 2024 · Updated last year
tailin1009 / DualHead-Network
The implementation for "Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition"(ACM Multimedia 2021)
☆22 · Nov 8, 2022 · Updated 3 years ago
ekurtulus / tied-augment
Tied-Augment: Controlling Representation Similarity Improves Data Augmentation
☆14 · Oct 1, 2023 · Updated 2 years ago
ZFancy / DivOE
[NeurIPS 2023] "Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation"
☆11 · Oct 6, 2023 · Updated 2 years ago
joyhsu0504 / LEFT
☆50 · Apr 25, 2024 · Updated 2 years ago
joeyy5588 / planning-as-inpainting
Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty
☆23 · Dec 11, 2023 · Updated 2 years ago
zhang-haojie / MuSS
A Large-Scale Dataset and Cinematic Narrative Benchmark for Multi-Shot Subject-to-Video Generation
☆32 · Jun 9, 2026 · Updated last month
shengshu-ai / minWM
A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models
☆749 · Jun 15, 2026 · Updated last month
fmcarlucci / ADAGE
Pytorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"
☆11 · Jul 10, 2019 · Updated 7 years ago
Phantom-video / Phantom-Data
Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset
☆117 · Feb 25, 2026 · Updated 5 months ago
DirtyHarryLYL / LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
☆871 · Mar 8, 2025 · Updated last year
liuyang-ict / SAP-DETR
[CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…
☆30 · May 28, 2023 · Updated 3 years ago
chunfeng3364 / LARC
☆19 · Jun 26, 2024 · Updated 2 years ago
OpenGVLab / LCL
[NeurIPS 2024] Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
☆72 · Feb 11, 2025 · Updated last year
CSC2548 / image_caption_gan
☆10 · May 4, 2018 · Updated 8 years ago