caopulan / iKUNet
☆142Updated last year
Alternatives and similar repositories for iKUNet:
Users that are interested in iKUNet are comparing it to the libraries listed below
- CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative H…☆309Updated 11 months ago
- A temporary webpage for our survey in AGI for computer vision☆120Updated 11 months ago
- [CVPR 2024] iKUN: Speak to Trackers without Retraining☆123Updated 9 months ago
- Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning☆110Updated 7 months ago
- 针对新的视频后期工作流制作的各种小工具☆20Updated 3 months ago
- A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue☆242Updated 3 months ago
- 抢占显卡☆65Updated 5 months ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆96Updated last year
- Simple script to parallelize download and extract files for SA-1B Dataset.☆36Updated 5 months ago
- a thin wrapper of chatgpt for improving paper writing.☆254Updated 2 years ago
- Collection of Highlight papers☆36Updated 10 months ago
- Latex template for conference (e.g. ICCV/CVPR) submission/supplementary/rebuttal☆69Updated last year
- Chat about anything on any video!☆36Updated last year
- 国内外优秀的计算机视觉团队汇总,极市团队整理☆288Updated 4 years ago
- 【CVer出品】旨在盘点最全面的计算机视觉方向☆36Updated 2 years ago
- AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segm…☆80Updated 3 months ago
- Towards training VQ-VAE models robustly!☆65Updated 3 months ago
- ☆67Updated 5 months ago
- 一款便捷的抢占显卡脚本☆315Updated 2 months ago
- ☆116Updated 10 months ago
- A template for rapid deployment of PyTorch models.☆64Updated 2 years ago
- Which fellows cited my article?☆23Updated 3 years ago
- ☆8Updated 3 weeks ago
- A python tool that generate latex(e.g. Table, matrix) code.☆11Updated 2 years ago
- AI-Generated Images as Data Source: The Dawn of Synthetic Era☆152Updated last year
- Eat your GPUs☆21Updated 4 years ago
- [CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding☆354Updated 4 months ago
- The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".☆239Updated last year
- This repository is a curated collection of the most exciting and influential CVPR 2023 opensource works [Paper + Code].🔥☆62Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆302Updated last month