This project is an unofficial summary of the resources related to VALSE and its annual seminar. Its main purpose is to facilitate communication and learning, and additions and suggestions are welcome.
☆20Jun 16, 2024Updated last year
Alternatives and similar repositories for VALSE-WorkShop
Users that are interested in VALSE-WorkShop are comparing it to the libraries listed below
- Vision-Language Pretraining & Efficient Transformer Papers.☆15Nov 30, 2021Updated 4 years ago
- Does Diffusion Beat GAN in Image Super Resolution?☆12May 27, 2024Updated last year
- ☆12Jun 2, 2019Updated 6 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 8 months ago
- Combines InstantID🔥 and FouriScale to generate high-resolution images!☆11Apr 3, 2024Updated last year
- ☆14Dec 25, 2024Updated last year
- ☆19Jul 14, 2024Updated last year
- ☆11Apr 30, 2025Updated 10 months ago
- Collected the world's best computer vision labs and lecture materials.☆14Feb 23, 2025Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- ☆16Mar 26, 2025Updated 11 months ago
- The official implementation of ADDP (ICLR 2024)☆12Mar 27, 2024Updated last year
- The project is an official implementation of our paper "RSGNet: Relation based Skeleton Graph Network for Crowded Scenes Pose Estimation…☆10Dec 9, 2020Updated 5 years ago
- The implementation of the Block Coordinate Regularization by Denoising (BC-RED) algorithm (NeurIPS 2019)☆10Oct 15, 2019Updated 6 years ago
- Code for the paper "Offline Prioritized Experience Replay"☆12Jun 13, 2023Updated 2 years ago
- Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020☆13May 2, 2022Updated 3 years ago
- ☆81Sep 21, 2025Updated 6 months ago
- Official Repository for "LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions"☆15Apr 20, 2025Updated 11 months ago
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.☆14Dec 12, 2024Updated last year
- Official Code for "Intelligent Painter: Picture Composition With Resampling Diffusion Model" (ICIP 2023)☆17Jun 23, 2023Updated 2 years ago
- SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation☆76Updated this week
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm…☆31Mar 11, 2026Updated last week
- SiamAtt: Siamese attention network for visual tracking☆15Apr 29, 2021Updated 4 years ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆40May 30, 2025Updated 9 months ago
- Yuren 13B is an information synthesis large language model that has been continuously trained based on Llama 2 13B, which builds upon the…☆15Sep 25, 2023Updated 2 years ago
- On Path to Multimodal Generalist: General-Level and General-Bench☆18Jul 11, 2025Updated 8 months ago
- Who's Waldo? Linking People Across Text and Images. ICCV 2021.☆13May 17, 2023Updated 2 years ago
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations".☆25Feb 14, 2026Updated last month
- Code implementation of "HAMUR: Hyper Adapter for Multi-Domain Recommendation" in CIKM 2023☆22Jan 3, 2024Updated 2 years ago
- The download methods of Vision-language Continual Pretraining Dataset P9D.☆12Jan 3, 2025Updated last year
- Baidu speech synthesis☆20Oct 12, 2017Updated 8 years ago
- Just a template for quickly creating a python library.☆10Updated this week
- 🌏 Teddy is a tiny but scalable http server based on Java NIO, inspired by netty.☆11Dec 26, 2019Updated 6 years ago
- Official code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Jul 4, 2024Updated last year
- ☆12Feb 15, 2023Updated 3 years ago
- [AAAI'26] PyTorch code for our paper "QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution"☆32Jan 29, 2026Updated last month
- [ICCV 2023] PATMAT Person Aware Tuning of Mask Aware Transformer for Face Inpainting☆29Jan 5, 2024Updated 2 years ago
- Continuously Masked Transformer for Image Inpainting, ICCV, 2023☆28Nov 20, 2023Updated 2 years ago
- something for paper agent☆11Dec 18, 2024Updated last year