My notes for cmu15445 2022
☆14Feb 8, 2023Updated 3 years ago
Alternatives and similar repositories for CMU15445-2022-notes
Users that are interested in CMU15445-2022-notes are comparing it to the libraries listed below
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆19Apr 16, 2024Updated last year
- [ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video"☆27Sep 9, 2025Updated 6 months ago
- ☆29Dec 23, 2025Updated 2 months ago
- build vgg16 with pytorch 0.4.0 for classification of CIFAR datasets☆10Mar 31, 2019Updated 6 years ago
- This is a personal project created in 2021/7, done while participating in the 2021 NUS SWS project. (Cluster: Visual Computing)☆10Dec 6, 2024Updated last year
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 3 months ago
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆11May 26, 2024Updated last year
- ☆12Apr 26, 2023Updated 2 years ago
- ACL24☆11Jun 7, 2024Updated last year
- A test library for computing modular exponentiation in parallel using AVX-512 vector arithmetic☆12Dec 18, 2023Updated 2 years ago
- This is a mini RPC that depends on Google protobuf.☆12Jul 24, 2019Updated 6 years ago
- ☆10Sep 29, 2023Updated 2 years ago
- ☆11May 30, 2016Updated 9 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆34Jul 3, 2025Updated 8 months ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Jan 1, 2026Updated 2 months ago
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 5 months ago
- World Tree.☆15Dec 9, 2023Updated 2 years ago
- A Pytorch implementation of "An Efficient Unfolding Network with Disentangled Spatial-Spectral Representation for Hyperspectral Image Sup…☆13Jan 26, 2023Updated 3 years ago
- A simple, out-of-the-box and cross-platform bbox annotation tool in Python. Try it by `pip install easybox`☆10May 28, 2021Updated 4 years ago
- ☆13Aug 21, 2022Updated 3 years ago
- My personal vim/neovim configuration files, dotfiles, docs and other scripts.☆14Feb 22, 2026Updated 2 weeks ago
- Self-collected data for Masked Face recognition paper (300+ different participants)☆12Jul 13, 2023Updated 2 years ago
- SJTU | CS 2612, Programming Languages and Compilers, Fall 2023☆13Jan 9, 2024Updated 2 years ago
- Developing VLMs for expert-level performance in specific medical specialties☆22Apr 25, 2025Updated 10 months ago
- The Source Code for OmniVideoBench @ICLR 2026☆61Feb 12, 2026Updated 3 weeks ago
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".☆92Feb 6, 2026Updated last month
- An amateur personal translation of "Debunking C++ Myths"☆21Jul 13, 2025Updated 7 months ago
- code for our paper "Attention Distillation: self-supervised vision transformer students need more guidance" in BMVC 2022☆17Oct 4, 2022Updated 3 years ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Feb 19, 2025Updated last year
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24May 5, 2025Updated 10 months ago
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated 9 months ago
- [EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning☆36Oct 22, 2025Updated 4 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆56Feb 11, 2026Updated 3 weeks ago
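- A cross-platform, high-performance C++11 server. Under stress testing it sustains 50,000 to 70,000 concurrent connections. It lets clients access images stored on the server, optimizes memory usage, and implements skip-list-based KV storage with CRUD operations, giving responsive, low-latency server performance.☆22Aug 7, 2023Updated 2 years ago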
- ☆20Apr 26, 2024Updated last year
- Code release for "An Erudite Fine-Grained Visual Classification Model" (CVPR 2023)☆17Jun 2, 2023Updated 2 years ago
- Dual-Stage Approach Toward Hyperspectral Image Super-Resolution☆21Apr 20, 2024Updated last year
- Code repository for "Post-pre-training for Modality Alignment in Vision-Language Foundation Models" (CVPR2025)☆38Jul 25, 2025Updated 7 months ago
- Streaming Video Instruction Tuning☆52Feb 25, 2026Updated last week