A collection of the world's best computer vision labs and lecture materials.
☆14Feb 23, 2025Updated last year
Alternatives and similar repositories for computer-vision-reference
Users who are interested in computer-vision-reference are comparing it to the libraries listed below
- This is a speaker-diarization-based speech recognition API implemented using FunASR.☆23Feb 12, 2026Updated last month
- 洛曦 (Luoxi) web subtitle printer, which displays dynamic, typewriter-style subtitles on the web. It supports an HTTP API, so it can work together with other programs.☆12Apr 15, 2025Updated 11 months ago
- Professor and Group List of CS☆10Mar 12, 2024Updated 2 years ago
- A growing collection of IT solutions, covering problems encountered during development related to .NET, Python, the web, and operating systems.☆12Apr 10, 2024Updated last year
- A cloth simulator based on CUDA and ARCSim.☆16Jan 4, 2026Updated 2 months ago
- Baidu OCR for Flutter☆12Dec 24, 2019Updated 6 years ago
- 🌟 Datawhale contributor visualization platform, available online at https://mv.datawhale.cc/☆33Updated this week
- ☆110Feb 19, 2026Updated last month
- ☆11May 24, 2024Updated last year
- [ICLR2025] γ-MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆43Oct 28, 2025Updated 4 months ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆23Dec 10, 2025Updated 3 months ago
- ☆12Feb 2, 2024Updated 2 years ago
- ☆11Jun 9, 2023Updated 2 years ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆15Mar 22, 2023Updated 3 years ago
- For those of you who have finished the beginner stage☆19Jan 29, 2025Updated last year
- An LLM leaderboard for stateful agents☆21Oct 16, 2025Updated 5 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- ☆10Nov 17, 2023Updated 2 years ago
- Official repository for "On the Multi-modal Vulnerability of Diffusion Models"☆16Jul 15, 2024Updated last year
- ☆11Apr 30, 2025Updated 10 months ago
- ☆16Mar 26, 2025Updated 11 months ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 5 months ago
- ☆21Jul 3, 2025Updated 8 months ago
- The official implementation of ADDP (ICLR 2024)☆12Mar 27, 2024Updated last year
- code for the paper Offline Prioritized Experience Replay☆12Jun 13, 2023Updated 2 years ago
- Code release for "Generative Modeling of Weights: Generalization or Memorization?"☆19Updated this week
- An Open Open source class☆22Feb 27, 2026Updated 3 weeks ago
- Processed datasets that we have used in our research☆14Apr 30, 2020Updated 5 years ago
- Official Repository for "LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions"☆15Apr 20, 2025Updated 11 months ago
- Awesome list for High Performance Computing / Parallel Computing resources.☆12Sep 20, 2017Updated 8 years ago
- Official Repository of paper MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Pol…☆79Jan 26, 2026Updated last month
- Code release for VTW (AAAI 2025 Oral)☆66Nov 4, 2025Updated 4 months ago
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm…☆31Mar 11, 2026Updated last week
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 5 months ago
- ☆20Sep 11, 2025Updated 6 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 4 months ago
- Official implementation of "Removing Batch Normalization Boosts Adversarial Training" (ICML'22)☆19Jul 20, 2022Updated 3 years ago
- Yuren 13B is an information synthesis large language model that has been continuously trained based on Llama 2 13B, which builds upon the…☆15Sep 25, 2023Updated 2 years ago