Hawkeye-FineGrained / Hawkeye
Open-source, deep-learning-based fine-grained image recognition toolbox built on PyTorch🔥
☆454 Updated 9 months ago
Alternatives and similar repositories for Hawkeye:
Users who are interested in Hawkeye are comparing it to the libraries listed below:
- A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results ☆457 Updated 3 years ago
- A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, wh… ☆1,441 Updated last year
- Real-time and accurate open-vocabulary end-to-end object detection ☆1,295 Updated 2 months ago
- A powerful baseline for image classification, face recognition and image retrieval with Pytorch ☆516 Updated last week
- ☆209 Updated last month
- Large-Scale Visual Representation Model ☆609 Updated 3 weeks ago
- [ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for … ☆1,330 Updated last year
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs ☆151 Updated this week
- ☆1,381 Updated 5 months ago
- ☆160 Updated 5 months ago
- Improving Generalist Model with Domain-Specific Experts ☆85 Updated 2 months ago
- OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24] ☆1,249 Updated 3 months ago
- CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarch… ☆242 Updated last year
- Extended Agriculture-Vision Dataset: A continuous work of Agriculture-Vision, with great collaborators to bring Agriculture and Computer … ☆245 Updated 2 weeks ago
- A PyTorch Computer Vision (CV) module library for building n-D networks flexibly ~ ☆367 Updated 5 months ago
- 🔥 🔥 🔥 [NeurIPS 2024] Hawk: Learning to Understand Open-World Video Anomalies ☆186 Updated this week
- Official repository of MMGenBench ☆119 Updated this week
- [NeurIPS 2022] Official Code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering ☆100 Updated 5 months ago
- [CVPR'23] Universal Instance Perception as Object Discovery and Retrieval ☆1,264 Updated last year
- DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, Aligned… ☆3,025 Updated 9 months ago
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy ☆63 Updated last month
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models ☆128 Updated 2 months ago
- ☆500 Updated last month
- Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cros… ☆468 Updated last year
- Framework of fast implementation data processing and operating pipelines ☆399 Updated 7 months ago
- ☆149 Updated 5 months ago
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS ☆895 Updated 3 weeks ago
- Open-Tax is an AI-powered cloud platform transforming tax compliance through automated data integration, real-time anomaly detection, and… ☆402 Updated last month