om-ai-lab / OmDet
Real-time and accurate open-vocabulary end-to-end object detection
☆1,482Updated last week
Related projects: ⓘ
- Open source deep learning based fine-grained image recognition toolbox built on PyTorch🔥☆567Updated 3 months ago
- OMG-LLaVA and OMG-Seg codebase☆1,222Updated last month
- A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, wh…☆1,899Updated last year
- [ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"☆517Updated 4 months ago
- ☆106Updated 5 months ago
- [CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"☆260Updated last month
- [CVPR'23] Universal Instance Perception as Object Discovery and Retrieval☆1,488Updated last year
- Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/☆526Updated 3 months ago
- An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,220Updated last month
- Tiny3D is a next generation of 3D AI service production system.☆601Updated last year
- ☆1,960Updated 2 months ago
- A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results☆574Updated 2 years ago
- Comprehensive Deep Learning Tutorial : From Zero To Hero☆806Updated last month
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆902Updated last month
- Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms…☆565Updated last month
- [NeurIPS 2022] Official Code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering☆132Updated last year
- Matryoshka Query Transformer for Large Vision-Language Models☆88Updated 2 months ago
- Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,086Updated 10 months ago
- Generative Neural Methods Based On Model Iteration☆516Updated last year
- A powerful baseline for image classification and face recognition with Pytorch☆543Updated last week
- PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.☆1,114Updated 2 months ago
- ☆1,014Updated this week
- ☆108Updated 8 months ago
- [ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for …☆1,422Updated 7 months ago
- Visualize the PaddleOCR2PyTorch project online to make the PaddleOCR experience and deployment easier.☆27Updated last year
- This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral☆445Updated last month
- CSGHub Server is the backend server for CSGHub which helps user to manage datasets, model files, codes and more. CSGHub Server是开源大模型资产管理平…☆408Updated this week
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆754Updated last week
- Evaluating dynamics capability of T2V generation models with DEVIL protocols.☆321Updated 3 weeks ago
- GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation☆575Updated 7 months ago