tinyvision / SOLIDER
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent
☆1,438Updated last year
Alternatives and similar repositories for SOLIDER:
Users that are interested in SOLIDER are comparing it to the libraries listed below
- Open source deep learning based fine-grained image recognition toolbox built on PyTorch🔥☆450Updated 8 months ago
- [CVPR'23] Universal Instance Perception as Object Discovery and Retrieval☆1,259Updated last year
- Real-time and accurate open-vocabulary end-to-end object detection☆1,167Updated 2 months ago
- DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, Aligned…☆3,008Updated 8 months ago
- OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]☆1,231Updated 2 months ago
- A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results☆458Updated 3 years ago
- [ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"☆460Updated 9 months ago
- Create textures for 3d models using stable-diffusion and blender☆580Updated last year
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,004Updated 3 months ago
- ☆158Updated 4 months ago
- [CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"☆212Updated 4 months ago
- [ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for …☆1,325Updated last year
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs☆148Updated 4 months ago
- Official repository of MMGenBench☆119Updated 2 months ago
- ☆1,381Updated 4 months ago
- ☆90Updated 11 months ago
- [NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models☆96Updated 7 months ago
- [NeurIPS 2022] Official Code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering☆99Updated 5 months ago
- Large-Scale Visual Representation Model☆566Updated this week
- Improving Generalist Model with Domain-Specific Experts☆82Updated last month
- [NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation☆92Updated 4 months ago
- Structure your STEM essay in several minutes with Generative AI.☆708Updated 5 months ago
- 【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models☆1,708Updated this week
- A powerful baseline for image classification, face recognition and image retrieval with Pytorch☆480Updated last week
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy☆61Updated 3 weeks ago
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆932Updated 6 months ago
- [ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"☆35Updated last month
- Orderbook PoC for the demo day at the Solana Colosseum Accelerator (This is not the Urani Protocol).☆261Updated 5 months ago
- A window.fetch JavaScript polyfill.☆688Updated 11 months ago