tinyvision / SOLIDERLinks
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent
☆1,463Updated 2 years ago
Alternatives and similar repositories for SOLIDER
Users that are interested in SOLIDER are comparing it to the libraries listed below
Sorting:
- Real-time and accurate open-vocabulary end-to-end object detection☆1,349Updated 10 months ago
- [CVPR'23] Universal Instance Perception as Object Discovery and Retrieval☆1,277Updated 2 years ago
- Open source deep learning based fine-grained image recognition toolbox built on PyTorch🔥☆467Updated last year
- DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, Aligned…☆3,114Updated last year
- Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]☆1,327Updated 3 weeks ago
- Large-Scale Visual Representation Model☆699Updated last month
- [ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"☆461Updated last year
- A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results☆460Updated 3 years ago
- ☆1,372Updated last year
- Create textures for 3d models using stable-diffusion and blender☆585Updated 2 years ago
- [CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"☆428Updated 5 months ago
- 🔥 🔥 🔥 [NeurIPS 2024] Official Implementation of Hawk: Learning to Understand Open-World Video Anomalies☆222Updated 7 months ago
- The first open autoregressive foundational video AI model.☆2,897Updated last year
- A powerful baseline for image classification, face recognition and image retrieval with Pytorch☆583Updated last week
- This is an open source project that can track and segment specific objects in video streams by manual clicks, box selections, or text pro…☆140Updated this week
- [ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for …☆1,361Updated last year
- (ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Experts☆85Updated 2 weeks ago
- Structure your STEM essay in several minutes with Generative AI.☆706Updated last year
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,132Updated 2 months ago
- The interface allowing users to submit Solana order intents for consumption by the Urani Protocol.☆385Updated last year
- A window.fetch JavaScript polyfill.☆686Updated last year
- Orderbook PoC for the demo day at the Solana Colosseum Accelerator (This is not the Urani Protocol).☆264Updated last year
- [CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"☆226Updated last year
- [CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception☆598Updated last year
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs☆161Updated 8 months ago
- [NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"☆1,078Updated last year
- [NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering☆104Updated 7 months ago
- ☆161Updated last year
- NodeJS module to create polyfill bundles tailored to individual user-agents.☆675Updated last year
- Secure storage system based on cryptography|基于密码学的安全存储系统☆851Updated 8 months ago