[ECCV 2022] Official pytorch implementation of "mc-BEiT: Multi-choice Discretization for Image BERT Pre-training" in European Conference on Computer Vision (ECCV) 2022.
☆22Sep 13, 2022Updated 3 years ago
Alternatives and similar repositories for mc-BEiT
Users who are interested in mc-BEiT are comparing it to the libraries listed below.
- ☆21Jul 20, 2022Updated 3 years ago
- Official codes for ConMIM (ICLR 2023)☆58Feb 8, 2023Updated 3 years ago
- Turning to Video for Transcript Sorting☆49Aug 27, 2023Updated 2 years ago
- A visual LLM for image region description or QA.☆16Jul 14, 2023Updated 2 years ago
- A benchmark suite for Scalable Diverse Model Selection for Accessible Transfer Learning from our NeurIPS 2021 paper.☆15Dec 14, 2022Updated 3 years ago
- Matlab codes of GTH☆11Apr 18, 2019Updated 6 years ago
- Tensorflow implementation of a supervised approach to learn highly compressed image representations☆26Nov 22, 2017Updated 8 years ago
- [ICLR 2022] Official pytorch implementation of "Uncertainty Modeling for Out-of-Distribution Generalization" in International Conference …☆162Mar 27, 2022Updated 4 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆22Mar 19, 2022Updated 4 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆159Dec 6, 2024Updated last year
- ☆21Aug 16, 2021Updated 4 years ago
- PyTorch implementation of Hessian-Affine local feature detector☆27Nov 22, 2017Updated 8 years ago
- A curated list of papers and resources for text-to-image evaluation.☆30Sep 6, 2023Updated 2 years ago
- A much powerful probing method to tune your model with promising performance and linear probing training cost!☆15Jul 26, 2023Updated 2 years ago
- Masked Surfel Prediction for Self-Supervised Point Cloud Learning☆27Dec 6, 2023Updated 2 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆110Dec 8, 2023Updated 2 years ago
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆197Jan 11, 2023Updated 3 years ago
- Free-form Description-guided 3D Visual Graph Networks for Object Grounding in Point Cloud☆17Jun 23, 2022Updated 3 years ago
- ☆12Sep 6, 2023Updated 2 years ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- Matlab implementation of "K-Nearest Neighbors Hashing" (CVPR2019)☆27Jun 15, 2019Updated 6 years ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆21Sep 24, 2025Updated 6 months ago
- ☆12May 19, 2023Updated 2 years ago
- ☆22Jul 25, 2018Updated 7 years ago
- ☆12Jul 18, 2024Updated last year
- Code for the CVPR'19 paper "Explore-Exploit Graph Traversal for Image Retrieval"☆31Aug 15, 2020Updated 5 years ago
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 8 months ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆15Oct 30, 2020Updated 5 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Jupyter notebooks for cloud-based usage☆10Aug 26, 2023Updated 2 years ago
- [VisDA2020 1st Place] Our solution to Domain Adaptive Pedestrian Re-identification in VisDA2020☆57Oct 12, 2020Updated 5 years ago
- ☆12Mar 16, 2022Updated 4 years ago
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆35Nov 19, 2025Updated 4 months ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency☆17Dec 2, 2021Updated 4 years ago
- A Multitask Conversational Vision-Language Model for Radiology☆16Jul 3, 2025Updated 8 months ago
- Code release for "MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos"(CVPR2023)☆14Dec 14, 2023Updated 2 years ago
- Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)☆19Mar 9, 2024Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆108Jul 24, 2023Updated 2 years ago