☆18Aug 23, 2022Updated 3 years ago
Alternatives and similar repositories for Official-ConvMAE-Det
Users that are interested in Official-ConvMAE-Det are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Feb 28, 2023Updated 3 years ago
- The multi-view version of MonoDETR on nuScenes dataset☆21Nov 4, 2022Updated 3 years ago
- [Codes of paper]: Region-based Non-local operation for Video Classification☆18Nov 28, 2021Updated 4 years ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆40Apr 2, 2023Updated 3 years ago
- ☆184Aug 20, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆16Jul 6, 2023Updated 2 years ago
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆85Aug 26, 2025Updated 10 months ago
- Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.☆14Oct 18, 2023Updated 2 years ago
- ☆57Oct 17, 2021Updated 4 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆529Mar 14, 2023Updated 3 years ago
- ☆19Sep 24, 2024Updated last year
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models☆21Jan 11, 2024Updated 2 years ago
- An object detection codebase based on MegEngine.☆28Dec 14, 2022Updated 3 years ago
- Official repository of paper: "FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation"☆26Mar 2, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Implementation of "Single Shot Video Object Detector"☆23Mar 25, 2020Updated 6 years ago
- code of [CVPR22] CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance☆18Jul 10, 2022Updated 3 years ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆66Oct 16, 2024Updated last year
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Sep 5, 2023Updated 2 years ago
- ☆170Oct 14, 2021Updated 4 years ago
- [ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments☆20Aug 19, 2025Updated 10 months ago
- ☆61Jun 17, 2022Updated 4 years ago
- ☆37May 7, 2023Updated 3 years ago
- ☆70Jun 9, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆25Jun 24, 2021Updated 5 years ago
- 生僻字OCR识别优化训练☆16Feb 16, 2023Updated 3 years ago
- ☆16Mar 5, 2023Updated 3 years ago
- Unofficial Paddle implementation of "Swin Transformer V2: Scaling Up Capacity and Resolution"☆33Nov 28, 2021Updated 4 years ago
- Open-Source EDA workshop for RISC-V community☆12Jul 27, 2022Updated 3 years ago
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆34Apr 18, 2022Updated 4 years ago
- An Examination of the Compositionality of Large Generative Vision-Language Models☆19Apr 9, 2024Updated 2 years ago
- Transfer PaddlePaddle's codes to TensorLayerX's codes☆10Feb 10, 2023Updated 3 years ago
- ☆53May 3, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Pytorch implementation for the paper: "RVCDet: Rethinking Voxelization and Classification for 3D Object Detection" [ICONIP-2022]☆13Apr 15, 2024Updated 2 years ago
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector☆37Mar 7, 2024Updated 2 years ago
- [NeurIPS'25 Spotlight🔥]Official implementation of Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Ma…☆36May 11, 2026Updated last month
- [AAAI 2024] ConceptBed Evaluations for Personalized Text-to-Image Diffusion Models☆25Jun 1, 2023Updated 3 years ago
- minRAG is a RAG system that starts from scratch, pursuing the ultimate simplicity and power. It consists of no more than 10,000 lines of …☆14Jun 4, 2026Updated 3 weeks ago
- Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing imag…☆566Apr 21, 2024Updated 2 years ago
- A digital twin of the city of Chicago along with automated sensors☆13Nov 14, 2019Updated 6 years ago