serdaryildiz / MViT-TRView external linksLinks
Masked Vision Transformer for Text Recognition
☆11Nov 13, 2024Updated last year
Alternatives and similar repositories for MViT-TR
Users that are interested in MViT-TR are comparing it to the libraries listed below
Sorting:
- TRCaptionNet official repository☆13Jul 25, 2024Updated last year
- ENTIRe-ID☆25Jul 13, 2024Updated last year
- ☆10Oct 20, 2020Updated 5 years ago
- [ICPR 2024] Official repository of the paper "GenFormer - Generated Images are All You Need to Improve Robustness of Transformers on Smal…☆14Aug 30, 2024Updated last year
- A yolov5 based application, it uses the prediction results by yolov5 to activate the selected opencv built-in tracking algorithm.☆10Jul 24, 2020Updated 5 years ago
- Offical respority for Gait Recogniton with Drones: A benchmark (TMM 2023)☆10Feb 2, 2024Updated 2 years ago
- ☆12Oct 17, 2024Updated last year
- ☆17May 17, 2024Updated last year
- This is a simple codebase to train a Visual Geolocalization model through image retrieval methods, using PyTorch Lightning and the PyTorc…☆14Jun 29, 2023Updated 2 years ago
- ☆13Apr 3, 2023Updated 2 years ago
- Official implementation for RoMaP :Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampl…☆21Aug 5, 2025Updated 6 months ago
- Lowering PyTorch's Memory Consumption for Selective Differentiation☆12Aug 29, 2024Updated last year
- A system which includes a pair of stereo-cameras for 3D reconstruction, object detection and depth analysis with the help of disparity ma…☆13Jul 29, 2019Updated 6 years ago
- Turkish-Sentence Encoder with Quick-Thought Vectors☆11Dec 15, 2019Updated 6 years ago
- Quickly set-up a Python environment for machine learning and data science projects☆16Nov 25, 2017Updated 8 years ago
- ☆17Dec 25, 2023Updated 2 years ago
- A library for managing datasets for cross-view geolocalization (CVGL).☆14Jul 22, 2023Updated 2 years ago
- ☆15Nov 26, 2019Updated 6 years ago
- FakePartsBench: 25K+ AI-generated videos with pixel- and frame-level annotations of full and partial deepfakes.☆25Aug 31, 2025Updated 5 months ago
- Source-free unsupervised domain adaptation for cross-modality abdominal multi-organ segmentation☆16Mar 19, 2023Updated 2 years ago
- Code to implement Restormer-Plus, the Runner-up Solution to the GT-RAIN Challenge (CVPR 2023 UG2+ Track 3)☆14Oct 11, 2024Updated last year
- This is the official repository for the paper "Learning Sequence Descriptor based on Spatio-Temporal Attention for Visual Place Recogniti…☆18Oct 9, 2023Updated 2 years ago
- Land Cover Segmentation with Sparse Annotations from Sentinel-2 Imagery☆19Jan 26, 2024Updated 2 years ago
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)☆20Jan 18, 2026Updated 3 weeks ago
- Heterogeneous Relational Complement for Vehicle Re-identification, ICCV 2021☆23Oct 10, 2021Updated 4 years ago
- ☆18May 15, 2023Updated 2 years ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 3 months ago
- Structured Domain Adaptation with Online Relation Regularization for Unsupervised Person Re-ID☆18Jun 9, 2020Updated 5 years ago
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆24Feb 25, 2025Updated 11 months ago
- A instagram bot that automatically post a randomized daily quote from a series of quote worthy people.☆16Oct 5, 2022Updated 3 years ago
- We created a topic modeling pipeline to evaluate different topic modeling algorithms, including their performance on short and long text,…☆21May 22, 2025Updated 8 months ago
- Run fast LLM Inference using Llama.cpp in Python☆19Jan 3, 2024Updated 2 years ago
- ☆17Oct 28, 2022Updated 3 years ago
- Using Kinect2 Depth Sensors To Train Neural Network For Object Detection & Interaction☆18Dec 10, 2017Updated 8 years ago
- Source free Single and Multi target Unsupervised Domain Adaptation☆19Feb 8, 2023Updated 3 years ago
- ☆24Sep 5, 2025Updated 5 months ago
- Translate Markdown files from one language to another using OpenAI's API while retaining original formatting. This Jupyter notebook token…☆23Oct 15, 2023Updated 2 years ago
- Two stream Faster-RCNN evaluated on NYU Depth V2 dataset for RGBD object detection task.☆19Mar 16, 2018Updated 7 years ago
- An implementation of Deeplabv3plus in TensorFlow2 for semantic land cover segmentation☆22Oct 28, 2023Updated 2 years ago