[ICPR-2024] S-MultiMAE - A Multi-Ground Truth approach for RGB-D Saliency Detection
☆12Dec 13, 2024Updated last year
Alternatives and similar repositories for s-multimae
Users that are interested in s-multimae are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the DocLLM paper for Llama models.☆13Apr 6, 2025Updated 11 months ago
- Vietnamese handwritten text recognition system☆18May 2, 2021Updated 4 years ago
- a dataset for camera-based table detection☆16Jul 30, 2021Updated 4 years ago
- ☆19Mar 10, 2023Updated 3 years ago
- Create TensorRT-runtime for vietocr☆12Jun 8, 2021Updated 4 years ago
- Reinforcement Learning using PyTorch☆11Jan 18, 2024Updated 2 years ago
- Path visualizer using vanilla javascript☆16Jan 23, 2025Updated last year
- Lite-HRNet: A Lightweight High-Resolution Network☆22Oct 6, 2022Updated 3 years ago
- PyTorch implementation of "Pyramid Scene Parsing Network".☆16Nov 7, 2021Updated 4 years ago
- Play games in the OpenAI gym using the keyboard☆16Nov 21, 2017Updated 8 years ago
- This repository contains a set of scripts created by GPT-4, an advanced AI language model by OpenAI. The project demonstrates how AI can …☆20Mar 20, 2023Updated 3 years ago
- Asynchronous MongoDB client based on asyncio☆15Nov 18, 2016Updated 9 years ago
- A TensorFlow implementation of hybird CNN-LSTM model with CTC loss for OCR problem☆33Mar 11, 2019Updated 7 years ago
- A simplified Factorio clone developed in Python. A game that involves supply chain optimization☆24Mar 10, 2025Updated last year
- [TIP 23] Official implementation of HiDAnet: RGB-D Salient Object Detection via Hierarchical Depth Awareness☆24Aug 21, 2023Updated 2 years ago
- An official repository for "TypeDance: Creating Semantic Typographic Logos from Image through Personalized Generation." (https://arxiv.or…☆23Mar 13, 2024Updated 2 years ago
- ☆67Feb 8, 2024Updated 2 years ago
- Handwritten OCR using CRNN☆15Feb 8, 2018Updated 8 years ago
- ☆18Jun 24, 2024Updated last year
- Official implementation of the ANLS* metric☆22Mar 11, 2026Updated 2 weeks ago
- Home Action Genome: Cooperative Contrastive Action Understanding☆22Nov 8, 2021Updated 4 years ago
- [ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning☆73Dec 17, 2025Updated 3 months ago
- Convert xml/json from labelme to binary images☆27Feb 28, 2019Updated 7 years ago
- Based on BrainTransformers, BrainGPTForCausalLM is a Large Language Model (LLM) implemented using Spiking Neural Networks (SNN). We are e…☆32Oct 22, 2024Updated last year
- This is the official github repository of "An Open and Large-Scale Dataset for Multi-Modal Climate Change-aware Crop Yield Predictions", …☆37Sep 28, 2025Updated 5 months ago
- Code and experiment data for ICDM'19 paper, tabular cell classification using pre-trained cell embeddings. Note that the code and data is…☆29Jul 6, 2023Updated 2 years ago
- Pre-trained Word2Vec models for Vietnamese☆161Dec 30, 2020Updated 5 years ago
- This repository is created to share current progress of transformer based optical character recognition(OCR). Welcome to share~☆55Oct 9, 2023Updated 2 years ago
- This is simple code of SpikedAttention (Neurips 2024)☆23Mar 30, 2025Updated 11 months ago
- ☆21Aug 23, 2023Updated 2 years ago
- ☆37Oct 9, 2025Updated 5 months ago
- [ACM MM 2021] Diverse Image Inpainting with Bidirectional and Autoregressive Transformers☆33Oct 17, 2021Updated 4 years ago
- Implementation for ECCV 2020 paper: AABO: Adaptive Anchor Box Optimization for Object Detection via Bayesian Sub-sampling.☆30Aug 23, 2020Updated 5 years ago
- This is the implentation of our paper "SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking" i…☆44Mar 2, 2025Updated last year
- An implementation of PSPNet: Pyramid Scene Parsing Network, CVPR2017☆22May 14, 2018Updated 7 years ago
- The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.☆90Jun 18, 2025Updated 9 months ago
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆34Jan 3, 2024Updated 2 years ago
- CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts☆163Jun 8, 2024Updated last year
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆154May 14, 2025Updated 10 months ago