yeluo1994 / DSBI
Double-Sided Braille Image Dataset
☆26 Updated 5 years ago
Alternatives and similar repositories for DSBI
Users who are interested in DSBI are comparing it to the repositories listed below
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week. ☆34 Updated 4 years ago
- This repository shows how to implement a basic model for multimodal entailment. ☆10 Updated 4 years ago
- Solution of Kaggle competition: Feedback Prize - Evaluating Student Writing ☆16 Updated 3 years ago
- ☆27 Updated 10 months ago
- ☆22 Updated 4 years ago
- Code for AAAI 2023 Paper: “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models” ☆18 Updated 3 years ago
- ☆44 Updated 4 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training ☆18 Updated last year
- Official Github Repo for the Findings of EMNLP 2021 paper "An animated picture says at least a thousand words: Selecting Gif-based Replie… ☆32 Updated 4 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 ☆49 Updated 3 years ago
- Synthesize image datasets of documents in natural scenes with Python+Blender3D ☆59 Updated 3 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆49 Updated 4 years ago
- A collection of models for image<->text generation in ACM MM 2021. ☆67 Updated 4 years ago
- The Transformer in PyTorch ☆13 Updated last year
- Implementing DropPath/StochasticDepth in PyTorch ☆17 Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆80 Updated this week
- Cross-lingual learning in scene text recognition (ICASSP2024) ☆18 Updated last year
- Benchmarking algorithms for assessing quality of data labeled by multiple annotators ☆34 Updated last month
- Implementation of the DocLLM paper for Llama models. ☆13 Updated 9 months ago
- A collection of multimodal datasets and visual features for VQA and captioning in PyTorch. Just run "pip install multimodal" ☆83 Updated 3 years ago
- ☆15 Updated 3 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in PyTorch ☆39 Updated 3 years ago
- A dashboard for exploring timm learning rate schedulers ☆19 Updated last year
- Large Scale BERT Distillation ☆33 Updated 2 years ago
- Official PyTorch code for U-Noise: Learnable Noise Masks for Interpretable Image Segmentation (ICIP 2021) ☆42 Updated 4 years ago
- ☆14 Updated 3 years ago
- ☆66 Updated 3 years ago
- This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models". ☆48 Updated 3 years ago
- ☆147 Updated 2 years ago
- ☆16 Updated 2 years ago