☆16Dec 25, 2021Updated 4 years ago
Alternatives and similar repositories for Qc-TextCap
Users that are interested in Qc-TextCap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- The imdb files with SBD-Trans OCR for TextVQA dataset.☆11Nov 30, 2021Updated 4 years ago
- ☆24Oct 8, 2023Updated 2 years ago
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]☆57Apr 5, 2022Updated 4 years ago
- [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps☆24Mar 29, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This data-set includes 10480 images including three folders namely Accident –Detection, Vehicles-in-Accidents and Accident-Severity. The …☆29Nov 15, 2018Updated 7 years ago
- A neural text style transfer model☆12Jun 23, 2019Updated 6 years ago
- ☆10Jul 27, 2018Updated 7 years ago
- Source code for the NAACL 2021 paper: Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation☆15Jul 19, 2021Updated 4 years ago
- NLP command-line assistant powered by OpenAI☆21Jan 27, 2024Updated 2 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆19Feb 22, 2021Updated 5 years ago
- Pytorch implementation of "Enhancing Chinese Pre-trained Language Model via Heterogeneous Linguistics Graph", ACL 2022☆15Feb 28, 2022Updated 4 years ago
- ☆44Aug 2, 2021Updated 4 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆38Dec 19, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Jul 23, 2020Updated 5 years ago
- PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".☆23Sep 19, 2021Updated 4 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 5 years ago
- English-Thai Machine Translation with OPUS data☆19Feb 10, 2020Updated 6 years ago
- Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client to server for live transcription and optio…☆16Mar 11, 2026Updated 2 months ago
- Document Visual Question Answering☆131Jul 30, 2020Updated 5 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 3 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- 在TableBank的基础上,进一步标注到单元格精度,利用目标检测/分割实现单元格定位。☆14Dec 11, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GMEG☆31Nov 21, 2024Updated last year
- natual language guided image captioning☆88Feb 11, 2024Updated 2 years ago
- Fine grained annotations extending hateful memes dataset with additional labels for identifying protected categories and attack types.☆26Aug 24, 2021Updated 4 years ago
- CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations☆30Oct 27, 2023Updated 2 years ago
- Convert MathML to Latex for OneNote to Markdown☆13Mar 17, 2026Updated 2 months ago
- ☆16Dec 22, 2021Updated 4 years ago
- ☆68Sep 7, 2023Updated 2 years ago
- CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)☆36Nov 12, 2022Updated 3 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35May 2, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 4 years ago
- Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…☆56Oct 30, 2024Updated last year
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 7 months ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆106Mar 31, 2025Updated last year
- ☆13Apr 21, 2024Updated 2 years ago
- Phát triển ứng dụng web☆14Jan 7, 2022Updated 4 years ago