☆23Aug 9, 2021Updated 4 years ago
Alternatives and similar repositories for vizwiz-caption
Users who are interested in vizwiz-caption are comparing it to the libraries listed below
- A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset …☆15Dec 12, 2023Updated 2 years ago
- Bottom-up features extractor implemented in PyTorch.☆72Dec 5, 2019Updated 6 years ago
- Dual-channel video stitching☆13Nov 13, 2022Updated 3 years ago
- Codebase for EA Modeling (for Transactions on Affective Computing paper)☆12Dec 8, 2022Updated 3 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- A chatbot webui that supports various multi-modal large language models☆11May 8, 2023Updated 2 years ago
- Dynamic mode decomposition in Python☆13Jun 9, 2015Updated 10 years ago
- Colored Kimia Path24 Dataset: Configurations and Benchmarks with Deep Embeddings☆10Jun 6, 2024Updated last year
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization.☆18Dec 24, 2024Updated last year
- PyTorch Implementation of the paper "Probabilistic Abduction for Visual Abstract Reasoning via Learning Rules in Vector-symbolic Architec…☆10Sep 18, 2025Updated 5 months ago
- Quick start for kubernetes deployment☆10Sep 15, 2022Updated 3 years ago
- This code shows how to train a model in Amazon SageMaker using a custom loss function for a binary classification problem in which the co…☆13Feb 21, 2019Updated 7 years ago
- Tools to build knowledge graphs from multi-modal extractions☆12Apr 2, 2020Updated 5 years ago
- drafts of LSRs I intend to file, am filing, or have filed as a legislator☆11Feb 3, 2026Updated last month
- Forward Direct Feedback Alignment Algorithm☆10Oct 23, 2024Updated last year
- A circular shared-memory queue locked with semaphores☆11Sep 9, 2019Updated 6 years ago
- ☆12Jun 20, 2023Updated 2 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆10Apr 18, 2023Updated 2 years ago
- Implementation of the Object Relation Transformer for Image Captioning☆180Sep 17, 2024Updated last year
- Uni-PrevPredMap: Extending PrevPredMap to a Unified Framework of Prior-Informed Modeling for Online Vectorized HD Map Construction☆15May 5, 2025Updated 10 months ago
- 🍅 Mobile deployment supporting the YOLOv5s, YOLOv4-tiny, MobileNetV2-YOLOv3-nano, Simple-Pose, and Yolact models on iOS and Android, using the NCNN framework.☆12Aug 20, 2020Updated 5 years ago
- ☆11Sep 16, 2019Updated 6 years ago
- ChartOCR, based on original repo.☆13Mar 22, 2023Updated 2 years ago
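- Introductory SpringCloud microservices tutorial, with basic integration demos of Eureka service registration and discovery, the Config configuration center, the BUS message bus, the FeignClient client, the Zuul gateway, Hystrix circuit breaking and degradation, the Stream message queue, Sleuth trace monitoring, and Swagger documentation.☆11Aug 26, 2024Updated last year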
- Pytorch implementation of Human-Level Control through Deep Reinforcement Learning☆11May 31, 2017Updated 8 years ago
- Dataset of lung ultrasound videos for research on AI-based medical image analysis☆17Nov 9, 2025Updated 4 months ago
- Running the most popular deep learning frameworks on Azure Batch AI☆25Jun 12, 2023Updated 2 years ago
- ☆11Nov 23, 2020Updated 5 years ago
- Single Image Dehazing: Dilated Squeeze-and-Excitation U-net (DSEU)☆10Dec 14, 2020Updated 5 years ago
- ☆14Oct 10, 2022Updated 3 years ago
- ☆10Jul 1, 2022Updated 3 years ago
- ☆14Jul 22, 2021Updated 4 years ago
- The imdb files with SBD-Trans OCR for TextVQA dataset.☆11Nov 30, 2021Updated 4 years ago
- ☆14Nov 2, 2023Updated 2 years ago
- VORNet: Spatio-temporally Consistent Video Inpainting for Object Removal, CVPRW 2019☆12Jul 18, 2019Updated 6 years ago
- Database of cinematographic data of real films through film annotations.☆14Aug 4, 2020Updated 5 years ago
- classify recapture images using laplacian filter and CNN network☆12Dec 20, 2019Updated 6 years ago
- Where is the emotion? Dissecting a multi-gap network for image emotion classification☆10Oct 18, 2020Updated 5 years ago