☆54Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for LPMDataset
Users that are interested in LPMDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An ambiguous subtitles dataset for visual scene-aware machine translation☆14Oct 17, 2022Updated 3 years ago
- running LayoutLMv2☆11Apr 27, 2022Updated 4 years ago
- ☆10Apr 7, 2024Updated 2 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆20Jun 2, 2025Updated 11 months ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Jun 28, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆11Aug 26, 2021Updated 4 years ago
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…☆44Sep 16, 2022Updated 3 years ago
- Network Pruning That Matters: A Case Study on Retraining Variants (ICLR 2021)☆17Sep 19, 2021Updated 4 years ago
- ☆13Jun 4, 2020Updated 5 years ago
- Fairseq tutorial☆18May 18, 2022Updated 3 years ago
- Radiology Language Evaluations☆11Nov 17, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆105Mar 31, 2025Updated last year
- A pipeline architecture for temporal segmentation of video lectures.☆12Sep 8, 2020Updated 5 years ago
- Source code to "SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks"☆10Dec 17, 2023Updated 2 years ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆75Jan 13, 2025Updated last year
- Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.☆11Feb 12, 2026Updated 2 months ago
- This repo contains all the codes for SEScore implementation☆15Mar 3, 2025Updated last year
- wav2vec2 asr with transformers☆16Oct 26, 2021Updated 4 years ago
- Source code for MWO2KG and Echidna: Constructing and Exploring Knowledge Graphs from Maintenance Data☆10Feb 13, 2023Updated 3 years ago
- Official implementation of POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples (NeurIPS 2021)☆14Aug 6, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆23Nov 27, 2025Updated 5 months ago
- ☆12Dec 26, 2023Updated 2 years ago
- Supporting code for ReCEval paper☆32Sep 14, 2024Updated last year
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆24May 29, 2025Updated 11 months ago
- This script extracts the reviews from a given app store, it uses non-specific CSS selectors to prevent malfunctions in the future.☆10Oct 19, 2019Updated 6 years ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆22Mar 26, 2025Updated last year
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language".☆63Sep 30, 2020Updated 5 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- ☯︎[ACMMM'22] Official PyTorch Implementation of Towards Unbiased Visual Emotion Recognition via Causal Intervention☆20Jul 20, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Capturing Homogeneous Influence among Students: Hypergraph Cognitive Diagnosis for Intelligent Education Systems. This paper has been pub…☆18Jul 13, 2024Updated last year
- ☆25Jun 14, 2021Updated 4 years ago
- Evolution-ary Reinforcement Learning☆12Apr 16, 2017Updated 9 years ago
- Test-Time Label-Shift Adaptation☆13May 24, 2023Updated 2 years ago
- Code for the BEEU challenge winning paper.☆21Sep 5, 2022Updated 3 years ago
- Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval…☆39Oct 14, 2023Updated 2 years ago
- In this notebook, I am updating NLP notebooks, and projects☆10Jun 29, 2023Updated 2 years ago