dondongwon/LPMDataset

☆54

Alternatives and similar repositories for LPMDataset

Users that are interested in LPMDataset are comparing it to the libraries listed below.

sahanbull / context-agnostic-engagement
This repository contains the VLEngagement dataset and the helper functions/tools required to work with the dataset.
☆16 · Dec 3, 2021 · Updated 4 years ago
tatHi / optok
☆10 · Aug 26, 2021 · Updated 4 years ago
papermsucode / mdmmt
MDMMT: Multidomain Multimodal Transformer for Video Retrieval
☆26 · Jun 28, 2021 · Updated 5 years ago
ylsung / vl-merging
PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"
☆37 · Oct 11, 2023 · Updated 2 years ago
HYPJUDY / Sparkles
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models
☆45 · Jun 14, 2024 · Updated 2 years ago
vl-illusion / GVIL
Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"
☆15 · Jan 25, 2024 · Updated 2 years ago
nguyentthong / READ
[AAAI’24 Main] READ: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Vi…
☆10 · Jan 24, 2025 · Updated last year
libeineu / fairseq_mmt
This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…
☆43 · Sep 16, 2022 · Updated 3 years ago
lehduong / NPTM
Network Pruning That Matters: A Case Study on Retraining Variants (ICLR 2021)
☆17 · Sep 19, 2021 · Updated 4 years ago
som-shahlab / med-nota
☆15 · Jun 11, 2025 · Updated last year
de9uch1 / fairseq-tutorial
Fairseq tutorial
☆18 · May 18, 2022 · Updated 4 years ago
XiaochenWang-PSU / MedHMP
Source codes of the paper "Hierarchical Pretraining on Multimodal Electronic Health Records".
☆21 · Apr 10, 2024 · Updated 2 years ago
nttmdlab-nlp / SlideVQA
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
☆106 · Mar 31, 2025 · Updated last year
eduardorochasoares / easytopic
A pipeline architecture for temporal segmentation of video lectures.
☆12 · Sep 8, 2020 · Updated 5 years ago
SeitaroShinagawa / CLIP-visualization
Attention visualization in CLIP
☆17 · Dec 7, 2022 · Updated 3 years ago
nlp-tlp / mwo2kg-and-echidna
Source code for MWO2KG and Echidna: Constructing and Exploring Knowledge Graphs from Maintenance Data
☆10 · Feb 13, 2023 · Updated 3 years ago
xu1998hz / SEScore
This repo contains all the codes for SEScore implementation
☆15 · Mar 3, 2025 · Updated last year
qhgz2013 / HyperDNE
☆10 · Jul 30, 2023 · Updated 2 years ago
google / spiqa
Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]
☆76 · Jan 13, 2025 · Updated last year
JerryYLi / valhalla-nmt
Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"
☆28 · Feb 19, 2023 · Updated 3 years ago
turingmotors / vlm-recipes
☆20 · Aug 28, 2024 · Updated last year
zhiyuanhubj / Long_form_VideoQA
[EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering
☆18 · Oct 9, 2024 · Updated last year
com3dian / Grobidmonkey
The grobidmonkey package is an open-source package designed for postprocessing GROBID outputs.
☆12 · Mar 27, 2024 · Updated 2 years ago
Jack-ZC8 / M3AV-dataset
[ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
☆24 · May 29, 2025 · Updated last year
ronghanghu / vqa-maskrcnn-benchmark-m4c
Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…
☆13 · Jan 30, 2020 · Updated 6 years ago
wenkesj / evolution
Evolution-ary Reinforcement Learning
☆12 · Apr 16, 2017 · Updated 9 years ago
LividWo / Revisit-MMT
☆25 · Jun 14, 2021 · Updated 5 years ago
nalzok / test-time-label-shift
Test-Time Label-Shift Adaptation
☆14 · May 24, 2023 · Updated 3 years ago
zaynmi / seada-vqa
A PyTorch implementation of a data augmentation method for visual question answering
☆21 · May 25, 2023 · Updated 3 years ago
sairin1202 / SciXGen
Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation"
☆13 · Feb 14, 2022 · Updated 4 years ago
bojesomo / Weather4cast2021-SwinEncoderDecoder
☆10 · Oct 20, 2021 · Updated 4 years ago
StanfordMIMI / villa
[ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data
☆45 · Oct 15, 2023 · Updated 2 years ago
shengyuzhang / Poet
Poet: Product-oriented Video Captioner for E-commerce
☆12 · Sep 21, 2020 · Updated 5 years ago
stardust-coder / japanese-lm-med-harness
☆11 · Oct 2, 2024 · Updated last year
filby89 / NTUA-BEEU-eccv2020
Code for the BEEU challenge winning paper.
☆21 · Sep 5, 2022 · Updated 3 years ago
KMCS-NII / PDFNLT-1.0
Tools for Natural Language Text aware PDF structure analysis
☆15 · Mar 11, 2022 · Updated 4 years ago
fabriziocarpi / RLdecoding
Reinforcement Learning for Bit Flipping decoding of linear codes
☆14 · Sep 12, 2020 · Updated 5 years ago
StanfordMIMI / MedVAL
Toward Expert-Level Medical Text Validation with Language Models
☆18 · Oct 23, 2025 · Updated 9 months ago
RuishanLiu / GAN-TSC
☆11 · Oct 15, 2020 · Updated 5 years ago