tsujuifu/pytorch_empirical-mvm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tsujuifu/pytorch_empirical-mvm)

tsujuifu / pytorch_empirical-mvm

A PyTorch implementation of EmpiricalMVM

☆41

Alternatives and similar repositories for pytorch_empirical-mvm

Users that are interested in pytorch_empirical-mvm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tsujuifu / pytorch_tvc
View on GitHub
A PyTorch implementation of TVC
☆24Dec 18, 2023Updated 2 years ago
tsujuifu / pytorch_violet
View on GitHub
A PyTorch implementation of VIOLET
☆138Dec 17, 2023Updated 2 years ago
tsujuifu / pytorch_bco
View on GitHub
A PyTorch implementation of BCO
☆12Jun 19, 2023Updated 3 years ago
microsoft / LAVENDER
View on GitHub
A Unified Framework for Video-Language Understanding
☆62Jun 17, 2023Updated 3 years ago
facebookresearch / DVDialogues
View on GitHub
Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14Oct 12, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
microsoft / SwinBERT
View on GitHub
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
☆250May 26, 2022Updated 4 years ago
YunseokJANG / tgif-qa
View on GitHub
Repository for our CVPR 2017 and IJCV: TGIF-QA
☆180Sep 6, 2021Updated 4 years ago
yj-yu / CiSIN
View on GitHub
Character Grounding and Re-Identification in Story of Videos and Text Descriptions
☆10Jan 17, 2021Updated 5 years ago
kevinlin311tw / METRO
View on GitHub
Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"
☆17May 22, 2021Updated 5 years ago
tsujuifu / pytorch_ldast
View on GitHub
A PyTorch implementation of LDAST
☆26Dec 17, 2023Updated 2 years ago
HanqingWangAI / SSM-VLN
View on GitHub
Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"
☆43Jul 31, 2021Updated 4 years ago
AlexGidiotis / DANCER-summ
View on GitHub
Code for the paper "A Divide-and-Conquer Approach to the Summarization of Long Documents"
☆18Jun 8, 2021Updated 5 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
View on GitHub
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34May 14, 2020Updated 6 years ago
Haawron / SLURM_allocated_gres_visualizer
View on GitHub
The app for visualizing allocated GPUs by SLURM
☆13Jan 21, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
kodenii / ORES
View on GitHub
ORES: Open-vocabulary Responsible Visual Synthesis
☆14Dec 12, 2023Updated 2 years ago
marcusm117 / IdentityChain
View on GitHub
[ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
☆11Nov 24, 2025Updated 7 months ago
TalalWasim / Vita-CLIP
View on GitHub
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆126Jul 1, 2023Updated 3 years ago
tsujuifu / pytorch_sscr
View on GitHub
A PyTorch implementation of SSCR
☆23Aug 12, 2024Updated last year
mugen-org / MUGEN_baseline
View on GitHub
multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the traini…
☆42Apr 1, 2023Updated 3 years ago
reallsp / SAF
View on GitHub
☆12Sep 6, 2023Updated 2 years ago
TencentARC / MCQ
View on GitHub
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
☆141Jul 20, 2022Updated 4 years ago
hdjsjyl / face-faster-rcnn.pytorch
View on GitHub
A face detection base on faster-rcnn.pytorch
☆10Feb 9, 2018Updated 8 years ago
Adit31 / Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning
View on GitHub
Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
☆13Jun 26, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
KimHyeonwoo / go-hangul
View on GitHub
A package for Hangul (korean alphabet)
☆13Dec 19, 2022Updated 3 years ago
ZhangXu0963 / VSL
View on GitHub
The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.
☆15Dec 25, 2023Updated 2 years ago
kimyuji / EvolvingQA_benchmark
View on GitHub
Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)
☆10Oct 16, 2024Updated last year
gohsyi / PeerLoss
View on GitHub
Learning with Noisy Labels by adopting a peer prediction loss function.
☆35Mar 3, 2020Updated 6 years ago
KumarRobotics / kr_3d_active_ms_slam
View on GitHub
[RA-L 2024] 3D Active Metric-Semantic SLAM
☆17Jul 21, 2025Updated last year
zhengsipeng / VRDFormer_VRD
View on GitHub
☆17Jun 4, 2023Updated 3 years ago
bellos1203 / TCD
View on GitHub
Code for "Class-Incremental Learning for Action Recognition in Videos", ICCV 2021
☆22Oct 14, 2022Updated 3 years ago
tsujuifu / pytorch_vsum-ptr-gan
View on GitHub
A PyTorch implementation of VSumPtrGAN
☆39Dec 17, 2023Updated 2 years ago
jason2133 / CS224W
View on GitHub
Stanford CS224W: Machine Learning with Graphs (GNN)
☆12Sep 6, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ablodge / leamr
View on GitHub
A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel…
☆16Dec 10, 2022Updated 3 years ago
mlfoundations / patching
View on GitHub
Patching open-vocabulary models by interpolating weights
☆91Sep 28, 2023Updated 2 years ago
P-bibs / Lobster
View on GitHub
Lobster: A GPU-Accelerated Framework for Neurosymbolic Programming
☆17Mar 26, 2026Updated 3 months ago
manzoku23 / M1-Pytorch-Tutorial
View on GitHub
Pytorch Tutorial for M1 students. This repository include Encoder Deocder model and Classification model building code.
☆12Jun 1, 2022Updated 4 years ago
famstack-dev / local-llm-bench
View on GitHub
Local LLM benchmark tool for comparing engines (MLX vs llama.cpp), scenarios on Apple Silicon
☆20Jun 19, 2026Updated last month
salesforce / ALPRO
View on GitHub
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
☆188May 1, 2025Updated last year
Feifannaner / awesome-human-action-recognition
View on GitHub
list the most popular methods about human action recognition
☆73Sep 26, 2019Updated 6 years ago