berniebear/Multi-HT100M

☆53

Alternatives and similar repositories for Multi-HT100M

Users who are interested in Multi-HT100M are comparing it to the repositories listed below.

hainow / MCTN
☆49 · Feb 4, 2019 · Updated 7 years ago
jsenellart / papers
This repo contains notes and implementations for cherry-picked publications of particular interest to me
☆12 · May 14, 2020 · Updated 6 years ago
zmykevin / UC2
CVPR 2021 Official PyTorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
☆34 · Nov 9, 2021 · Updated 4 years ago
VALUE-Leaderboard / DataRelease
Data Release for VALUE Benchmark
☆30 · Feb 16, 2022 · Updated 4 years ago
formiel / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆20 · May 13, 2026 · Updated 2 months ago
XL2248 / CPCC
Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"
☆12 · Dec 17, 2021 · Updated 4 years ago
rowanz / merlot
MERLOT: Multimodal Neural Script Knowledge Models
☆226 · Mar 15, 2022 · Updated 4 years ago
salesforce / ALPRO
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
☆188 · May 1, 2025 · Updated last year
lyy1994 / reformer
An NMT framework built on Joint Representation
☆12 · Feb 19, 2020 · Updated 6 years ago
roudimit / c2kd
Code for the C2KD paper (ICASSP 2023)
☆20 · May 15, 2023 · Updated 3 years ago
lishunyao97 / Pun-GAN
Pun-GAN: Generative Adversarial Network for Pun Generation (EMNLP 2019)
☆40 · Aug 19, 2019 · Updated 6 years ago
antoine77340 / S3D_HowTo100M
S3D Text-Video model trained on HowTo100M using MIL-NCE
☆200 · Jul 3, 2020 · Updated 6 years ago
linjieli222 / HERO
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆235 · Sep 16, 2021 · Updated 4 years ago
antoine77340 / MIL-NCE_HowTo100M
PyTorch GPU distributed training code for MIL-NCE HowTo100M
☆221 · Jul 5, 2022 · Updated 4 years ago
HS-YN / PanoAVQA
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16 · Oct 12, 2021 · Updated 4 years ago
XL2248 / VHM
Code for the ACL 2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization"
☆18 · Sep 5, 2022 · Updated 3 years ago
jayleicn / ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730 · Aug 8, 2023 · Updated 2 years ago
e-bug / iglue
[ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"
☆49 · Dec 7, 2022 · Updated 3 years ago
ARIES-LM / GMNMT
☆30 · Nov 3, 2020 · Updated 5 years ago
Cloud-CV / vilbert-multi-task
12-in-1: Multi-Task Vision and Language Representation Learning Web Demo
☆35 · Dec 8, 2022 · Updated 3 years ago
applenob / tf_jieba
TensorFlow Operation Wrapper of cppjieba (Chinese Word Segmentation)
☆10 · Oct 21, 2019 · Updated 6 years ago
bzhangGo / sltunet
SLTUNET: A Simple Unified Model for Sign Language Translation (ICLR 2023)
☆39 · Jul 10, 2023 · Updated 3 years ago
yuewang-cuhk / awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
☆1,159 · Aug 19, 2022 · Updated 3 years ago
ictnlp / STEMM
Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
☆35 · Oct 25, 2023 · Updated 2 years ago
Trunpm / TPT-for-VideoQA
☆19 · Nov 25, 2022 · Updated 3 years ago
HenryHZY / VL-PET
[ICCV 2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"
☆53 · Sep 21, 2023 · Updated 2 years ago
microsoft / GEM
☆25 · Jun 25, 2021 · Updated 5 years ago
wenz116 / DRFT
End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021
☆18 · Oct 24, 2021 · Updated 4 years ago
JerryYLi / valhalla-nmt
Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"
☆28 · Feb 19, 2023 · Updated 3 years ago
XMUDeepLIT / mc_tit
Code for ACL 2023 paper: Exploring Better Text Image Translation with Multimodal Codebook
☆21 · Apr 19, 2026 · Updated 3 months ago
XMUDeepLIT / DCCN
Code for "Dynamic Context-guided Capsule Network for Multimodal Machine Translation" (ACM MM 2020)
☆42 · Jan 22, 2022 · Updated 4 years ago
intersun / LightningDOT
Source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT
☆72 · Nov 14, 2022 · Updated 3 years ago
ChenRocks / UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
☆799 · Jun 30, 2021 · Updated 5 years ago
forwchen / HVTG
Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"
☆17 · Aug 25, 2020 · Updated 5 years ago
UriSha / EmbeddinglessNMT
The implementation of "Neural Machine Translation without Embeddings", NAACL 2021
☆33 · Jun 9, 2021 · Updated 5 years ago
sunzewei2715 / Graformer
The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models
☆24 · Sep 22, 2021 · Updated 4 years ago
FingerRec / OA-Transformer
[CVPR 2022] The code for our paper "Object-aware Video-language Pre-training for Retrieval"
☆61 · May 25, 2022 · Updated 4 years ago
TengdaHan / TemporalAlignNet
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆122 · Oct 9, 2023 · Updated 2 years ago
JeongHun0716 / e-mvsr
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)
☆20 · Mar 17, 2025 · Updated last year