yxuansu/MAGIC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yxuansu/MAGIC)

yxuansu / MAGIC

Language Models Can See: Plugging Visual Controls in Text Generation

☆261

Alternatives and similar repositories for MAGIC

Users that are interested in MAGIC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yxuansu / SimCTG
View on GitHub
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
☆478Mar 7, 2024Updated 2 years ago
YoadTew / zero-shot-image-to-text
View on GitHub
Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
☆279Sep 17, 2022Updated 3 years ago
DavidHuji / CapDec
View on GitHub
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
☆209Jan 28, 2024Updated 2 years ago
yxuansu / Awesome_Diffusions
View on GitHub
☆17Feb 20, 2023Updated 3 years ago
yxuansu / TaCL
View on GitHub
[NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning
☆94Jun 8, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ylsung / VL_adapter
View on GitHub
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
☆212Dec 18, 2022Updated 3 years ago
yxuansu / Contrastive_Search_versus_Contrastive_Decoding
View on GitHub
An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation
☆27Jun 7, 2024Updated 2 years ago
MikeWangWZHL / VidIL
View on GitHub
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
☆117Sep 15, 2022Updated 3 years ago
Victorwz / VaLM
View on GitHub
VaLM: Visually-augmented Language Modeling. ICLR 2023.
☆56Mar 6, 2023Updated 3 years ago
forence / Awesome-Visual-Captioning
View on GitHub
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
☆410Nov 14, 2022Updated 3 years ago
bearcatt / LaBERT
View on GitHub
A length-controllable and non-autoregressive image captioning model.
☆69Jun 10, 2021Updated 5 years ago
ShiYaya / emscore
View on GitHub
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Oct 20, 2022Updated 3 years ago
woojeongjin / FewVLM
View on GitHub
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)
☆42May 13, 2022Updated 4 years ago
TencentARC / FLM
View on GitHub
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
☆31May 15, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
LibertFan / ImageCaption
View on GitHub
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019
☆17Sep 8, 2019Updated 6 years ago
Cartus / AMR-Parser
View on GitHub
Better Transition-Based AMR Parsing with a Refined Search Space (authors' DyNet implementation for the EMNLP18 paper)
☆10Jun 13, 2019Updated 7 years ago
OFA-Sys / OFA
View on GitHub
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…
☆2,557Apr 24, 2024Updated 2 years ago
zjr2000 / Untrimmed-Video-Feature-Extractor
View on GitHub
A simple and effective feature extractor for untrimmed videos
☆13Sep 1, 2022Updated 3 years ago
gmftbyGMFTBY / MomentumDecoding
View on GitHub
Momentum Decoding: Open-ended Text Generation as Graph Exploration
☆19Jan 27, 2023Updated 3 years ago
YoadTew / zero-shot-video-to-text
View on GitHub
☆75Oct 22, 2022Updated 3 years ago
yxuansu / NAG-BERT
View on GitHub
[EACL'21] Non-Autoregressive with Pretrained Language Model
☆60Oct 10, 2022Updated 3 years ago
researchmm / generate-it
View on GitHub
A collection of models for image<->text generation in ACM MM 2021.
☆67Oct 31, 2021Updated 4 years ago
j-min / CLIP-Caption-Reward
View on GitHub
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
☆246Jun 10, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yxuansu / Contrastive_Search_Is_What_You_Need
View on GitHub
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆122Mar 5, 2023Updated 3 years ago
mad-red / VSR-guided-CIC
View on GitHub
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
☆36Mar 11, 2022Updated 4 years ago
yhlleo / MJP
View on GitHub
[CVPR 2023] An official Pytorch implementation of "Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers".
☆45Dec 21, 2024Updated last year
martiansideofthemoon / rankgen
View on GitHub
Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…
☆140Aug 2, 2023Updated 2 years ago
rmokady / CLIP_prefix_caption
View on GitHub
Simple image captioning model
☆1,421Jun 9, 2024Updated 2 years ago
kahne / NonAutoregGenProgress
View on GitHub
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
☆300Mar 15, 2023Updated 3 years ago
XiangLi1999 / Diffusion-LM
View on GitHub
Diffusion-LM
☆1,245Aug 8, 2024Updated last year
clip-vil / CLIP-ViL
View on GitHub
[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
☆419Oct 28, 2022Updated 3 years ago
microsoft / LAVENDER
View on GitHub
A Unified Framework for Video-Language Understanding
☆62Jun 17, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lucidrains / x-clip
View on GitHub
A concise but complete implementation of CLIP with various experimental improvements from recent papers
☆724Oct 16, 2023Updated 2 years ago
kxz18 / Stylized-Story-Generation-with-Style-Guided-Planning
View on GitHub
Codes for paper "Stylized Story Generation with Style-Guided Planning"
☆11May 9, 2021Updated 5 years ago
husthuaan / AAT
View on GitHub
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Dec 18, 2019Updated 6 years ago
princetonvisualai / imagecaptioning-bias
View on GitHub
Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning"
☆12Mar 26, 2026Updated 3 months ago
princetonvisualai / SPICE-U
View on GitHub
☆11Sep 7, 2020Updated 5 years ago
jamespark3922 / visual-comet
View on GitHub
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
☆87Jun 12, 2023Updated 3 years ago
siwooyong / Codalab-Microsoft-COCO-Image-Captioning-Challenge
View on GitHub
🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution(06.30.21)
☆23Apr 6, 2022Updated 4 years ago