YujieLu10/TIP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YujieLu10/TIP)

YujieLu10 / TIP

Multimodal-Procedural-Planning

☆92

Alternatives and similar repositories for TIP

Users that are interested in TIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YujieLu10 / IACE-NLU
View on GitHub
Official repo for "Imagination-Augmented Natural Language Understanding", NAACL 2022.
☆17Aug 30, 2022Updated 3 years ago
yuleiniu / introd
View on GitHub
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13Dec 7, 2021Updated 4 years ago
VegB / iNLG
View on GitHub
Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation".
☆17Feb 3, 2023Updated 3 years ago
tsujuifu / pytorch_sscr
View on GitHub
A PyTorch implementation of SSCR
☆23Aug 12, 2024Updated last year
allenai / learning_from_interaction
View on GitHub
Learning about objects and their properties by interacting with them
☆12Oct 21, 2020Updated 5 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
facebookresearch / ProcedureVRL
View on GitHub
[CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"
☆56Aug 8, 2023Updated 2 years ago
YujieLu10 / LLMScore
View on GitHub
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
☆135Oct 25, 2023Updated 2 years ago
Vision-CAIR / ChatCaptioner
View on GitHub
Official Repository of ChatCaptioner
☆468Apr 13, 2023Updated 3 years ago
pleaseconnectwifi / DANCE
View on GitHub
PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)
☆23Nov 29, 2022Updated 3 years ago
xiangyu-mm / EasyGen
View on GitHub
The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"
☆73Nov 21, 2024Updated last year
M3-IT / YING-VLM
View on GitHub
Vision Large Language Models trained on M3IT instruction tuning dataset
☆17Aug 16, 2023Updated 2 years ago
salesforce / paprika
View on GitHub
Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"
☆50Jun 2, 2026Updated last month
alon-albalak / FLAD
View on GitHub
Few-shot Learning with Auxiliary Data
☆31Dec 8, 2023Updated 2 years ago
Leezekun / dialogic
View on GitHub
[EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"
☆34Feb 22, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
matt-seb-ho / WikiWhy
View on GitHub
WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…
☆49Dec 7, 2023Updated 2 years ago
fudan-zvg / TDAS
View on GitHub
☆18Jun 10, 2022Updated 4 years ago
zjuchenlong / WSAG
View on GitHub
[EMNLP'22] Weakly-Supervised Temporal Article Grounding
☆14Nov 25, 2023Updated 2 years ago
medhini / Instructional-Video-Summarization
View on GitHub
Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022
☆39Feb 17, 2023Updated 3 years ago
limanling / KnowledgeVL-Reading
View on GitHub
☆67Jun 18, 2023Updated 3 years ago
microsoft / MM-REACT
View on GitHub
Official repo for MM-REACT
☆967Jan 31, 2024Updated 2 years ago
qywu / FaceChat
View on GitHub
☆15Feb 28, 2023Updated 3 years ago
UCSB-AI / CPL
View on GitHub
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆35Dec 5, 2022Updated 3 years ago
Wangt-CN / EqBen
View on GitHub
[ICCV'23 Oral] The introduction and toolkit for EqBen Benchmark
☆123Dec 11, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SuReLI / llrl
View on GitHub
Lipschitz Lifelong RL
☆11Nov 6, 2020Updated 5 years ago
soCzech / LookForTheChange
View on GitHub
Code for Look for the Change paper published at CVPR 2022
☆36Oct 26, 2022Updated 3 years ago
liyongqi67 / MMCoQA
View on GitHub
☆31Dec 19, 2023Updated 2 years ago
kyegomez / AnyMAL
View on GitHub
The open source implementation of "AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model"
☆22Jan 27, 2025Updated last year
hongwang600 / Summarization
View on GitHub
☆39Aug 2, 2019Updated 6 years ago
mihdalal / optimus
View on GitHub
☆14Nov 1, 2023Updated 2 years ago
michaelsaxon / CoCoCroLa
View on GitHub
The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models
☆12Oct 28, 2024Updated last year
VALUE-Leaderboard / StarterCode
View on GitHub
Starter Code for VALUE benchmark
☆79Aug 23, 2022Updated 3 years ago
j-min / HiREST
View on GitHub
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
☆110Jan 23, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
thorinf / simple-diffusion-lm
View on GitHub
A simple DIffusion LM approach.
☆27May 22, 2023Updated 3 years ago
UCSB-NLP-Chang / diffusion_resampling
View on GitHub
Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024]
☆34Dec 12, 2023Updated 2 years ago
WildVision-AI / WildVision-Bench
View on GitHub
☆17Oct 21, 2024Updated last year
zzxslp / XL-VLN
View on GitHub
Dataset for Bilingual VLN
☆11Dec 5, 2020Updated 5 years ago
yfyuan01 / MultiturnFashionRetrieval
View on GitHub
SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback
☆14Oct 17, 2022Updated 3 years ago
pkunlp-icler / PCA-EVAL
View on GitHub
[ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
☆107Mar 14, 2024Updated 2 years ago
jq2276 / Learning2Copy
View on GitHub
☆20Feb 4, 2021Updated 5 years ago