Vision-Language Pretraining & Efficient Transformer Papers.
☆15Nov 30, 2021Updated 4 years ago
Alternatives and similar repositories for Awesome-VLP-and-Efficient-Transformer
Users that are interested in Awesome-VLP-and-Efficient-Transformer are comparing it to the libraries listed below.
- The official implementation of paper "Can Textual Gradient Work in Federated Learning?" accepted at ICLR 2025☆16Mar 10, 2025Updated last year
- Paper reading notes in the field of Image-Text Matching/Retrieval.☆13Mar 25, 2022Updated 4 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- This project is an unofficial summary of the resources related to VALSE and its annual seminar. Its main purpose is to more facilitate yo…☆20Jun 16, 2024Updated last year
- ☆29Aug 8, 2022Updated 3 years ago
- Good practices in VQA systems, such as POS-tag attention, structured triplet learning, and triplet attention, are very general and can be…☆19Jan 23, 2018Updated 8 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- Combines InstantID🔥 and FouriScale to generate high-resolution images!☆11Apr 3, 2024Updated last year
- BERT-series models, search, pruning, distillation☆13Sep 10, 2020Updated 5 years ago
- This repository contains recent research on uncertainty estimation. Inspired by other 'awesome' GitHub pages like awesome-deep-learnin…☆18Sep 16, 2020Updated 5 years ago
- A tool allowing students of Coursera's Heterogeneous Parallel Programming to work on homework using a machine without a CUDA GPU.☆11Mar 11, 2015Updated 11 years ago
- ☆14Dec 25, 2024Updated last year
- ☆14May 10, 2021Updated 4 years ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year
- A PyTorch implementation of our NeurIPS paper, PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph☆53Nov 22, 2022Updated 3 years ago
- Dataset of China's-image-related tweets during COVID-19 with aspect-level sentiment labels.☆17Feb 2, 2021Updated 5 years ago
- pre-trained vision and language model summary☆12Apr 20, 2021Updated 4 years ago
- An implementation that adapts pre-trained V+L models to downstream VQA tasks. Now supports: VisualBERT, LXMERT, and UNITER☆166Dec 11, 2022Updated 3 years ago
- MMBERT: Multimodal BERT Pretraining for Improved Medical VQA☆39Mar 22, 2021Updated 5 years ago
- Modification of the official BERT for downstream tasks☆32Mar 24, 2023Updated 3 years ago
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆28Nov 1, 2025Updated 4 months ago
- GCNs Analysis: Visualization, Error Cases etc.☆14Feb 15, 2023Updated 3 years ago
- Source code for GIFT (CIKM 22)☆12Oct 23, 2022Updated 3 years ago
- Official repository for "Stylized Adversarial Training" (TPAMI 2022)☆11Dec 30, 2022Updated 3 years ago
- Official PyTorch codes for "Enhancing Diffusion Models with Text-Encoder Reinforcement Learning", ECCV2024☆57Aug 13, 2024Updated last year
- Batch MultiHead Graph Attention Pytorch☆12Apr 4, 2020Updated 5 years ago
- Re-implementation for 'R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering'.☆12Mar 13, 2026Updated 2 weeks ago
- Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020☆13May 2, 2022Updated 3 years ago
- Official Code for "Intelligent Painter: Picture Composition With Resampling Diffusion Model" (ICIP 2023)☆17Jun 23, 2023Updated 2 years ago
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.☆14Dec 12, 2024Updated last year
- Tool to convert the Waymo Open Dataset to rosbag☆15Jun 4, 2020Updated 5 years ago
- A BERT-fusing architecture for Twitter sentiment analysis. Accepted at the AACL-IJCNLP 2020 Student Research Workshop.☆11Jun 12, 2023Updated 2 years ago
- Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)☆16Apr 23, 2024Updated last year
- MLPruning, PyTorch, NLP, BERT, Structured Pruning☆20Jun 29, 2021Updated 4 years ago
- ICLR 2023 - FedFA: Federated Feature Augmentation☆59Mar 28, 2023Updated 3 years ago
- Download methods for the vision-language continual pretraining dataset P9D.☆12Jan 3, 2025Updated last year
- Baidu speech synthesis☆20Oct 12, 2017Updated 8 years ago
- ☆21Jul 25, 2024Updated last year