zh-plus / Awesome-VLP-and-Efficient-TransformerView external linksLinks
Vision-Language Pretraining & Efficient Transformer Papers.
☆15Nov 30, 2021Updated 4 years ago
Alternatives and similar repositories for Awesome-VLP-and-Efficient-Transformer
Users that are interested in Awesome-VLP-and-Efficient-Transformer are comparing it to the libraries listed below
Sorting:
- The official implementation of paper "Can Textual Gradient Work in Federated Learning?" accepted at ICLR 2025☆16Mar 10, 2025Updated 11 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- ☆15Dec 10, 2024Updated last year
- This project is an unofficial summary of the resources related to VALSE and its annual seminar. Its main purpose is to more facilitate yo…☆20Jun 16, 2024Updated last year
- kenlm语言模型,并提供python的rest服务☆30Aug 1, 2018Updated 7 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 5 years ago
- modification of official bert for downstream task☆32Mar 24, 2023Updated 2 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)☆75Aug 25, 2021Updated 4 years ago
- Multiple Meta-model Quantifying for Medical Visual Question Answering (MICCAI 2021)☆37Oct 12, 2022Updated 3 years ago
- 豆瓣电影评论可视化☆10May 19, 2016Updated 9 years ago
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆15Apr 22, 2021Updated 4 years ago
- ☆12Aug 30, 2022Updated 3 years ago
- MMBERT: Multimodal BERT Pretraining for Improved Medical VQA☆38Mar 22, 2021Updated 4 years ago
- Quiz and assignment solutions for Coursera MOOC - Aerial Robotics☆13Aug 15, 2016Updated 9 years ago
- 关于behance爬虫项目☆10May 16, 2019Updated 6 years ago
- ☆14Dec 25, 2024Updated last year
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- Action recognition based on action graph, which describes the spatio-temporal relationship between dense trajectory clusters. The program…☆11Jan 7, 2015Updated 11 years ago
- Multi-labels anime image classification in rust☆12Mar 10, 2023Updated 2 years ago
- Port of Chromaprint C/C++ library to Ruby to extract fingerprints from audio sources.☆12Nov 7, 2013Updated 12 years ago
- ☆11Jul 17, 2024Updated last year
- A tool allowing students of Coursera's Heterogeneous Parallel Programming to work on homework using a machine without a CUDA GPU.☆11Mar 11, 2015Updated 10 years ago
- Official repository for "Stylized Adversarial Training" (TPAMI 2022)☆11Dec 30, 2022Updated 3 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- Single shot neural network pruning before training the model, based on connection sensitivity☆11Aug 7, 2019Updated 6 years ago
- Vectorize Image Data to SVG using POTRACE. Based on multilabel-potrace by Hugo Raguet, which is based on potrace by Peter Selinger.☆15Jul 26, 2025Updated 6 months ago
- Official implementation of Lightweight Human Pose Estimation Using Loss Weighted by Target Heatmap that was honorably mentioned as Best P…☆12Dec 17, 2023Updated 2 years ago
- The source code of paper “HAZY RE-ID: AN INTERFERENCE SUPPRESSION MODEL FOR DOMAIN ADAPTATION PERSON RE-IDENTIFICATION UNDER INCLEMENT WE…☆12May 26, 2021Updated 4 years ago
- python3 利用用TF特征向量和Simhash指纹计算中文文本的相似度的示例☆10Dec 13, 2019Updated 6 years ago
- Pytorch implementation of various token mixers; Attention Mechanisms, MLP, and etc for understanding computer vision papers and other tas…☆16Oct 7, 2024Updated last year
- Autoencoder for multi-label classification using Google's Tensorflow framework and MDMR for feature selection.☆10Aug 31, 2017Updated 8 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- ☆11Mar 19, 2024Updated last year
- Video Summarization Transformer: Implementation in PyTorch of the Transformer model for video summarisation☆10Oct 27, 2020Updated 5 years ago
- ☆10Jul 20, 2020Updated 5 years ago
- Deep learning for named entity recognition on CoNLL-2003☆10Dec 23, 2016Updated 9 years ago