shizhediao/DaVinci

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shizhediao/DaVinci)

shizhediao / DaVinci

Source code for the paper "Prefix Language Models are Unified Modal Learners"

☆45

Alternatives and similar repositories for DaVinci

Users that are interested in DaVinci are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nuaa-nlp / Multimodality
View on GitHub
☆15Dec 10, 2021Updated 4 years ago
VincentDENGP / 3D-LR
View on GitHub
Can 3D Vision-Language Models Truly Understand Natural Language?
☆20Mar 28, 2024Updated 2 years ago
zhjohnchan / bert-clip-synesthesia
View on GitHub
[Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.
☆14Jun 7, 2023Updated 3 years ago
Share14 / ShareGemini
View on GitHub
☆32Jul 29, 2024Updated 2 years ago
ychen-stat-ml / kernel-adapters
View on GitHub
Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…
☆11Feb 6, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
woojeongjin / FewVLM
View on GitHub
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)
☆42May 13, 2022Updated 4 years ago
philschmid / deep-learning-remote-runner
View on GitHub
☆16Aug 10, 2022Updated 3 years ago
HYPJUDY / Sparkles
View on GitHub
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models
☆46Jun 14, 2024Updated 2 years ago
ajd12342 / why-winoground-hard
View on GitHub
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆31May 29, 2023Updated 3 years ago
shizhediao / automate-cot
View on GitHub
Source code for the paper "Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data"
☆20Feb 24, 2024Updated 2 years ago
zhjohnchan / PTUnifier
View on GitHub
[ICCV-2023] Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts
☆78Mar 22, 2024Updated 2 years ago
microsoft / FIBER
View on GitHub
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
☆131Oct 10, 2023Updated 2 years ago
cooelf / CompassMTL
View on GitHub
Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)
☆22Oct 17, 2022Updated 3 years ago
naver-ai / elva
View on GitHub
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …
☆20Mar 13, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
microsoft / BridgeTower
View on GitHub
Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"
☆168Jul 6, 2023Updated 3 years ago
NEUIR / P3Ranker
View on GitHub
[SIGIR '22] Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Pr…
☆18Sep 24, 2023Updated 2 years ago
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated 2 years ago
swaggy-TN / EfficientVLM
View on GitHub
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)
☆33Jul 18, 2023Updated 3 years ago
JIA-Lab-research / AGSS-VOS
View on GitHub
AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation
☆20Sep 27, 2021Updated 4 years ago
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
neulab / gemini-benchmark
View on GitHub
☆151Jan 4, 2024Updated 2 years ago
zerovl / ZeroVL
View on GitHub
[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources
☆46Sep 29, 2022Updated 3 years ago
Georgelingzj / up-to-date-Vision-Language-Models
View on GitHub
Up-to-date Vision Language Models collection. Mainly focus on computer vision
☆20Feb 9, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
FreedomIntelligence / DPTDR
View on GitHub
Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
☆26Aug 7, 2023Updated 2 years ago
INK-USC / ReCross
View on GitHub
ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation
☆23May 1, 2022Updated 4 years ago
belindal / LaMPP
View on GitHub
Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action
☆37Apr 3, 2023Updated 3 years ago
MichaelZhouwang / Sequence_Span_Rewriting
View on GitHub
Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
☆17Nov 30, 2021Updated 4 years ago
Timothyxxx / LMsMBTI
View on GitHub
A MBTI test on Large Language Model like GPT-3.
☆28May 2, 2022Updated 4 years ago
TobiasLee / VEC
View on GitHub
Visual and Embodied Concepts evaluation benchmark
☆21Oct 10, 2023Updated 2 years ago
Toloka / BestPrompts
View on GitHub
Best Prompts for Text-to-Image Models
☆25Jan 20, 2024Updated 2 years ago
duskybomb / hopfield-network
View on GitHub
Implementation of Hopfield Neural Network in Python based on Hebbian Learning Algorithm
☆13Aug 10, 2019Updated 6 years ago
timjogorman / Multisentence-AMR-guidelines
View on GitHub
Guidelines for our secondary layer of annotation adding multi-sentence AMR links
☆12Sep 6, 2017Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
nuaa-nlp / Evaluation-of-ChatGPT
View on GitHub
☆14Apr 15, 2023Updated 3 years ago
shizhediao / ChatGPTPapers
View on GitHub
Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.
☆333Aug 10, 2023Updated 2 years ago
junha1125 / Vision-Language-Model-in-ECCV-2024
View on GitHub
☆17Oct 1, 2024Updated last year
YifanZhang07 / Core-tuning
View on GitHub
This repository is the official implementation of Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regulari…
☆21Dec 17, 2022Updated 3 years ago
Aketirani / audio-mnist
View on GitHub
Gender Recognition By Voice Analysis
☆12May 24, 2025Updated last year
kaistAI / Volcano
View on GitHub
[NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…
☆49Aug 21, 2024Updated last year
williamium3000 / awesome-mllm-grounding
View on GitHub
Awesome paper for multi-modal llm with grounding ability
☆21Oct 11, 2025Updated 9 months ago