Victorwz/VaLM

VaLM: Visually-augmented Language Modeling. ICLR 2023.

☆56

Alternatives and similar repositories for VaLM

Users who are interested in VaLM are comparing it to the repositories listed below.

ChenyuHeidiZhang / VL-commonsense
☆14 · May 23, 2022 · Updated 4 years ago
ImKeTT / ZeroGen
[NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation
☆14 · Oct 7, 2023 · Updated 2 years ago
nlpapereading / nlpapereading
☆58 · Sep 23, 2022 · Updated 3 years ago
zhjohnchan / bert-clip-synesthesia
[Findings of ACL-2023] This is the official implementation of "On the Difference of BERT-style and CLIP-style Text Encoders".
☆14 · Jun 7, 2023 · Updated 3 years ago
microsoft / FoundationModels
☆13 · Aug 20, 2021 · Updated 4 years ago
gmftbyGMFTBY / MomentumDecoding
Momentum Decoding: Open-ended Text Generation as Graph Exploration
☆19 · Jan 27, 2023 · Updated 3 years ago
YujieLu10 / IACE-NLU
Official repo for "Imagination-Augmented Natural Language Understanding", NAACL 2022.
☆17 · Aug 30, 2022 · Updated 3 years ago
BlinkDL / LM-Trick-Questions
Here we collect trick questions and failed tasks for open source LLMs to improve them.
☆32 · Apr 20, 2023 · Updated 3 years ago
shizhediao / DaVinci
Source code for the paper "Prefix Language Models are Unified Modal Learners"
☆45 · Apr 30, 2023 · Updated 3 years ago
microsoft / Efficient-Large-LM-Trainer
☆39 · Jul 25, 2024 · Updated 2 years ago
TencentARC / GVT
Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".
☆59 · Jun 27, 2023 · Updated 3 years ago
BatsResearch / ex2
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17 · Apr 4, 2024 · Updated 2 years ago
UCSC-VLAA / Sight-Beyond-Text
[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
☆20 · Sep 15, 2023 · Updated 2 years ago
cognitiveailab / BYTESIZED32
Byte-sized text games for code generation tasks on virtual environments
☆20 · Jul 8, 2024 · Updated 2 years ago
HanSolo9682 / CounterCurate
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19 · Jun 27, 2024 · Updated 2 years ago
HDETR / H-PETR-Pose
[CVPR2023] This is an official implementation of the paper "DETRs with Hybrid Matching".
☆14 · Sep 1, 2022 · Updated 3 years ago
McGill-NLP / diffusion-itm
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"
☆33 · Mar 15, 2024 · Updated 2 years ago
microsoft / klite
[NeurIPS 2022] Code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222
☆54 · Jun 12, 2023 · Updated 3 years ago
SLAB-NLP / Multi-Prompt-LLM-Evaluation
State of What Art? A Call for Multi-Prompt LLM Evaluation
☆16 · Apr 10, 2026 · Updated 3 months ago
bozheng-hit / VoCapXLM
Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"
☆20 · Nov 12, 2021 · Updated 4 years ago
researchmm / soho
[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
☆208 · Sep 30, 2022 · Updated 3 years ago
allenai / mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
☆953 · Mar 19, 2025 · Updated last year
yxuansu / MAGIC
Language Models Can See: Plugging Visual Controls in Text Generation
☆261 · Jun 1, 2022 · Updated 4 years ago
jiyounglee-0523 / VisAlign
☆20 · Apr 23, 2024 · Updated 2 years ago
arijitray1993 / COLA
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25 · May 14, 2026 · Updated 2 months ago
google-deepmind / svo_probes
The SVO-Probes Dataset for Verb Understanding
☆29 · Jan 28, 2022 · Updated 4 years ago
donglixp / ICL_PaperList
Paper List for In-context Learning 🌷
☆19 · Jan 3, 2023 · Updated 3 years ago
linzhiqiu / visual_gpt_score
VisualGPTScore for visio-linguistic reasoning
☆27 · Oct 7, 2023 · Updated 2 years ago
LeonHLJ / Teach-DETR
Teach-DETR: Better Training DETR with Teachers
☆32 · Mar 18, 2024 · Updated 2 years ago
microsoft / FIBER
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
☆131 · Oct 10, 2023 · Updated 2 years ago
EastTower16 / LLMDataDistill
Distill large-scale web page text
☆12 · Jul 29, 2023 · Updated 2 years ago
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.
☆33 · May 16, 2024 · Updated 2 years ago
linfeng93 / Large-UniDet
A practice for million-scale multi-domain universal object detection
☆28 · Jun 13, 2024 · Updated 2 years ago
ylsung / VL_adapter
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
☆212 · Dec 18, 2022 · Updated 3 years ago
yonatanbitton / wysiwyr
☆37 · Oct 7, 2023 · Updated 2 years ago
mukhal / intrinsic-source-citation
[COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models
☆19 · Apr 1, 2025 · Updated last year
wyu97 / RACo
Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.
☆24 · Nov 23, 2022 · Updated 3 years ago
WangFei-2019 / SNARE
Project for SNARE benchmark
☆11 · Jun 5, 2024 · Updated 2 years ago
CheungZeeCn / fairseq
rebert model code based on fairseq
☆15 · Feb 28, 2021 · Updated 5 years ago