feizc/Visual-LLaMA

Open LLaMA Eyes to See the World

☆175

Alternatives and similar repositories for Visual-LLaMA

Users who are interested in Visual-LLaMA are comparing it to the repositories listed below.

feizc / Perceiver-Music-Generation
music generation with perceiver-ar model
☆26 · Jul 20, 2022 · Updated 4 years ago
feizc / Visual-ChatGLM
Open ChatGLM Eyes to See the World
☆13 · Mar 30, 2023 · Updated 3 years ago
Victorwz / VaLM
VaLM: Visually-augmented Language Modeling. ICLR 2023.
☆56 · Mar 6, 2023 · Updated 3 years ago
feizc / IEA
Image Editing Anything
☆114 · Apr 11, 2023 · Updated 3 years ago
CLUEbenchmark / LGEB
LGEB: Benchmark of Language Generation Evaluation
☆16 · Oct 21, 2022 · Updated 3 years ago
donglixp / ICL_PaperList
Paper List for In-context Learning 🌷
☆19 · Jan 3, 2023 · Updated 3 years ago
feizc / Gradient-Free-Textual-Inversion
Gradient-Free Textual Inversion for Personalized Text-to-Image Generation
☆44 · Jan 23, 2023 · Updated 3 years ago
linfeng93 / Large-UniDet
A practice for million-scale multi-domain universal object detection
☆28 · Jun 13, 2024 · Updated 2 years ago
feizc / MLE-LLaMA
Multi-language Enhanced LLaMA
☆301 · Apr 13, 2023 · Updated 3 years ago
feizc / Vespa
Video Diffusion State Space Models
☆19 · Mar 27, 2024 · Updated 2 years ago
mshukor / eP-ALM
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27 · Oct 27, 2023 · Updated 2 years ago
Meituan-AutoML / VisionLLaMA
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
☆392 · Jul 9, 2024 · Updated 2 years ago
BlinkDL / LM-Trick-Questions
Here we collect trick questions and failed tasks for open source LLMs to improve them.
☆32 · Apr 20, 2023 · Updated 3 years ago
OpenGVLab / LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,916 · Mar 14, 2024 · Updated 2 years ago
X-PLUG / mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
☆2,535 · Apr 2, 2025 · Updated last year
HDETR / H-PETR-Pose
[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".
☆14 · Sep 1, 2022 · Updated 3 years ago
allenai / mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
☆953 · Mar 19, 2025 · Updated last year
feizc / Video-Stable-Diffusion
Generate consistent videos with stable diffusion models
☆51 · Jan 20, 2023 · Updated 3 years ago
VisionLearningGroup / visda21-dev
☆46 · Aug 25, 2021 · Updated 4 years ago
mlfoundations / open_flamingo
An open-source framework for training large multimodal models.
☆4,114 · Aug 31, 2024 · Updated last year
Flowerfan / Trackron
Unified Object Tracking Framework
☆51 · Jun 20, 2022 · Updated 4 years ago
luogen1996 / LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
☆522 · Jan 27, 2024 · Updated 2 years ago
CASIA-LMC-Lab / Obj2Seq
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)
☆85 · Nov 2, 2022 · Updated 3 years ago
wvangansbeke / Revisiting-Contrastive-SSL
Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]
☆89 · Oct 2, 2021 · Updated 4 years ago
muirbench / MuirBench
A Comprehensive Benchmark for Robust Multi-image Understanding
☆21 · Sep 4, 2024 · Updated last year
lucabarsellotti / awesome-open-vocabulary-semantic-segmentation
☆15 · May 7, 2024 · Updated 2 years ago
yuhangzang / OV-DETR
[Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)
☆240 · Aug 3, 2022 · Updated 3 years ago
vaguenebula / AlpacaDataReflect
An experiment to see if chatgpt can improve the output of the stanford alpaca dataset
☆12 · Mar 29, 2023 · Updated 3 years ago
lc222 / BELLE-LORA
LoRA fine-tuning of BLOOMZ, following BELLE
☆25 · Mar 24, 2023 · Updated 3 years ago
palchenli / VL-Instruction-Tuning
☆90 · Nov 25, 2023 · Updated 2 years ago
yaohungt / Cross-Domain-Landmarks-Selection-CDLS-
[CVPR'16] [MATLAB] Cross Domain Landmarks Selection
☆14 · Jul 5, 2016 · Updated 10 years ago
clin1223 / VLDet
[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)
☆191 · Mar 22, 2024 · Updated 2 years ago
The-Shuai / DeIL
Python code to implement DeIL, a CLIP based approach for open-world few-shot learning.
☆19 · Nov 4, 2024 · Updated last year
BAAI-DCAI / Visual-Instruction-Tuning
SVIT: Scaling up Visual Instruction Tuning
☆167 · Jun 20, 2024 · Updated 2 years ago
princetonvisualai / pointingqa
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
☆19 · Oct 4, 2022 · Updated 3 years ago
dhg-wei / DeCap
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
☆144 · Mar 16, 2023 · Updated 3 years ago
baaivision / EVA
EVA Series: Visual Representation Fantasies from BAAI
☆2,686 · Aug 1, 2024 · Updated last year
Vision-CAIR / ChatCaptioner
Official Repository of ChatCaptioner
☆468 · Apr 13, 2023 · Updated 3 years ago
thunlp / Seq2Seq-Prompt
Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"
☆24 · Sep 21, 2022 · Updated 3 years ago