haotian-liu / transformers_llava
☆13 · Updated 2 years ago
Alternatives and similar repositories for transformers_llava
Users who are interested in transformers_llava are comparing it to the libraries listed below.
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene… ☆32 · Updated 2 years ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models" ☆36 · Updated last year
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models ☆28 · Updated last year
- LoRA fine-tuned Stable Diffusion Deployment ☆31 · Updated 2 years ago
- The open source community's implementation of the all-new Multi-Modal Causal Attention from "DeepSpeed-VisualChat: Multi-Round Multi-Imag… ☆11 · Updated last year
- Official implementation of ECCV24 paper: POA ☆24 · Updated 9 months ago
- PyTorch implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 · Updated last week
- ☆13 · Updated 9 months ago
- A Data Source for Reasoning Embodied Agents ☆19 · Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 · Updated last year
- ☆13 · Updated last year
- The open source implementation of the base model behind GPT-4 from OpenAI [Language + Multi-Modal] ☆10 · Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 · Updated 3 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 · Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta ☆16 · Updated 6 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta ☆12 · Updated 6 months ago
- GitHub repo for Peifeng's internship project ☆13 · Updated last year
- Load any CLIP model with a standardized interface ☆20 · Updated last year
- Tools for content datamining and NLP at scale ☆43 · Updated 11 months ago
- ☆29 · Updated last year
- Code for the paper "Accessing higher dimensions for unsupervised word translation" ☆21 · Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 · Updated 10 months ago
- ☆19 · Updated 2 months ago
- Finetune any model on HF in less than 30 seconds ☆57 · Updated last month
- ☆20 · Updated 11 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 · Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers" ☆19 · Updated last year
- Simple implementation of TinyGPTV in super simple Zeta lego blocks ☆15 · Updated 6 months ago
- NExT-GPT: Any-to-Any Multimodal Large Language Model ☆19 · Updated 7 months ago
- A fast approach for translating a series of text prompts into a video. The 2022 NeurIPS Workshop on Machine Learning for Creativity and D… ☆32 · Updated last year