2U1 / Phi3-Vision-Finetune
An open-source implementation for fine-tuning Phi3-Vision and Phi3.5-Vision by Microsoft.
☆72Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for Phi3-Vision-Finetune
- An open-source implementation for fine-tuning Llama3.2-Vision series by Meta.☆84Updated 2 weeks ago
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆54Updated 5 months ago
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…☆88Updated 5 months ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specific…☆58Updated 2 months ago
- [ECCV 2024] Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve perfor…☆310Updated 7 months ago
- 1-Click is all you need.☆59Updated 6 months ago
- ☆55Updated last year
- MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning☆341Updated 3 months ago
- Evolve LLM training instructions, from English instructions to any language.☆113Updated last year
- E5-V: Universal Embeddings with Multimodal Large Language Models☆175Updated 4 months ago
- A family of highly capable yet efficient large multimodal models☆167Updated 3 months ago
- [NeurIPS 2024] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to im…☆102Updated 5 months ago
- [ACL 2024 Findings] Official PyTorch Implementation code for realizing the technical part of CoLLaVO: Crayon Large Language and Vision mO…☆93Updated 4 months ago
- Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators"☆14Updated 2 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆115Updated 10 months ago
- The Universe of Evaluation. All about the evaluation for LLMs.☆219Updated 4 months ago
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …☆14Updated this week
- Newsletter bot for 🤗 Daily Papers☆107Updated this week
- An open-source implementation for fine-tuning Qwen2-VL series by Alibaba Cloud.☆120Updated 2 weeks ago
- ☆53Updated this week
- ☆17Updated this week
- LLaVA-HR: High-Resolution Large Language-Vision Assistant☆212Updated 3 months ago
- LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images☆319Updated last month
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆179Updated last month
- (WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, B…☆81Updated 2 months ago
- Paper reproduction of Google SCoRE (Training Language Models to Self-Correct via Reinforcement Learning)☆112Updated 2 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆84Updated 2 months ago
- This repo contains the code and data for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks"☆77Updated this week
- An open-source implementation for fine-tuning Molmo-7B-D and Molmo-7B-O by allenai.☆30Updated last month