An open-source implementaion for fine-tuning Llama3.2-Vision series by Meta.
☆176Oct 21, 2025Updated 5 months ago
Alternatives and similar repositories for Llama3.2-Vision-Finetune
Users that are interested in Llama3.2-Vision-Finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An open-source implementaion for fine-tuning Pixtral by MistralAI.☆23Feb 5, 2025Updated last year
- An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.☆1,808Apr 10, 2026Updated last week
- An open-source implementaion for fine-tuning SmolVLM.☆65Sep 12, 2025Updated 7 months ago
- ☆390Feb 8, 2025Updated last year
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Aug 22, 2018Updated 7 years ago
- [JCIM'25] AccFG: Functional Group Extraction and Molecular Structure Comparison☆28Dec 27, 2025Updated 3 months ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- ☆13May 12, 2025Updated 11 months ago
- NodeJS application frawework, service oriented, isomorphic, real-time☆27Mar 1, 2026Updated last month
- Wikitext format dataset of Namuwiki (Most famous Korean wikipedia)☆53Oct 25, 2020Updated 5 years ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year
- AgentOS is a lightweight, single-file implementation that provides a robust foundation for building autonomous AI agents. It implements t…☆22Jul 11, 2025Updated 9 months ago
- Project to implement phase based video motion magnification in python☆14Jun 21, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆17Oct 1, 2024Updated last year
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 3 months ago
- Image Text Segmentation using FAST corner detection and DBSCAN clustering with k-d tree data structure☆14Feb 27, 2019Updated 7 years ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆35Jan 2, 2025Updated last year
- ☆19Dec 19, 2025Updated 4 months ago
- WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario (COLING 2025)☆12Jan 5, 2025Updated last year
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆57Updated this week
- The most modern LLM evaluation toolkit☆70Nov 9, 2025Updated 5 months ago
- Can VLMs understand students' hand-drawn math work?☆18Jan 20, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Developing a Korean LLM model : Hate Speech Filtering, Improving conversational skills, Finetuning with the RLHF method☆20May 27, 2025Updated 10 months ago
- A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision,…☆371Feb 28, 2026Updated last month
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- OpenOrca-KO dataset을 활용하여 llama2를 fine-tuning한 Korean-OpenOrca☆18Nov 1, 2023Updated 2 years ago
- Korean Commonsense Knowledge Graph☆15Dec 23, 2022Updated 3 years ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- [ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models☆43Jun 4, 2024Updated last year
- Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head☆14Nov 7, 2022Updated 3 years ago
- 🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time☆11Dec 18, 2025Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts☆337Jul 17, 2024Updated last year
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆23Jul 10, 2025Updated 9 months ago
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆36Jul 22, 2025Updated 8 months ago
- Gradio chat interface for FastMLX☆12Sep 22, 2024Updated last year
- ☆12Mar 28, 2022Updated 4 years ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Jul 14, 2025Updated 9 months ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19May 29, 2023Updated 2 years ago