neverbiasu / IELTSDuck
☆15Updated 2 months ago
Related projects
Alternatives and complementary repositories for IELTSDuck
- Diffusion Transformers (DiTs) trained on MNIST dataset☆61Updated 7 months ago
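- Applications of large language models and multimodal models, mainly covering RAG, small models, agents, cross-modal search, OCR, and more☆124Updated 2 weeks ago
- DPO training for Tongyi Qianwen (Qwen)☆27Updated 2 months ago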
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆105Updated 2 months ago
- Fine-tune Stable Diffusion with DreamBooth, LoRA, and ControlNet☆51Updated last year
- DeepSpeed Tutorial☆90Updated 3 months ago
- Materials for the Hugging Face Diffusion Models Course☆169Updated last year
- [ACL 2024 Best Paper] Deciphering Oracle Bone Language with Diffusion Models☆85Updated this week
- Notes mainly covering multimodal knowledge for large language model (LLM) algorithm (application) engineers☆89Updated 6 months ago
- ☆223Updated 8 months ago
- A PyTorch reimplementation of Stable Diffusion☆132Updated last year
- ☆69Updated 6 months ago
- Programmer interview questions and interview experiences from major tech companies☆106Updated 3 months ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆236Updated 2 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆21Updated 2 months ago
- ☆77Updated 4 months ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆83Updated last month
- ☆51Updated 8 months ago
- ☆26Updated 7 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆53Updated 3 weeks ago
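- A role-playing multi-LLM chat room fine-tuned from InternLM2, built on the original text of Journey to the West, its vernacular Chinese rendering, and ChatGPT-generated data. This project covers everything about role-playing LLMs, from data acquisition and data processing, to fine-tuning with XTuner and deploying to OpenXLab, to deploying with LMDeploy, with op…☆82Updated 7 months ago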
- ☆32Updated 5 months ago
- 🔥🔥 First-ever hour-scale video understanding models☆170Updated 3 weeks ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆35Updated 2 months ago
- A multi-modal RAG project with a dataset from Honor of Kings, one of the most popular smartphone games in China☆51Updated 2 months ago
- [CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge☆122Updated 4 months ago
- ☆127Updated last year
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆234Updated 6 months ago
- A modified LLaVA framework for MOSS2 that makes MOSS2 a multimodal model.☆12Updated 2 months ago
- DeepSpeed tutorial, annotated examples, and study notes (efficient training of large models)☆118Updated last year