neverbiasu / IELTSDuck
☆15Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for IELTSDuck
- Diffusion Transformers (DiTs) trained on MNIST dataset☆53Updated 7 months ago
- finetune stable diffusion with Dreambooth、LoRA、ControlNet☆51Updated last year
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆81Updated 6 months ago
- [ACL 2024 Best Paper] Deciphering Oracle Bone Language with Diffusion Models☆83Updated last month
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆32Updated last month
- ☆215Updated 7 months ago
- 包含程序员面试大厂面试题和面试经验☆98Updated 2 months ago
- ☆26Updated 7 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆100Updated last month
- 个人项目地址,一些大语言模型和多模态模型的应用☆117Updated last week
- ☆67Updated 6 months ago
- DeepSpeed Tutorial☆89Updated 3 months ago
- 用大模型批量处理数据,现支持各种大模型做OCR,支持通义千问, 月之暗面, 百度飞桨OCR, OpenAI 和LLAVA。Use LLM to generate or clean data for academic use. Support OCR with qwen, m…☆9Updated last month
- 通义千问的DPO训练☆27Updated last month
- ☆30Updated 4 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆38Updated 2 months ago
- 🔥🔥First-ever hour scale video understanding models☆156Updated 2 weeks ago
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smart phone games in China☆51Updated 2 months ago
- [CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge☆121Updated 3 months ago
- 对llama3进行全参微调、lora微调以及qlora微调。☆149Updated last month
- ☆99Updated 7 months ago
- ☆74Updated 4 months ago
- ☆23Updated 2 weeks ago
- 这是一个高效,快捷的arXiv论文爬虫,它可以将指定时间范围,指定主题,包含指定关键词的论文信息爬取到本地,并且将其中的标题和摘要翻译成中文。☆32Updated 2 months ago
- pytorch复现stable diffusion☆124Updated last year
- A collection list of AIGC detection related papers.☆64Updated last month
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆51Updated last week
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆49Updated 4 months ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆230Updated 2 months ago
- ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆93Updated 3 months ago