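Companion teaching resources for the book "Multimodal Large Models: A New-Generation Artificial Intelligence Technology Paradigm" (《多模态大模型:新一代人工智能技术范式》)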
☆293May 28, 2026Updated 2 weeks ago
Alternatives and similar repositories for Book-of-MLM
Users that are interested in Book-of-MLM are comparing it to the libraries listed below.
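- A Practical Guide to Large Language Models: Applications and Real-World Deployment☆88Sep 13, 2024Updated last year
- Scrapes paper lists from ICCV and similar conference web pages and fetches the related arXiv material☆14Oct 19, 2023Updated 2 years ago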
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated 2 years ago
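- 🧑🚀 Summary of the world's best LLM resources (multimodal generation, agents, coding assistance, AI paper review, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).☆8,483Updated this week
- 2025.01: A multimodal large model implemented from scratch and named Reyes (睿视; R for 睿, "wise", and eyes for 眼, "eye"). Reyes has 8B parameters, uses InternViT-300M-448px-V2_5 as its vision encoder and Qwen2.5-7B-Instruct on the language-model side; Reyes also, through a two…☆34Feb 10, 2026Updated 4 months ago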
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation".☆29Mar 26, 2024Updated 2 years ago
- Notes on multimodal knowledge for large language model (LLM) algorithm (application) engineers☆285May 12, 2024Updated 2 years ago
- Implementation for What it Thinks is Important is Important: Robustness Transfers through Input Gradients (CVPR 2020 Oral)☆16Mar 24, 2023Updated 3 years ago
- Open Source Road Datasets☆19Aug 30, 2024Updated last year
- Multi-task Learning for Multi-modal Emotion Recognition and Sentiment Analysis☆13Mar 17, 2021Updated 5 years ago
- ☆24Apr 16, 2022Updated 4 years ago
- UGRoadUpd: An unchanged-guided road updating framework based on remotely sensed imagery☆12Mar 15, 2023Updated 3 years ago
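- A Qwen-VL model that can be successfully fine-tuned with LoRA☆16Oct 27, 2023Updated 2 years ago
- The book's author is Yutaro Ogawa (小川熊太郎) from Japan; the source code on the author's GitHub has Japanese comments, and this repository translates them into Chinese☆22Dec 2, 2020Updated 5 years ago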
- A multimodal model bridging vision and genomics for biodiversity monitoring at scale.☆19May 11, 2026Updated 3 weeks ago
- Large Language Model in Action☆344Jan 28, 2025Updated last year
- ☆22Nov 5, 2024Updated last year
- Implementation for "Change Event Dataset for Discovery from Spatio-temporal Remote Sensing Imagery"☆17Aug 9, 2022Updated 3 years ago
- chatglm on multiple GPUs with deepspeed and☆409Jul 8, 2024Updated last year
- [ICCV 2023] Official repository of paper titled "Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?"☆27Sep 20, 2023Updated 2 years ago
- ☆11Aug 20, 2025Updated 9 months ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆508May 10, 2024Updated 2 years ago
- https://haa.boyuai.com☆95Dec 8, 2025Updated 6 months ago
- ☆39Aug 26, 2025Updated 9 months ago
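- A most basic, beginner-friendly introduction to natural language processing, based on deepseek-r1, covering both traditional NLP and modern large models☆28Jan 16, 2026Updated 4 months ago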
- [TVCG 2024] Official implementation of "JIMR: Joint Semantic and Geometry Learning for Point Scene Instance Mesh Reconstruction"☆15Jan 7, 2026Updated 5 months ago
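- This project shares the technical principles behind large models along with hands-on experience (large-model engineering and application deployment)☆24,452May 25, 2026Updated 2 weeks ago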
- 😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D☆67Jul 27, 2025Updated 10 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆21Nov 24, 2025Updated 6 months ago
- This is the official code for "Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning"☆24Apr 2, 2026Updated 2 months ago
- ☆62May 30, 2026Updated last week
- CSANet: Cross-Temporal Interaction Symmetric Attention Network for Hyperspectral Image Change Detection☆12Sep 13, 2022Updated 3 years ago
- ☆20Jul 23, 2024Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆71Feb 28, 2024Updated 2 years ago
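- A WebSocket (WS) server adapted from the official FunASR demo, paired with FastAPI to provide an HTTP service, allowing real-time ASR testing in the browser☆53Aug 4, 2025Updated 10 months ago
- The Fruit Book; main topics include data structures and algorithms, operating systems, computer networks, databases, the Go language, architecture design, artificial intelligence, and more.☆12Jun 24, 2021Updated 4 years ago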
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆22Jun 23, 2025Updated 11 months ago
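- Coursework for an introduction-to-software course: a Gomoku (five-in-a-row) mini-game with player-vs-AI, LAN multiplayer, and local two-player modes☆14Nov 17, 2022Updated 3 years ago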