《多模态大模型:新一代人工智能技术范式》配套教学资源
☆301Jun 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for Book-of-MLM
Users that are interested in Book-of-MLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI☆2,093Jun 10, 2026Updated 3 weeks ago
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025)☆57Aug 12, 2025Updated 10 months ago
- 大型语言模型实战指南:应用实践与场景落地☆89Sep 13, 2024Updated last year
- VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis☆13Dec 26, 2024Updated last year
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆20Jul 6, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering☆28Apr 18, 2025Updated last year
- 《基于BERT模型的自然语言处理实战》随书代码☆17Jun 13, 2022Updated 4 years ago
- ☆14Oct 23, 2023Updated 2 years ago
- Transferable Feature Representation for Visible-to-Infrared Cross-Dataset Human Action Recognition (Complexity 2018)☆13Dec 14, 2022Updated 3 years ago
- 通用简单工具项目☆22Oct 6, 2024Updated last year
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Dec 19, 2023Updated 2 years ago
- 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25)☆78Jul 21, 2025Updated 11 months ago
- [ECCV 2024 Workshop🎈] The first agriculture benchmark to evaluate MM-LLMs.☆26Jan 1, 2025Updated last year
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆85Jun 16, 2026Updated 2 weeks ago
- CausalVLR: A Toolbox and Benchmark for Vision-Language Causal Reasoning (多模态因果推理开源框架)☆1,055Oct 11, 2025Updated 8 months ago
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆34Feb 10, 2026Updated 4 months ago
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation".☆30Mar 26, 2024Updated 2 years ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆288May 12, 2024Updated 2 years ago
- Offical implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS2024 Oral)☆36Jan 18, 2025Updated last year
- KALM: Keypoint Abstraction using Large Models for Object-Relative Imitation Learning, ICRA 2025 & CoRL 24 WS☆28Sep 2, 2025Updated 9 months ago
- Open Source Road Datasets☆19Aug 30, 2024Updated last year
- The official implementation of “Cross-Modal Causal Representation Learning for Radiology Report Generation” (IEEE T-IP 2025)☆68May 27, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 可以成功Lora微调的Qwen-VL模型☆16Oct 27, 2023Updated 2 years ago
- 本 书作者是来自日本的Yutaro Ogawa(小川熊太郎),作者的github上源码是日文注释的,这个repository把它翻译成中文☆22Dec 2, 2020Updated 5 years ago
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆78Jul 6, 2023Updated 2 years ago
- A computer vision system was built to detect objects in an indoor scene using point clouds using a deep learning approach. PyTorch was us…☆13Jun 28, 2021Updated 5 years ago
- Large Language Model in Action☆345Jan 28, 2025Updated last year
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆4,509Sep 2, 2025Updated 9 months ago
- Implementation for "Change Event Dataset for Discovery from Spatio-temporal Remote Sensing Imagery"☆17Aug 9, 2022Updated 3 years ago
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)☆37Mar 6, 2025Updated last year
- [ICCV 2023] Official repository of paper titled "Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?"☆27Sep 20, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Aug 20, 2025Updated 10 months ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆508May 10, 2024Updated 2 years ago
- Efficiently Combining Positions and Normals for Precise 3D Geometry☆30Apr 30, 2020Updated 6 years ago
- ☆40Aug 26, 2025Updated 10 months ago
- [TVCG 2024] Official implementation of "JIMR: Joint Semantic and Geometry Learning for Point Scene Instance Mesh Reconstruction”☆15Jan 7, 2026Updated 5 months ago
- [TIE 2024] SEE-CSOM: Sharp-Edged and Efficient Continous Semantic Occupancy Mapping through Multi-entropy Kernel Inference☆22Sep 8, 2024Updated last year
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆24,604Updated this week