《多模态大模型:新一代人工智能技术范式》配套教学资源
☆301Jun 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for Book-of-MLM
Users that are interested in Book-of-MLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025)☆57Aug 12, 2025Updated 10 months ago
- 大型语言模型实战指南:应用实践与场景落地☆89Sep 13, 2024Updated last year
- 《基于BERT模型的自然语言处理实战》随书代码☆17Jun 13, 2022Updated 4 years ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆51Apr 27, 2025Updated last year
- 通用简单工具项目☆22Oct 6, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 从ICCV等网页上爬取论文列表,并获取ArXiv的相关资料☆14Oct 19, 2023Updated 2 years ago
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Dec 19, 2023Updated 2 years ago
- 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25)☆78Jul 21, 2025Updated 11 months ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated 2 years ago
- 🧑🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.☆8,584Jun 24, 2026Updated last week
- ☆85Jun 16, 2026Updated 2 weeks ago
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆34Feb 10, 2026Updated 4 months ago
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation".☆30Mar 26, 2024Updated 2 years ago
- [IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition☆29Jan 6, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆286May 12, 2024Updated 2 years ago
- ☆21Mar 1, 2022Updated 4 years ago
- Open Source Road Datasets☆19Aug 30, 2024Updated last year
- ☆37Aug 30, 2024Updated last year
- ☆24Apr 16, 2022Updated 4 years ago
- UGRoadUpd: An unchanged-guided road updating framework based on remotely sensed imagery☆12Mar 15, 2023Updated 3 years ago
- 可以成功Lora微调的Qwen-VL模型☆16Oct 27, 2023Updated 2 years ago
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆78Jul 6, 2023Updated 2 years ago
- Code and Model of the SwinTD_Net for Single Image Dehazing☆11Mar 23, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A computer vision system was built to detect objects in an indoor scene using point clouds using a deep learning approach. PyTorch was us…☆13Jun 28, 2021Updated 5 years ago
- Large Language Model in Action☆345Jan 28, 2025Updated last year
- ☆22Nov 5, 2024Updated last year
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆4,509Sep 2, 2025Updated 9 months ago
- Implementation for "Change Event Dataset for Discovery from Spatio-temporal Remote Sensing Imagery"☆17Aug 9, 2022Updated 3 years ago
- chatglm多gpu用deepspeed和☆409Jul 8, 2024Updated last year
- Evaluate robustness of adaptation methods on large vision-language models☆19Aug 23, 2023Updated 2 years ago
- Official release of FacialFlowNet: Advancing Facial Optical Flow Estimation with a Diverse Dataset and a Decomposed Model (ACMMM2024)☆27Nov 11, 2024Updated last year
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆508May 10, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [AAAI 2024] M3SOT: Multi-frame, Multi-field, Multi-space 3D Single Object Tracking☆16Apr 29, 2024Updated 2 years ago
- https://haa.boyuai.com☆100Dec 8, 2025Updated 6 months ago
- ☆40Aug 26, 2025Updated 10 months ago
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆30Jan 16, 2026Updated 5 months ago
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆24,604Updated this week
- 😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D☆65Jul 27, 2025Updated 11 months ago
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆21Nov 24, 2025Updated 7 months ago