2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两层MLP投影层连接视觉编码器与语言模型。2026.01:reyes-0.6B
☆31Feb 10, 2026Updated 3 weeks ago
Alternatives and similar repositories for Reyes
Users that are interested in Reyes are comparing it to the libraries listed below
Sorting:
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆80Jun 17, 2024Updated last year
- ☆35Mar 2, 2025Updated last year
- FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models☆10Dec 21, 2025Updated 2 months ago
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆59May 26, 2025Updated 9 months ago
- A part of the course Mobile Application Development☆13Nov 30, 2021Updated 4 years ago
- ☆10Aug 9, 2023Updated 2 years ago
- GestureX is an OpenCV-based hand motion sensing system for intuitive, efficient user control.This project aims to investigate the potenti…☆16Jun 29, 2024Updated last year
- [NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"☆61Jul 1, 2025Updated 8 months ago
- ☆15Nov 11, 2024Updated last year
- Panorama_498全景图像数据集☆14Apr 8, 2022Updated 3 years ago
- MRZ recognition from visa and passport documents.☆23Jan 13, 2026Updated last month
- 《大语言模型》综述全书学习笔记☆13Aug 2, 2024Updated last year
- MLOps Pipeline for Amazon Forecast written in AWS CDK☆11Apr 10, 2025Updated 11 months ago
- My blogs and code for machine learning. http://cnblogs.com/pinard☆13Jul 12, 2019Updated 6 years ago
- ☆12Jun 19, 2024Updated last year
- YOLO格式转为COCO格式。Convert data format from YOLO format to coco format☆15Nov 1, 2023Updated 2 years ago
- Agent to integrate Webdriver.io with ReportPortal.☆11Feb 12, 2026Updated 3 weeks ago
- ☆12Aug 17, 2022Updated 3 years ago
- A simple hello world Python application.☆11Jun 14, 2023Updated 2 years ago
- The Source Code of FRNet☆43Nov 7, 2022Updated 3 years ago
- 基于langchain和chatglm6b构建的智能问答系统,支持自定义语料☆10Jun 25, 2023Updated 2 years ago
- [AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering☆11Mar 10, 2023Updated 2 years ago
- A tiny search engine.☆13Sep 6, 2022Updated 3 years ago
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"☆12Oct 20, 2024Updated last year
- The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading t…☆20Jan 20, 2026Updated last month
- P^2HCT: Plug-and-Play Hierarchical C2F Transformer for Multi-Scale Feature Fusion☆24May 19, 2025Updated 9 months ago
- ☆14Sep 6, 2024Updated last year
- The resources for the paper "User Modeling with Click Preference and Reading Satisfaction for News Recommendation"