yujunhuics/Reyes

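2025.01: Built a multimodal large model from scratch and named it Reyes (睿视; R stands for 睿, "wise", and eyes for 眼, "eye"). Reyes has 8B parameters. Its vision encoder is InternViT-300M-448px-V2_5 and its language model is Qwen2.5-7B-Instruct; the two are connected through a two-layer MLP projection layer. 2026.01: reyes-0.6B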
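
The description above mentions a two-layer MLP projection layer between the vision encoder and the language model. As a rough illustration of that idea only, here is a minimal pure-Python sketch. It is not taken from the Reyes repository: the class name `MLPProjector`, the GELU activation, the random initialization, and the toy dimensions are all assumptions chosen for readability.

```python
import math
import random


def gelu(x: float) -> float:
    """GELU activation (an assumed choice; the source does not name one)."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def make_linear(in_dim, out_dim, rng):
    """Return (weights, bias) for a linear layer with small random weights."""
    scale = 1.0 / math.sqrt(in_dim)
    weights = [[rng.uniform(-scale, scale) for _ in range(in_dim)]
               for _ in range(out_dim)]
    bias = [0.0] * out_dim
    return weights, bias


def linear(x, layer):
    """Apply y = W x + b to a single feature vector."""
    weights, bias = layer
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


class MLPProjector:
    """Hypothetical two-layer MLP mapping visual tokens to the text embedding size."""

    def __init__(self, vision_dim, hidden_dim, text_dim, seed=0):
        rng = random.Random(seed)
        self.fc1 = make_linear(vision_dim, hidden_dim, rng)
        self.fc2 = make_linear(hidden_dim, text_dim, rng)

    def __call__(self, visual_tokens):
        projected_tokens = []
        for token in visual_tokens:
            hidden = [gelu(h) for h in linear(token, self.fc1)]
            projected_tokens.append(linear(hidden, self.fc2))
        return projected_tokens


# Toy sizes for illustration; real encoder and language-model widths are far larger.
VISION_DIM, HIDDEN_DIM, TEXT_DIM = 8, 16, 12
projector = MLPProjector(VISION_DIM, HIDDEN_DIM, TEXT_DIM)

# Four identical stand-in "visual tokens" in place of real encoder output.
visual_tokens = [[0.1 * i for i in range(VISION_DIM)] for _ in range(4)]
projected = projector(visual_tokens)
print(len(projected), len(projected[0]))  # 4 12
```

In a setup like this, the projected tokens would be placed alongside the text token embeddings before being passed to the language model; how Reyes does this in practice is not stated on this page.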

☆34

Alternatives and similar repositories for Reyes

Users who are interested in Reyes are comparing it to the repositories listed below.

taishan1994 / MiniClip
Train a simple CLIP model hands-on to deepen your understanding of CLIP.
☆27 · May 20, 2025 · Updated last year
MengLcool / DeepStack-VL
[NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…
☆93 · Jun 17, 2024 · Updated 2 years ago
miaoshuyu / awesome-attention-pytorch
Brief implementations of attention blocks such as SeNet, CBAM, DANet, A2attention, and so on.
☆10 · May 11, 2020 · Updated 6 years ago
libing64 / Qwen2.5-VL-Fine-Tuning
☆34 · Mar 2, 2025 · Updated last year
WillDreamer / Awesome-MLLM-Reasoning
Recent Advances on MLLM's Reasoning Ability
☆26 · Apr 11, 2025 · Updated last year
NVlabs / FRAG
☆15 · Apr 25, 2025 · Updated last year
MaskerPRC / auto-vl-spider
Uses multimodal large model technology to automatically generate web scraping code.
☆22 · Jan 7, 2025 · Updated last year
justus-comnets / 5g-campus-measurements
☆12 · Aug 17, 2022 · Updated 3 years ago
med-air / HeteroPFL
[ICLR'24] Heterogeneous Personalized Federated Learning by Local-Global Updates Mixing via Convergence Rate
☆13 · Jun 17, 2025 · Updated last year
morning-hao / domain-self-instruct
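Inspired by self-instruct: beyond general-purpose LLMs, small domain-specific LLMs can also be built for customized results, using questions and answers obtained from GPT as training data.
☆18 · May 12, 2023 · Updated 3 years ago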
SCNU203 / GeoQA-Plus
☆20 · May 14, 2024 · Updated 2 years ago
alibaba-damo-academy / VL-Cogito
☆24 · Nov 4, 2025 · Updated 8 months ago
dhyuan99 / VecKM_flow
Official GitHub repo for Learning Normal Flow Directly from Event Neighborhoods (ICCV2025). It is an easy-to-use API for event-based norm…
☆23 · Oct 5, 2025 · Updated 9 months ago
LehengTHU / AdvInfoNCE
[NeurIPS 2023] The implementation of paper "Empowering Collaborative Filtering Generalization via Principled Adversarial Contrastive Loss…
☆21 · Feb 21, 2024 · Updated 2 years ago
opendatalab / image-downloader
☆31 · May 13, 2024 · Updated 2 years ago
Sherrylife / FedLMT
[ICML2024] "FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees" by Jiaha…
☆14 · Sep 22, 2024 · Updated last year
lucasjinreal / Namo-R1
A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.
☆256 · Apr 22, 2025 · Updated last year
itsOwen / BetterNet
BetterNet is a state-of-the-art deep learning model for accurate and efficient polyp segmentation in medical images. It combines Efficien…
☆14 · May 8, 2024 · Updated 2 years ago
Ancientshi / ERM4
Enhancing Retrieval and Managing Retrieval: 4-Module Synergy
☆23 · Dec 7, 2024 · Updated last year
thunlp / Muffin
☆65 · Feb 5, 2024 · Updated 2 years ago
pagand / e2etransfuser
☆17 · Sep 9, 2024 · Updated last year
om-ai-lab / VLM-FO1
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
☆330 · Jun 18, 2026 · Updated last month
NKU-MetautoAI / awesome-large-vision-language-models
Advances in recent large vision language models (LVLMs)
☆15 · Sep 23, 2024 · Updated last year
naseemap47 / CustomActionRecognition-TensorFlow-CNN-LSTM
Creating Custom Action Recognition Model using TensorFlow (CNN + LSTM)
☆12 · Feb 22, 2023 · Updated 3 years ago
newocean-group / T-Rex2
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
☆29 · May 13, 2025 · Updated last year
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆68 · Mar 27, 2023 · Updated 3 years ago
sunshineatnoon / Single-Layer-CNN-on-MNIST
A single-layer CNN on MNIST that achieves an accuracy of 97.24%
☆11 · Jun 12, 2015 · Updated 11 years ago
Vincent-ZHQ / Comprehensive-Long-Video-Understanding-Survey
A survey on MM-LLMs for long video understanding: From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long…
☆23 · Sep 12, 2025 · Updated 10 months ago
reachpranjal / lego-drive
[Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective
☆28 · Apr 4, 2024 · Updated 2 years ago
nttmdlab-nlp / VDocRAG
[CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents
☆66 · May 26, 2025 · Updated last year
tub-rip / event_penguins
The official implementation of "Low-power, Continuous Remote Behavioral Localization with Event Cameras" (CVPR 2024)
☆13 · Sep 25, 2024 · Updated last year
tub-rip / event_collapse
On solutions to the problem of Event Collapse in Motion Compensation frameworks
☆15 · Jan 21, 2023 · Updated 3 years ago
kyrieLei / Critic-V
☆18 · Apr 23, 2025 · Updated last year
lemon-little / BetterSynth
Tianchi Better Synth multimodal large model data synthesis challenge: a solution that counts as a success if it beats the baseline.
☆30 · Oct 30, 2025 · Updated 8 months ago
spicywagyu04 / CADParser
☆29 · May 30, 2024 · Updated 2 years ago
geshang777 / Seg-R1
[NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"
☆72 · Jul 1, 2025 · Updated last year
Ekoda / SoftMoE
Soft Mixture of Experts Vision Transformer, addressing MoE limitations as highlighted by Puigcerver et al., 2023.
☆16 · Aug 13, 2023 · Updated 2 years ago
samakos / Document-AI-
☆14 · Aug 31, 2023 · Updated 2 years ago
LingyvKong / OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆265 · Apr 14, 2025 · Updated last year