Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]
☆192Mar 30, 2026Updated 2 months ago
Alternatives and similar repositories for Penguin-VL
Users that are interested in Penguin-VL are comparing it to the libraries listed below.
- The official repo for the DanQing dataset.☆36Mar 25, 2026Updated 2 months ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆18Mar 18, 2026Updated 2 months ago
- SIBench: a project on visual spatial reasoning tasks☆26Jan 12, 2026Updated 4 months ago
- ☆71May 2, 2026Updated last month
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆106Mar 15, 2026Updated 2 months ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆26Dec 21, 2025Updated 5 months ago
- We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…☆74Apr 12, 2026Updated last month
- [CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆130Apr 7, 2026Updated 2 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆49Apr 22, 2026Updated last month
- Northwestern Polytechnical University 2024 Undergraduate Graduation Design Thesis LaTeX Template☆12Sep 26, 2025Updated 8 months ago
- Evaluation codes and data for GenEval2☆73Jan 8, 2026Updated 5 months ago
- ☆49Updated this week
- Optimizing Monocular Depth Estimation with TensorRT: Model Conversion, Inference Acceleration, and 3D Reconstruction☆49Mar 9, 2026Updated 2 months ago
- Just prepare config file and start training your metric learning model with ease☆16May 20, 2026Updated 2 weeks ago
- SpeedVision is an AI-powered tool that detects and calculates vehicle speed from video footage using YOLO-based object detection and fram…☆12Sep 22, 2024Updated last year
- Code for AAAI 2025 paper "OTLRM: Orthogonal Learning-based Low-Rank Metric for Multi-Dimensional Inverse Problems".☆18Dec 21, 2024Updated last year
- Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models☆108Jan 14, 2026Updated 4 months ago
- Open-vocabulary Semantic Segmentation☆33Feb 16, 2024Updated 2 years ago
- [ICML 2026] ZwZ model family: SOTA fine-grained perception performance; ZoomBench: a new challenging perception benchmark☆153May 4, 2026Updated last month
- Implementation of paper - RepVGG-GELAN: ENHANCED GELAN WITH VGG-STYLE CONVNETS FOR BRAIN TUMOR DETECTION☆10Jul 19, 2025Updated 10 months ago
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆16Jul 15, 2025Updated 10 months ago
- A replication of Google's VideoPoet model☆12Feb 18, 2024Updated 2 years ago
- ☆12Aug 10, 2022Updated 3 years ago
- DEYOv1.5☆29Jul 22, 2024Updated last year
- ☆71Nov 18, 2024Updated last year
- ☆11Jul 26, 2024Updated last year
- [CVPR2026 🌟] The first attempt at Marine Open Vocabulary Instance Segmentation☆51May 8, 2026Updated last month
- Audio-video joint generation☆58Nov 27, 2025Updated 6 months ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 5 months ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆70Jul 22, 2025Updated 10 months ago
- Sample for encrypting/decrypting ONNX models using pyca/cryptography☆16Mar 19, 2022Updated 4 years ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆62Nov 4, 2025Updated 7 months ago
- A paper list of panoptic segmentation using deep learning☆12Sep 5, 2021Updated 4 years ago
- Official PyTorch implementation for "Where You Edit is What You Get: Text-Guided Image Editing with Region-Based Attention" (Pattern Reco…☆10Oct 1, 2024Updated last year
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"☆24Sep 3, 2025Updated 9 months ago
- [ICCV 2025] SAM4D: Segment Anything in Camera and LiDAR Streams☆231Sep 23, 2025Updated 8 months ago
- ☆14Jan 2, 2025Updated last year
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆108Oct 6, 2025Updated 8 months ago
- The official implementation of "Label-efficient Semantic Scene Completion with Scribble Annotations" (IJCAI 2024)☆14Jul 27, 2024Updated last year