Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]
☆188Mar 30, 2026Updated last month
Alternatives and similar repositories for Penguin-VL
Users that are interested in Penguin-VL are comparing it to the libraries listed below.
- The official repo for the DanQing dataset.☆35Mar 25, 2026Updated last month
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆17Mar 18, 2026Updated 2 months ago
- Toy-scale unified multimodal model experiments — encoder-free understanding & generation with Mixture-of-Transformers on MLX/Apple Silico…☆42Mar 8, 2026Updated 2 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆99Mar 15, 2026Updated 2 months ago
- Implementation of End-to-End YOLO Models☆10Dec 30, 2025Updated 4 months ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆22Dec 21, 2025Updated 4 months ago
- We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…☆74Apr 12, 2026Updated last month
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…☆85Jan 27, 2025Updated last year
- [CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆129Apr 7, 2026Updated last month
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆47Apr 22, 2026Updated 3 weeks ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆56Mar 12, 2026Updated 2 months ago
- Official code repository of '3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model'☆56Mar 20, 2026Updated 2 months ago
- Triton Migration Guide for DeepStreamSDK.☆15Dec 19, 2023Updated 2 years ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆43Jan 29, 2026Updated 3 months ago
- Official Implementation of SAGE-GRPO: Manifold-Aware Exploration for Reinforcement Learning in Video Generation☆115Apr 2, 2026Updated last month
- Modular CLI pipeline for fine‑tuning RF‑DETR object detection models on custom datasets.☆34Dec 3, 2025Updated 5 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud(通义点金:阿里云金融大模型)☆447May 10, 2026Updated last week
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆113Jan 14, 2026Updated 4 months ago
- Evaluation codes and data for GenEval2☆72Jan 8, 2026Updated 4 months ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 5 months ago
- ☆47Apr 4, 2026Updated last month
- Optimizing Monocular Depth Estimation with TensorRT: Model Conversion, Inference Acceleration, and 3D Reconstruction☆46Mar 9, 2026Updated 2 months ago
- ☆116Dec 28, 2025Updated 4 months ago
- Just prepare config file and start training your metric learning model with ease☆16Apr 2, 2024Updated 2 years ago
- Open-vocabulary Semantic Segmentation☆33Feb 16, 2024Updated 2 years ago
- [ICML 2026] ZwZ model family: SOTA fine-grained perception performance; ZoomBench: a new challenging perception benchmark☆134May 4, 2026Updated 2 weeks ago
- Implementation of paper - RepVGG-GELAN: ENHANCED GELAN WITH VGG-STYLE CONVNETS FOR BRAIN TUMOR DETECTION☆10Jul 19, 2025Updated 10 months ago
- A replication of Google's VideoPoet model☆12Feb 18, 2024Updated 2 years ago
- ACL24☆11Jun 7, 2024Updated last year
- Official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated 2 years ago
- Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"☆23May 8, 2026Updated last week
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Apr 12, 2026Updated last month
- ☆12Aug 10, 2022Updated 3 years ago
- ☆71Nov 18, 2024Updated last year
- CAD - Memory Efficient Convolutional Adapter for Segment Anything☆12Oct 4, 2024Updated last year
- ☆11Jul 26, 2024Updated last year
- [Neurocomputing] Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation☆25Dec 21, 2025Updated 4 months ago
- We propose to tackle the multiview photometric stereo problem using an extension of Neural Radiance Fields (NeRFs), conditioned on light …☆11Jan 11, 2023Updated 3 years ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆69Jul 22, 2025Updated 9 months ago