Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]
☆195Mar 30, 2026Updated 2 months ago
Alternatives and similar repositories for Penguin-VL
Users who are interested in Penguin-VL are comparing it to the repositories listed below.
- The official repo for the DanQing dataset.☆36Mar 25, 2026Updated 3 months ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆19Mar 18, 2026Updated 3 months ago
- This is a project on visual spatial reasoning tasks-SIBench☆26Jan 12, 2026Updated 5 months ago
- ☆75May 2, 2026Updated last month
- Implementation of End-to-End YOLO Models☆10Dec 30, 2025Updated 5 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆107Mar 15, 2026Updated 3 months ago
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…☆85Jan 27, 2025Updated last year
- [CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆135Apr 7, 2026Updated 2 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆50Apr 22, 2026Updated 2 months ago
- WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation☆157Jun 18, 2026Updated last week
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆58Mar 12, 2026Updated 3 months ago
- Official code repository of '3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model'☆59Mar 20, 2026Updated 3 months ago
- Triton Migration Guide for DeepStreamSDK.☆15Dec 19, 2023Updated 2 years ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆43Jan 29, 2026Updated 5 months ago
- Official Implementation of SAGE-GRPO:Manifold-Aware Exploration for Reinforcement Learning in Video Generation☆125Apr 2, 2026Updated 2 months ago
- ☆50Jun 4, 2026Updated 3 weeks ago
- Evaluation codes and data for GenEval2☆74Jan 8, 2026Updated 5 months ago
- Optimizing Monocular Depth Estimation with TensorRT: Model Conversion, Inference Acceleration, and 3D Reconstruction☆49Mar 9, 2026Updated 3 months ago
- ☆116Dec 28, 2025Updated 6 months ago
- SimKO: Simple Pass@K Policy Optimization☆30Oct 24, 2025Updated 8 months ago
- [ICML 2026] ZwZ model family: SOTA fine-grained perception performance; ZoomBench: a new challenging perception benchmark☆160May 4, 2026Updated last month
- Implementation of paper - RepVGG-GELAN: ENHANCED GELAN WITH VGG-STYLE CONVNETS FOR BRAIN TUMOR DETECTION☆10Jul 19, 2025Updated 11 months ago
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆16Jul 15, 2025Updated 11 months ago
- ACL24☆11Jun 7, 2024Updated 2 years ago
- A replication of Google's VideoPoet model☆12Feb 18, 2024Updated 2 years ago
- Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"☆23May 8, 2026Updated last month
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆13Jun 21, 2026Updated last week
- ☆12Aug 10, 2022Updated 3 years ago
- DEYOv1.5☆29Jul 22, 2024Updated last year
- We propose to tackle the multiview photometric stereo problem using an extension of Neural Radiance Fields (NeRFs), conditioned on light …☆11Jan 11, 2023Updated 3 years ago
- [ACMMM 2025] "Casual3DHDR: High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos"☆27Sep 26, 2025Updated 9 months ago
- [IJCAI 2023] CLE-ViT: Contrastive Learning Encoded Transformer for Ultra-Fine-Grained Visual Categorization.☆10Nov 3, 2023Updated 2 years ago
- Official implementation of "Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model"☆271Apr 25, 2026Updated 2 months ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 6 months ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆70Jul 22, 2025Updated 11 months ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆65Nov 4, 2025Updated 7 months ago
- (CVPR 2026 Highlight) Official repository for Scone (Subject-driven COmposition and DistinctioN Enhancement) model, supporting subject co…☆32Apr 9, 2026Updated 2 months ago
- Official PyTorch implementation for "Where You Edit is What You Get: Text-Guided Image Editing with Region-Based Attention" (Pattern Reco…☆10Oct 1, 2024Updated last year
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"☆24Sep 3, 2025Updated 9 months ago