Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]
☆104Mar 9, 2026Updated this week
Alternatives and similar repositories for Penguin-VL
Users that are interested in Penguin-VL are comparing it to the libraries listed below
Sorting:
- The official repo for the DanQing dataset.☆30Jan 16, 2026Updated last month
- ☆56Nov 12, 2025Updated 3 months ago
- [NAACL 2025 Main] AgentMove: A Large Language Model based Agentic Framework for Zero-shot Next Location Prediction.☆44Jul 26, 2025Updated 7 months ago
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆38Updated this week
- ☆24Feb 4, 2026Updated last month
- Just prepare config file and start training your metric learning model with ease☆16Apr 2, 2024Updated last year
- ☆14Mar 2, 2026Updated last week
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆30Feb 24, 2026Updated 2 weeks ago
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…☆86Jan 27, 2025Updated last year
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆29Feb 4, 2026Updated last month
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- Resources for the Enigmata Project.☆79Aug 13, 2025Updated 6 months ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆33Nov 11, 2025Updated 4 months ago
- Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".☆128Jan 24, 2026Updated last month
- Modern normalizing flows in Python. Simple to use and easily extensible.☆12Feb 11, 2026Updated 3 weeks ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 5 months ago
- 在 Mirai Console 中使用MCL管理包和其他高级功能☆10Nov 13, 2022Updated 3 years ago
- Creating Your Divine Agent 😇☆10Jan 26, 2026Updated last month
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆24Jan 4, 2026Updated 2 months ago
- Code accompanying our ICML 2020 paper on choice set optimization in group decision-making.☆11Jun 27, 2020Updated 5 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- All-in-One Safety Evaluation Framwork☆42Updated this week
- Python solutions to coding questions in Leetcode☆13Sep 12, 2020Updated 5 years ago
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆15Jul 15, 2025Updated 7 months ago
- Pythonic Nvidia Codec Library☆17Feb 23, 2026Updated 2 weeks ago
- ☆12Apr 13, 2019Updated 6 years ago
- My templates used in OI. All C++.☆11Jul 17, 2018Updated 7 years ago
- Color detection, Contour mapping, Detecting holes, Motion detection☆10Mar 20, 2014Updated 11 years ago
- ☆12Nov 2, 2021Updated 4 years ago
- Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"☆20Feb 20, 2026Updated 2 weeks ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆16Aug 15, 2025Updated 6 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 4 months ago
- Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control☆37Feb 22, 2026Updated 2 weeks ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 3 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated last month
- We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…☆65Feb 18, 2026Updated 2 weeks ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated 8 months ago
- A scalable data preprocessing framework built on PySpark for LLM training☆23Dec 9, 2025Updated 3 months ago