Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More
☆25Feb 25, 2025Updated last year
Alternatives and similar repositories for Patch_Scaling
Users that are interested in Patch_Scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆83Feb 27, 2025Updated last year
- Official PyTorch implementation of Agglomerative Token Clustering presented at ECCV 2024☆20Sep 19, 2024Updated last year
- ☆20Jan 23, 2024Updated 2 years ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆22Oct 8, 2024Updated last year
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆14Apr 15, 2024Updated 2 years ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs☆13Dec 28, 2024Updated last year
- Dynamic config system based on python classes☆12Jan 27, 2023Updated 3 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated 2 years ago
- [ICCV 2025] Official Implementation of Steering Rectified Flow Models in the Vector Field for Controlled Image Generation☆46Jun 27, 2025Updated last year
- ☆57Sep 28, 2023Updated 2 years ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆90May 30, 2025Updated last year
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…☆14Mar 28, 2025Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆29Apr 22, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2026 Highlight] PersonaVLM: Long-Term Personalized Multimodal LLMs☆108Apr 16, 2026Updated 2 months ago
- This is the official implementation of the paper “Griffin: Towards a Graph-Centric Relational Database Foundation Model.”☆40Sep 25, 2025Updated 9 months ago
- ☆136Jun 26, 2024Updated 2 years ago
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated 2 years ago
- [NeurIPS24] VisMin: Visual Minimal-Change Understanding☆19Mar 3, 2025Updated last year
- ☆25Apr 10, 2025Updated last year
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- ☆25Feb 14, 2025Updated last year
- cliptrase☆48Sep 1, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Kernel Herding for probability density estimation☆14Feb 23, 2016Updated 10 years ago
- Official implementation for RoMaP :Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampl…☆22Aug 5, 2025Updated 10 months ago
- ☆10Apr 8, 2021Updated 5 years ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆21Mar 13, 2025Updated last year
- ☆16Jul 9, 2025Updated 11 months ago
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each…☆107Jun 24, 2026Updated last week
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆18Oct 22, 2024Updated last year
- [WSDM 2025] Source code for "Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation".☆36Dec 22, 2024Updated last year
- ☆32Dec 1, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A yolov5 based application, it uses the prediction results by yolov5 to activate the selected opencv built-in tracking algorithm.☆10Jul 24, 2020Updated 5 years ago
- An Empirical Comparison of Unsupervised Constituency Parsing Methods☆14Aug 15, 2021Updated 4 years ago
- [AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Data☆38Apr 7, 2025Updated last year
- Official Implementation of paper "Distilling Long-tailed Datasets" [CVPR 2025]☆22Aug 13, 2025Updated 10 months ago
- ☆34Sep 19, 2025Updated 9 months ago
- Fast and Modularized CFG-focused Models☆23Nov 8, 2023Updated 2 years ago
- This is the source code for: Context-aware Entity Typing in Knowledge Graphs.☆16May 10, 2022Updated 4 years ago