[ICML 2026] ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark
☆134May 4, 2026Updated 2 weeks ago
Alternatives and similar repositories for Zooming-without-Zooming
Users that are interested in Zooming-without-Zooming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (ICML 2024) PyTorch implementation of "Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes"☆16Oct 15, 2024Updated last year
- A model combining Deep Neural Networks and (Stochastic) Random Forests.☆14Jun 5, 2018Updated 7 years ago
- A simple visual test-time scaling method for GUI agent grounding☆25Dec 7, 2025Updated 5 months ago
- ☆24Sep 12, 2024Updated last year
- [SCI-FM@ICLR 2025] Specialized LLMs capable of handling various diabetes tasks☆56Oct 20, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆62Jan 28, 2026Updated 3 months ago
- EMMA [TMLR 2025]☆12Sep 25, 2025Updated 7 months ago
- Official repository Flash Local Linear Attention☆23Apr 23, 2026Updated 3 weeks ago
- Official Pytorch implementation of NeuralWalker (ICLR 2025)☆39Jun 25, 2025Updated 10 months ago
- ☆26Feb 13, 2026Updated 3 months ago
- ☆23Aug 20, 2024Updated last year
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 7 months ago
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban…☆29Jul 15, 2025Updated 10 months ago
- [MICCAI2023] XSurv: Merging-Diverging Hybrid Transformer Networks for Survival Prediction☆11Oct 2, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆21Nov 27, 2025Updated 5 months ago
- Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).☆25Aug 15, 2024Updated last year
- Image Classification Tutorial: ConvNext--> 98.8% on CIFAR10 + 92.4% on CIFAR100; ResNet18 -- 95.6% on CIFAR10 + 79.1% on CIFAR100☆15Jun 2, 2025Updated 11 months ago
- Simple and Ideal Circuit Simulation☆13Dec 4, 2017Updated 8 years ago
- “SURE: SUrvey REcipes for building reliable and robust deep networks” (CVPR 2024) & (ECCV 2024 OOD-CV Challenge Winner)☆75Aug 21, 2025Updated 8 months ago
- BusterX and BusterX++☆38Mar 9, 2026Updated 2 months ago
- ☆23Apr 29, 2025Updated last year
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆149Apr 15, 2026Updated last month
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆92Mar 9, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Pytorch implementation of "SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery"☆62Mar 17, 2026Updated 2 months ago
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆26Jun 11, 2025Updated 11 months ago
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆31Nov 13, 2025Updated 6 months ago
- MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning☆40May 7, 2026Updated last week
- Scaling Long-Horizon LLM Agent via Context-Folding☆155Jan 26, 2026Updated 3 months ago
- Setup scripts for the WebArena benchmark☆22Jun 19, 2025Updated 11 months ago
- CHEMSMART: Chemistry Simulation and Modeling Automation Toolkit☆36May 12, 2026Updated last week
- Computing calibrated prediction intervals for neural network regressors☆10May 28, 2019Updated 6 years ago
- Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]☆188Mar 30, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is an official PyTorch implementation of ASDA (accepted by ACMMM 2024).☆26Oct 22, 2024Updated last year
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated 3 months ago
- ☆18May 14, 2025Updated last year
- Code and data for NAACL 2022 paper Few-Shot Document-Level Relation Extraction☆27Nov 10, 2022Updated 3 years ago
- This repository contains the code for UNETR: Transformers for 3D Medical Image Segmentation [1]. UNETR is the first 3D segmentation netwo…☆15Jul 8, 2022Updated 3 years ago
- ☆13Jan 14, 2026Updated 4 months ago
- ☆23Jan 9, 2026Updated 4 months ago