[ICML 2026] ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark
☆153May 4, 2026Updated last month
Alternatives and similar repositories for Zooming-without-Zooming
Users that are interested in Zooming-without-Zooming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of POEM (Out-of-distribution detection with posterior sampling), ICML 2022☆28May 6, 2023Updated 3 years ago
- ☆24Sep 12, 2024Updated last year
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆12Apr 4, 2025Updated last year
- [SCI-FM@ICLR 2025] Specialized LLMs capable of handling various diabetes tasks☆57Oct 20, 2025Updated 7 months ago
- [ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆62May 26, 2026Updated 2 weeks ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- EMMA [TMLR 2025]☆13Sep 25, 2025Updated 8 months ago
- Official repository Flash Local Linear Attention☆36May 28, 2026Updated last week
- Official Pytorch implementation of NeuralWalker (ICLR 2025)☆40Jun 25, 2025Updated 11 months ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 7 months ago
- [MICCAI2023] XSurv: Merging-Diverging Hybrid Transformer Networks for Survival Prediction☆11Oct 2, 2023Updated 2 years ago
- ☆20Nov 27, 2025Updated 6 months ago
- Local AI runtime for training & running small LLMs directly on Apple Neural Engine (ANE). No CoreML. No Metal. Offline, on-device fine-tu…☆97Mar 6, 2026Updated 3 months ago
- Image Classification Tutorial: ConvNext--> 98.8% on CIFAR10 + 92.4% on CIFAR100; ResNet18 -- 95.6% on CIFAR10 + 79.1% on CIFAR100☆15Jun 2, 2025Updated last year
- Simple and Ideal Circuit Simulation☆13Dec 4, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- “SURE: SUrvey REcipes for building reliable and robust deep networks” (CVPR 2024) & (ECCV 2024 OOD-CV Challenge Winner)☆76Aug 21, 2025Updated 9 months ago
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆24Mar 25, 2026Updated 2 months ago
- BusterX and BusterX++☆40Mar 9, 2026Updated 3 months ago
- ☆23Mar 17, 2026Updated 2 months ago
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆26Jun 11, 2025Updated 11 months ago
- Pytorch implementation of "SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery"☆63Mar 17, 2026Updated 2 months ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆97Mar 9, 2026Updated 3 months ago
- [ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection☆51Jul 30, 2025Updated 10 months ago
- Official pytorch implementation of 'Relation-aware Language-Graph Transformer for Question Answering' (AAAI 2023)☆18Apr 25, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning☆40May 7, 2026Updated last month
- Code for IJCAI 2023 paper 'SLViT: Scale-Wise Language-Guided Vision Transformer for Referring Image Segmentation'☆11May 28, 2023Updated 3 years ago
- [ICML'26] Scaling Long-Horizon LLM Agent via Context-Folding☆161May 18, 2026Updated 3 weeks ago
- CHEMSMART: Chemistry Simulation and Modeling Automation Toolkit☆37Updated this week
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆59May 28, 2025Updated last year
- Computing calibrated prediction intervals for neural network regressors☆10May 28, 2019Updated 7 years ago
- Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]☆192Mar 30, 2026Updated 2 months ago
- ☆51Jan 27, 2026Updated 4 months ago
- This repository contains the code for UNETR: Transformers for 3D Medical Image Segmentation [1]. UNETR is the first 3D segmentation netwo…☆16Jul 8, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj…☆39Feb 20, 2026Updated 3 months ago
- ☆22Jan 9, 2026Updated 5 months ago
- ☆13Jan 14, 2026Updated 4 months ago
- ☆45Oct 23, 2025Updated 7 months ago
- SpeedVision is an AI-powered tool that detects and calculates vehicle speed from video footage using YOLO-based object detection and fram…☆12Sep 22, 2024Updated last year
- Use 2 lines to empower absolute time awareness for Qwen2.5VL's MRoPE☆29Sep 20, 2025Updated 8 months ago
- Wind visualization over time☆101Oct 23, 2025Updated 7 months ago