[ICLR 2026] Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
☆385Feb 18, 2026Updated 2 weeks ago
Alternatives and similar repositories for Puffin
Users that are interested in Puffin are comparing it to the libraries listed below
Sorting:
- Attention-based Deep Reinforcement Learning framework for portfolio allocation on S&P 500 equities. Includes custom environment, policy a…☆163Oct 16, 2025Updated 4 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆73Updated this week
- One-click synchronization tool for MCP configuration☆45Jan 29, 2026Updated last month
- ☆108Jan 9, 2022Updated 4 years ago
- VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization☆19Jan 17, 2025Updated last year
- ☆56Dec 8, 2025Updated 3 months ago
- A navigation algorithm based on CMU team's open-source local planner☆118Oct 9, 2025Updated 5 months ago
- ASID-Caption: Attribute-Structured and Quality-Verified Audiovisual Instruction Dataset and Training Pipeline for Fine-Grained Video Unde…☆35Updated this week
- 极不平衡样本下的预测☆40Oct 28, 2025Updated 4 months ago
- ☆112Oct 16, 2025Updated 4 months ago
- End-to-end tool for Grounded Theory research, featuring Segmentation, Open Coding, Gioia Method, Axial (CAR) Coding, Selective Coding, Ne…☆54Feb 24, 2026Updated last week
- Astron-xmod-shim — Lightweight, declarative middleware for reliably converging AI service workloads.☆100Nov 3, 2025Updated 4 months ago
- Official PyTorch implementation of GroupKAN: Rethinking Nonlinearity with Grouped Spline-based KAN Modeling for Efficient Medical Image S…☆86Nov 7, 2025Updated 4 months ago
- ☆29Aug 6, 2025Updated 7 months ago
- 一个在 JetBrains 上的插件:Tree Description 。可以为项目模块增加自定义备注,颜色分类、标注用途,还可以共享开源映射关系。☆212Jan 26, 2026Updated last month
- For dynamic target tracking in flight videos, applicable to various types of unmanned aerial vehicle systems☆84Dec 4, 2025Updated 3 months ago
- Codebase of GRPO: Implementations and Resources of GRPO and Its Variants☆225Dec 6, 2025Updated 3 months ago
- An educational Rust relational database (RDBMS) inspired by CMU 15445☆176Jan 19, 2026Updated last month
- RSeata - A Rust implementation of distributed transaction framework, supporting AT & XA modes, SeaORM integration, and gRPC-based context…☆86Nov 14, 2025Updated 3 months ago
- Are Video Models Ready as Zero-shot Reasoners?☆84Nov 24, 2025Updated 3 months ago
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy☆918Feb 27, 2026Updated last week
- Full life cycle cross providers serverless application management for your fast-growing business.☆87Updated this week
- 🍑 relsim: Relational Visual Similarity | pip install relsim 🌍 (CVPR 2026)☆65Feb 21, 2026Updated 2 weeks ago
- Official code for paper: "RayRoPE: Projective Ray Positional Encoding for Multi-view Attention"☆106Feb 25, 2026Updated last week
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".☆15Feb 23, 2026Updated 2 weeks ago
- [FG 2019 Oral] Attribute-Guided Sketch Generation☆10Jul 25, 2021Updated 4 years ago
- [NIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens☆20Oct 12, 2025Updated 4 months ago
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"☆165Sep 29, 2025Updated 5 months ago
- Official repo for "GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization"☆260Jan 20, 2026Updated last month
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"☆13Jun 17, 2024Updated last year
- An unofficial reproduction of Sela et al. "Computational caricaturization of surfaces". CVIU 2015.☆13Nov 27, 2020Updated 5 years ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆134Nov 4, 2025Updated 4 months ago
- DataMate is an enterprise-level data processing platform designed for model fine-tuning and RAG retrieval.☆330Updated this week
- Interactively browse multimodal tabular data☆104Feb 11, 2026Updated 3 weeks ago
- ☆72Oct 18, 2025Updated 4 months ago
- Feed-forward model for predicting 3D physics with 3DGS + NeRF☆280Updated this week
- [CVPR 2026] ZipMap: Linear-Time Stateful 3D Reconstruction via Test-Time Training☆178Updated this week
- [AAAI2025] This is the official PyTorch codes for the paper: "DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts"☆23Jun 16, 2025Updated 8 months ago