[NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text
☆37Jan 31, 2026Updated 4 months ago
Alternatives and similar repositories for Pancap
Users that are interested in Pancap are comparing it to the libraries listed below.
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆35Apr 8, 2025Updated last year
- (NeurIPS 2024) Official repository of paper "Grasp as You Say: Language-guided Dexterous Grasp Generation"☆65Mar 30, 2026Updated 2 months ago
- ☆25Jul 24, 2024Updated last year
- The official implementation of MotionGrasp☆38Nov 15, 2025Updated 6 months ago
- (CVPR 2026) Official repository of paper "WeDetect: Fast Open-Vocabulary Object Detection as Retrieval"☆213Updated this week
- [IJCV 2024] Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks☆15Aug 30, 2024Updated last year
- ☆45Jun 21, 2024Updated last year
- Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right"☆117Feb 5, 2026Updated 4 months ago
- [ICLR2025] HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts☆21Aug 1, 2025Updated 10 months ago
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- Implementation for paper "Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models".☆105May 17, 2026Updated 3 weeks ago
- Official Code for Dexterous Grasp Transformer (CVPR 2024)☆66Oct 13, 2025Updated 7 months ago
- ☆22May 26, 2025Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 3 months ago
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".☆15Dec 4, 2025Updated 6 months ago
- [ACL2026 Findings] "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"☆20Mar 25, 2025Updated last year
- SIOD: Single Instance Annotated Per Category Per Image for Object Detection (single-instance annotated object detection)☆27Apr 4, 2022Updated 4 years ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆48Mar 18, 2026Updated 2 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆21Jul 10, 2025Updated 10 months ago
- [NeurIPS 2025] Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation☆60Dec 18, 2025Updated 5 months ago
- Chatbot_Doc module of the Chatbot_CN project☆19May 17, 2020Updated 6 years ago
- (ECCV 2024) Official implementation of the Economic 6-DoF Grasp Detection Framework (EconomicGrasp).☆115Apr 12, 2026Updated last month
- VLM benchmarks for robot manipulation tasks☆22Apr 30, 2025Updated last year
- [NeurIPS 2023] Diversifying Spatial-Temporal Perception for Video Domain Generalization☆17Oct 26, 2023Updated 2 years ago
- ☆41Mar 6, 2026Updated 3 months ago
- The first work for cross-domain open-vocabulary action recognition with a benchmark☆21May 27, 2024Updated 2 years ago
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated 2 years ago
- ☆18Jul 8, 2025Updated 11 months ago
- Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation☆45Aug 5, 2025Updated 10 months ago
- [AAAI 2024] MLNet: Mutual Learning Network with Neighborhood Invariance for Universal Domain Adaptation☆21Feb 29, 2024Updated 2 years ago
- This is the official repo for [CVPR 2025] paper, Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipul…☆31Mar 31, 2025Updated last year
- TensorRT for RefineNet Segmentation☆12Apr 27, 2021Updated 5 years ago
- [SIGGRAPH Asia 2025] Official Implementation of "ConsistEdit: Highly Consistent and Precise Training-free Visual Editing"☆72Apr 8, 2026Updated 2 months ago
- PyTorch implementation of the paper `Image Super-Resolution Using Very Deep Residual Channel Attention Networks`.☆15Dec 6, 2022Updated 3 years ago
- Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models☆63Jan 3, 2026Updated 5 months ago
- ☆16Nov 2, 2016Updated 9 years ago
- Code for the paper "Conditional Representation Learning for Customized Tasks" (NeurIPS 2025 Spotlight)☆46Oct 11, 2025Updated 7 months ago
- Official PyTorch implementation of the ICML 2024 paper "Hyperbolic Active Learning for Semantic Segmentation under Domain Shift"☆25Nov 26, 2024Updated last year
- This is the official code repo for GLOVER and GLOVER++.☆55Aug 6, 2025Updated 10 months ago