The first reimplementation of the paperswithcode website.
☆92Sep 12, 2025Updated 6 months ago
Alternatives and similar repositories for papers-with-code
Users that are interested in papers-with-code are comparing it to the libraries listed below.
- PyTorch Implementation of DISN: Deep Implicit Surface Network for High-quality Single-view 3D Reconstruction (Xu et al., 2019)☆11Jun 6, 2021Updated 4 years ago
- Official implementation of CVPR 2020 paper "Front2Back: Single View 3D Shape Reconstruction via Front to Back Prediction"☆12Aug 20, 2021Updated 4 years ago
- // Personal résumé☆10Feb 25, 2019Updated 7 years ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆37Mar 8, 2026Updated 2 weeks ago
- ☆13Feb 26, 2024Updated 2 years ago
- ☆21Sep 10, 2024Updated last year
- GraspFast: Multi-stage Lightweight 6-DoF Grasp Pose Detection with RGB-D Image☆24Jun 20, 2025Updated 9 months ago
- ☆12Mar 13, 2021Updated 5 years ago
- Temporal and Causal Reasoning (dataset)☆10Apr 19, 2022Updated 3 years ago
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆37Feb 21, 2026Updated last month
- Implementation of "Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors" ECCV'20 paper.☆14Feb 14, 2021Updated 5 years ago
- MixMatch Domain Adaptation: Prize-winning solution for both tracks of VisDA 2019 challenge☆23Mar 24, 2023Updated 3 years ago
- VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding☆54Updated this week
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Jan 1, 2026Updated 2 months ago
- ☆15Aug 12, 2022Updated 3 years ago
- An official implementation of "Gaussian Herding Across Pens: an optimal transport perspective on global gaussian reduction for 3DGS"☆42Dec 17, 2025Updated 3 months ago
- Global Ionospheric Mapping with GNSS☆17Aug 9, 2023Updated 2 years ago
- A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning☆37Mar 12, 2026Updated 2 weeks ago
- ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model☆16Jan 31, 2024Updated 2 years ago
- This is the released code for Single-view 3D Mesh Reconstruction for Seen and Unseen Categories☆21Jun 4, 2023Updated 2 years ago
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆16Oct 25, 2024Updated last year
- Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors☆38Aug 7, 2025Updated 7 months ago
- [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering☆46Jun 19, 2025Updated 9 months ago
- A Multi-Agent Approach Integrating Socratic Guidance for Automated Prompt Optimization☆18Dec 15, 2025Updated 3 months ago
- ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Feb 14, 2025Updated last year
- [ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…☆36Jul 26, 2024Updated last year
- Code for the SIGGRAPH 2021 paper "Consistent Depth of Moving Objects in Video".☆14Aug 27, 2023Updated 2 years ago
- Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features☆76Apr 8, 2025Updated 11 months ago
- Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering [ACM MM'24]☆10Jul 22, 2024Updated last year
- Code and Data for "SCTc-TE: A Comprehensive Formulation and Benchmark for Temporal Event Forecasting"☆16Feb 2, 2024Updated 2 years ago
- The source code of Mem-Gallery: Benchmarking Multimodal Long-Term Conversational Memory for MLLM Agents.☆38Jan 31, 2026Updated last month
- Multigranularity Contrastive cross-modal collaborative Generation (MCG) model for Video QA☆11Dec 13, 2023Updated 2 years ago
- Optimizer classes for aslam_cv, kalibr, aslam_incremental_calibration, ..☆38Mar 1, 2023Updated 3 years ago
- PyTorch implementation of SketchSampler: Sketch-based 3D Reconstruction via View-dependent Depth Sampling, ECCV2022.☆28Jul 19, 2022Updated 3 years ago
- The public reproducible analysis code used for the gaze project☆11Feb 21, 2026Updated last month
- Front-end code for a WeChat mini program for learning English☆20Mar 26, 2020Updated 6 years ago
- ☆31May 13, 2023Updated 2 years ago
- Nifi Processors for ingesting and converting geo data using GeoMesa and GeoTools☆33Mar 16, 2026Updated last week
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year