The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆16Sep 12, 2025Updated 6 months ago
Alternatives and similar repositories for PixelWorld
Users who are interested in PixelWorld are comparing it to the repositories listed below.
- [AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning☆12Dec 10, 2023Updated 2 years ago
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆16Aug 3, 2025Updated 7 months ago
- STFNet: Self-supervised Transformer for Infrared and Visible Image Fusion☆12Mar 25, 2024Updated last year
- Highly specialized crate to parse and use `google/sentencepiece`'s precompiled_charsmap in `tokenizers`☆21Jan 8, 2026Updated 2 months ago
- ☆11Jun 28, 2024Updated last year
- ☆40Jan 12, 2026Updated 2 months ago
- ☆17Oct 8, 2024Updated last year
- ☆17Feb 20, 2024Updated 2 years ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 7 months ago
- ☆20May 14, 2024Updated last year
- Code for our paper: Learning Camera Movement Control from Real-World Drone Videos☆35Apr 16, 2025Updated 11 months ago
- Dynamic Importance Sampling☆14Feb 13, 2022Updated 4 years ago
- A vocoder based on PC-DDSP and nsf-HiFiGAN☆18Jul 17, 2023Updated 2 years ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Feb 14, 2025Updated last year
- AAAI 2025 | A2RNet: Adversarial Attack Resilient Network for Robust Infrared and Visible Image Fusion☆31Oct 10, 2025Updated 5 months ago
- ☆12Mar 22, 2025Updated 11 months ago
- ☆11Sep 1, 2024Updated last year
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras☆34Sep 22, 2024Updated last year
- ☆14May 20, 2025Updated 10 months ago
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆41Dec 27, 2023Updated 2 years ago
- HeyGem for RTX50 series☆42May 7, 2025Updated 10 months ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- This is the official implementation of the ICCV 2023 paper "Learning A Room with the Occ-SDF Hybrid: Signed Distance Function Mingled with…☆11Dec 7, 2023Updated 2 years ago
- Code for the AAAI-accepted paper "Similarity Distribution based Membership Inference Attack on Person Re-Identification"☆11Sep 29, 2024Updated last year
- Have an AI debate against you on any topic of your choosing☆15Oct 13, 2024Updated last year
- decontamination☆26Mar 4, 2026Updated 2 weeks ago
- ☆47Dec 11, 2023Updated 2 years ago
- ☆19Mar 8, 2023Updated 3 years ago
- ☆11Feb 7, 2025Updated last year
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆30Feb 25, 2021Updated 5 years ago
- [TCSVT2023] [LASNet] RGB-T Semantic Segmentation with Location, Activation, and Sharpening☆31Jan 13, 2026Updated 2 months ago
- A collection of important papers on Generalizable Diffusion-generated Image Detection☆17Mar 20, 2025Updated last year
- LocalHost of PIA in Windows☆14Dec 25, 2023Updated 2 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- ☆10Apr 8, 2024Updated last year
- Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving☆16Feb 10, 2025Updated last year
- An asymmetric 1v1 multiplayer game using Unreal Engine☆18Feb 25, 2017Updated 9 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year