The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆16Sep 12, 2025Updated 5 months ago
Alternatives and similar repositories for PixelWorld
Users that are interested in PixelWorld are comparing it to the libraries listed below
Sorting:
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 6 months ago
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆16Aug 3, 2025Updated 6 months ago
- [AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning☆12Dec 10, 2023Updated 2 years ago
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Feb 14, 2025Updated last year
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆20Jan 8, 2026Updated last month
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras☆34Sep 22, 2024Updated last year
- ☆39Jan 12, 2026Updated last month
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆30Feb 25, 2021Updated 5 years ago
- This is llvm-nmx backend compiler☆12Aug 22, 2023Updated 2 years ago
- [ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes☆20Feb 17, 2026Updated last week
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆28Feb 18, 2026Updated last week
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Code for our paper: Learning Camera Movement Control from Real-World Drone Videos☆35Apr 16, 2025Updated 10 months ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 3 months ago
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- [CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆82Feb 13, 2026Updated 2 weeks ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- ☆53Feb 10, 2025Updated last year
- DragMesh: Interactive 3D Generation Made Easy☆20Dec 28, 2025Updated 2 months ago
- A Simple, Explainable Vision Language Model for detecting manifacturing defects into products☆14Sep 23, 2025Updated 5 months ago
- 用于深度哈希图像检索和深度哈希跨模态检索的性能评估算法的计算脚本☆13Oct 30, 2024Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- A powerful integration that combines Browserbase's Stagehand with Mastra for advanced web automation, scraping, and AI-powered web intera…☆33Feb 4, 2026Updated 3 weeks ago
- ☆10Jan 23, 2025Updated last year
- ☆17Aug 5, 2025Updated 6 months ago
- ☆10Oct 2, 2024Updated last year
- A Simple PyTorch Lightning implementation of Masked Autoencoder☆15Jun 29, 2023Updated 2 years ago
- Tusk Drift Demo - Node.js Service☆58Jan 20, 2026Updated last month
- ☆11Feb 7, 2025Updated last year
- Community maintained hardware plugin for vLLM on AWS Neuron☆23Updated this week
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆41Dec 27, 2023Updated 2 years ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆114Dec 11, 2025Updated 2 months ago
- ICCV'23 | Adverse Weather Removal with Codebook Priors☆10Aug 28, 2023Updated 2 years ago
- MyStyle Custom Product Designer Plugin for Wordpress / Woo Commerce☆14Dec 12, 2025Updated 2 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jun 1, 2023Updated 2 years ago
- ☆10Sep 13, 2022Updated 3 years ago
- ☆12Apr 26, 2024Updated last year
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆10Apr 14, 2025Updated 10 months ago