[CVPR 2026] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
☆52Apr 11, 2026Updated this week
Alternatives and similar repositories for NUMINA
Users that are interested in NUMINA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆54Mar 13, 2026Updated last month
- ☆70Jan 12, 2026Updated 3 months ago
- Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agen…☆72Updated this week
- ☆25Oct 25, 2025Updated 5 months ago
- For FUN! Use at your own risk. No warranties, no exceptions.☆19Dec 8, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆13Sep 3, 2025Updated 7 months ago
- Overworld's local world client interface to run Waypoint world models☆84Updated this week
- WSGI library for simple web servers☆19Mar 27, 2026Updated 2 weeks ago
- ☆10Feb 23, 2026Updated last month
- Quickly select items in any modal using keyboard shortcuts. Supercharge your workflow with fast, efficient item selection in Obsidian mod…☆18Mar 2, 2026Updated last month
- External telegram feeder for AIL framework☆18Jan 21, 2026Updated 2 months ago
- the official code of DriveMonkey☆45Mar 20, 2026Updated 3 weeks ago
- Code for the paper Proactive Hearing Assistants that Isolate Egocentric Conversations☆43Nov 19, 2025Updated 4 months ago
- Fud AI is an open sourced and Free AI calorie tracker for iOS☆102Apr 1, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An Open Payment Gateway for Humans and AI Agents.☆48Nov 10, 2025Updated 5 months ago
- Animate Any Character in Any World☆97Mar 10, 2026Updated last month
- [ACMMM 2025] Officially implement of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompti…☆220May 7, 2025Updated 11 months ago
- Reinforcement Learning Framework for Visual Generation☆105Feb 13, 2026Updated 2 months ago
- [CVPR 2026] Official repo of "MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing“☆95Mar 19, 2026Updated 3 weeks ago
- A command-line interface (CLI) host platform that facilitates interactions between Large Language Models and external tools via the Model…☆20Nov 26, 2025Updated 4 months ago
- JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers☆17Jul 21, 2025Updated 8 months ago
- ☆12Jul 22, 2020Updated 5 years ago
- ☆12Oct 14, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Python tool to automate Google NotebookLM podcast creation with website/YouTube sources☆31May 30, 2025Updated 10 months ago
- [ICLR 26] Part-X-MLLM: Part-aware 3D Multimodal Large Language Model☆114Jan 26, 2026Updated 2 months ago
- Shows whether a YouTube channel or its videos are monetized or not, and provides details on their monetization status.☆17Dec 17, 2023Updated 2 years ago
- Simple MCP to give Claude ability to check current time as well as know when holidays are, what is the time distance between dates etc.☆28Feb 18, 2026Updated last month
- Get AI-powered daily news summaries directly in your Obsidian vault. Stay informed about your topics of interest with smart, automated ne…☆22Updated this week
- Source code for UP-Diff☆15Nov 26, 2024Updated last year
- UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions☆48Dec 16, 2025Updated 3 months ago
- [ICML'25] CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features☆122Aug 23, 2025Updated 7 months ago
- Hugging Face Jobs☆19Jul 11, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Colorizing Black & White images using GAN☆11Jan 13, 2023Updated 3 years ago
- 记录学习深度学习的一些☆17May 5, 2021Updated 4 years ago
- A collection of ComfyUI custom nodes based workflows and experiments.- Awesome smart way to work with nodes!☆17Nov 4, 2023Updated 2 years ago
- ComfyUI Assistant Custom Node☆22May 22, 2024Updated last year
- Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.☆37Jan 16, 2026Updated 2 months ago
- The Tiktok scraper application is designed to automate the process of extracting and processing Tiktok videos and their associated titles…☆17Jul 13, 2024Updated last year
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆55Apr 9, 2025Updated last year