☆147Apr 30, 2026Updated this week
Alternatives and similar repositories for SenseNova-Skills
Users that are interested in SenseNova-Skills are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- research cli for agent☆167Updated this week
- Update playlist☆25Updated this week
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆31Mar 18, 2026Updated last month
- ☆10Dec 3, 2024Updated last year
- ☆16Sep 11, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆13Jul 3, 2024Updated last year
- [ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors☆87Updated this week
- ☆24Nov 21, 2025Updated 5 months ago
- ☆13May 15, 2025Updated 11 months ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- Debugging skill for AI agents☆249Updated this week
- An inference-time, plug-and-play method for temporal control in multi-event generation☆134Apr 26, 2026Updated last week
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- LLaVA-Next for STVG☆19Dec 5, 2025Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the paper: Network Decoupling: From Regular to Depthwise Separable Convolutions☆13Dec 9, 2018Updated 7 years ago
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆46Feb 10, 2026Updated 2 months ago
- https://avocado-captioner.github.io/☆34Oct 16, 2025Updated 6 months ago
- ☆17Mar 24, 2025Updated last year
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.☆72Aug 8, 2025Updated 8 months ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆20Jun 19, 2025Updated 10 months ago
- I wanted a node to save my prompts and optionally take in an external prompt from a llm and save it, didn't see one, so I made it.☆53Jan 18, 2026Updated 3 months ago
- LLM inference engine written in .NET☆410Apr 22, 2026Updated last week
- Collection of papers about video-audio understanding☆25Dec 26, 2025Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆38Nov 9, 2025Updated 5 months ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆22Mar 29, 2026Updated last month
- [NeurIPS 2025 Spotlight] Official PyTorch implementation of Vgent☆45Nov 30, 2025Updated 5 months ago
- [AAAI 2025] Open-source, End-to-end, Medical Image Segmentation model by Task allociation.☆35May 22, 2025Updated 11 months ago
- A benchmark suited especially for deep learning operators☆42Feb 13, 2023Updated 3 years ago
- CVPR25☆27Jul 2, 2025Updated 10 months ago
- Deep Learning - Code Hub: A repository for deep learning projects which includes simple basic functions, experimental projects and paper …☆18Apr 20, 2020Updated 6 years ago
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- Pose Extraction & Rendering for SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representat…☆194Dec 28, 2025Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders☆17Feb 11, 2025Updated last year
- [ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆62Jan 28, 2026Updated 3 months ago
- ☆29Jul 25, 2025Updated 9 months ago
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆31Nov 24, 2025Updated 5 months ago
- ☆17Aug 5, 2025Updated 8 months ago
- ☆15Apr 13, 2025Updated last year
- ☆108Apr 9, 2026Updated 3 weeks ago