☆13May 27, 2026Updated this week
Alternatives and similar repositories for Arcana
Users that are interested in Arcana are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆21Jul 2, 2025Updated 10 months ago
- RS Generate dataset☆18Jan 2, 2025Updated last year
- [ICLR 2025] Official PyTorch implementation of "DECO: Query-Based End-to-End Object Detection with ConvNets"☆64Jan 23, 2025Updated last year
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".☆14Dec 4, 2025Updated 5 months ago
- ☆45Apr 16, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of the paper Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs☆12Jun 7, 2025Updated 11 months ago
- ☆15Dec 9, 2024Updated last year
- [CVPR 2023] Spatial-then-Temporal Self-Supervised Learning for Video Correspondence☆11Jul 5, 2023Updated 2 years ago
- [TMM 2025] This is the official Pytorch code for our paper "Visual Position Prompt for MLLM based Visual Grounding".☆30Jul 23, 2025Updated 10 months ago
- Code for paper "Spider: Any-to-Many Multimodal LLM"☆15Apr 26, 2025Updated last year
- [TPAMI]CTNet: Context-based Tandem Network for Semantic Segmentation☆16Jun 15, 2022Updated 3 years ago
- CVPR2024 highlight.☆13Oct 10, 2024Updated last year
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆144May 11, 2026Updated 2 weeks ago
- Codura is an intelligent code assistant designed to supercharge your IDE with context-aware code completion, inline explanations, test ca…☆42Jul 13, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Jan 18, 2024Updated 2 years ago
- Strategy for crypto trading bot freqtrade and some useful utils☆39Nov 17, 2024Updated last year
- ☆14Sep 6, 2024Updated last year
- Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion☆12Jan 14, 2026Updated 4 months ago
- A collection of resources and papers on diffusion models of video generation.☆10Feb 11, 2023Updated 3 years ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation☆14Sep 14, 2023Updated 2 years ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".☆18Feb 23, 2026Updated 3 months ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 11 months ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official repo of the paper titled DeH4R: A Decoupled and Hybrid Method for Road Network Graph Extraction.☆23Updated this week
- Unofficial mirror☆12Jul 13, 2017Updated 8 years ago
- ☆22May 16, 2023Updated 3 years ago
- ☆13Sep 14, 2022Updated 3 years ago
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Jul 11, 2024Updated last year
- Official implementation of the paper: "NeoBabel: A Multilingual Open Tower for Visual Generation"☆23Aug 4, 2025Updated 9 months ago
- [ICLR 2026] The official repo of "MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs"☆38Mar 11, 2026Updated 2 months ago
- ☆12Mar 28, 2022Updated 4 years ago
- ☆11Feb 2, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)☆15Apr 23, 2024Updated 2 years ago
- Implementation of 'FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention'☆11Mar 27, 2024Updated 2 years ago
- [IEEE T-BIOM] FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆20Jan 15, 2026Updated 4 months ago
- Normalizing Flows with Multi-Scale Autoregressive Priors (CVPR 2020)☆16Jul 22, 2022Updated 3 years ago
- Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)☆12May 7, 2025Updated last year
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation☆29May 27, 2025Updated last year
- The SpaceNet 7 Baseline Algorithm☆15Jun 30, 2022Updated 3 years ago