SxJyJay / Lumen
[NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities
☆25Sep 27, 2024Updated last year
Alternatives and similar repositories for Lumen
Users who are interested in Lumen are comparing it to the libraries listed below
- [ACM MM 2024] ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack☆14Dec 20, 2024Updated last year
- [ACM MM2023] Code Release of GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos☆12Mar 29, 2024Updated last year
- EventHallusion: Diagnosing Event Hallucinations in Video LLMs☆34Aug 5, 2025Updated 6 months ago
- ☆24Oct 28, 2024Updated last year
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆105Apr 23, 2025Updated 9 months ago
- [ECCV 2022] MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes official implementation☆16Feb 2, 2023Updated 3 years ago
- [MM24 Oral] Identity-Driven Multimedia Forgery Detection via Reference Assistance☆119Jul 27, 2025Updated 6 months ago
- Adversarial Examples Detection Benchmark☆17Dec 6, 2024Updated last year
- [CVPR 2023] MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection☆203May 20, 2023Updated 2 years ago
- A curated list of Survey Papers on Deep Learning.☆11Sep 5, 2023Updated 2 years ago
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆13Aug 2, 2023Updated 2 years ago
- ☆21Jan 17, 2025Updated last year
- Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.☆52Jan 7, 2026Updated last month
- Code release for ECCV 2022 paper "RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds"☆26Mar 24, 2023Updated 2 years ago
- Official repository of paper: "FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation"☆25Mar 2, 2023Updated 2 years ago
- ☆28Oct 20, 2023Updated 2 years ago
- Code of Pyramid Vision Transformer at BMVC 2022☆27Jun 7, 2023Updated 2 years ago
- OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models☆54Feb 1, 2026Updated last week
- ☆30Mar 2, 2023Updated 2 years ago
- ☆33Nov 15, 2024Updated last year
- ☆32Mar 25, 2024Updated last year
- ☆34May 2, 2022Updated 3 years ago
- The official implementation for the 1st-place winner solution of GRSS DFC 2025 track1 'All Weather Land Cover Mapping'☆14Apr 17, 2025Updated 9 months ago
- Official repo of Griffon series including v1(ECCV 2024), v2(ICCV 2025), G, and R, and also the RL tool Vision-R1.☆249Aug 12, 2025Updated 6 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆37Nov 27, 2024Updated last year
- ☆93Dec 15, 2025Updated last month
- Tools for working with Long Short-Term Memory (LSTM) networks and sequences in PyTorch☆36Jan 29, 2021Updated 5 years ago
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs☆68Jul 1, 2025Updated 7 months ago
- Communication Relay by creating a WiFi Mesh Network using ROS, and using that network for Data Telemetry, with Telemetry radios ( Ubiquit…☆11Dec 18, 2018Updated 7 years ago
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep…☆12Jul 26, 2023Updated 2 years ago
- [CVPR 2021] FMO Deblurring Benchmark☆13Jan 12, 2022Updated 4 years ago
- ☆10Oct 13, 2024Updated last year
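- Chinese edition of a comprehensive Python resource list, covering web frameworks, web crawlers, web content extraction, template engines, databases, data visualization, image processing, text processing, natural language processing, machine learning, logging, code analysis, and more☆11May 24, 2016Updated 9 years ago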
- Code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models"☆28Jan 27, 2026Updated 2 weeks ago
- ☆41Sep 21, 2023Updated 2 years ago
- Automatically constructed lexical database for Bangla inspired by WordNet☆11Jul 12, 2012Updated 13 years ago
- Chapter-wise notebooks for the book 'Practical Natural Language Processing'☆10Apr 21, 2020Updated 5 years ago
- The official code for ICCV 2023 paper "Reconstructing Groups of People with Hypergraph Relational Reasoning"☆12Jul 4, 2025Updated 7 months ago
- 2D Gaussian splatting for image compression☆17Nov 29, 2024Updated last year