[NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities
☆25 Sep 27, 2024 Updated last year
Alternatives and similar repositories for Lumen
Users that are interested in Lumen are comparing it to the libraries listed below
- [ACM MM 2024] ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack ☆14 Dec 20, 2024 Updated last year
- [ACM MM2023] Code Release of GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos ☆12 Mar 29, 2024 Updated last year
- EventHallusion: Diagnosing Event Hallucinations in Video LLMs ☆34 Aug 5, 2025 Updated 7 months ago
- ☆24 Oct 28, 2024 Updated last year
- [AAAI2022] Code Release of Attacking Video Recognition Models with Bullet-Screen Comments ☆25 Mar 30, 2024 Updated last year
- [MM24 Oral] Identity-Driven Multimedia Forgery Detection via Reference Assistance ☆119 Jul 27, 2025 Updated 7 months ago
- Adversarial Examples Detection Benchmark ☆17 Dec 6, 2024 Updated last year
- [CVPR 2023] MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection ☆203 May 20, 2023 Updated 2 years ago
- [CVPR 2024] Official implementation of CVPR 2024 paper: "Doubly Abductive Counterfactual Inference for Text-based Image Editing" ☆25 Mar 8, 2024 Updated last year
- A curated list of Survey Papers on Deep Learning. ☆11 Sep 5, 2023 Updated 2 years ago
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding ☆13 Aug 2, 2023 Updated 2 years ago
- ☆21 Jan 17, 2025 Updated last year
- Open-source red teaming framework for MLLMs with 37+ attack methods ☆226 Updated this week
- Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark. ☆52 Jan 7, 2026 Updated last month
- ☆28 Oct 20, 2023 Updated 2 years ago
- Official repository of paper: "FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation" ☆26 Mar 2, 2023 Updated 3 years ago
- [ICLR 2025] BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks ☆30 Nov 2, 2025 Updated 4 months ago
- Code of Pyramid Vision Transformer at BMVC 2022 ☆27 Jun 7, 2023 Updated 2 years ago
- ☆33 Nov 15, 2024 Updated last year
- ☆32 Mar 25, 2024 Updated last year
- Code segments often used in deep learning algorithms (PyTorch/NumPy) ☆28 Aug 28, 2020 Updated 5 years ago
- ☆35 May 2, 2022 Updated 3 years ago
- Official repo of Griffon series including v1(ECCV 2024), v2(ICCV 2025), G, and R, and also the RL tool Vision-R1. ☆249 Aug 12, 2025 Updated 6 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model ☆37 Nov 27, 2024 Updated last year
- Tools for working with Long Short-Term Memory (LSTM) networks and sequences in Pytorch ☆36 Jan 29, 2021 Updated 5 years ago
- FocusLLM: Scaling LLM’s Context by Parallel Decoding ☆44 Dec 8, 2024 Updated last year
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep… ☆12 Jul 26, 2023 Updated 2 years ago
- Communication Relay by creating a WiFi Mesh Network using ROS, and using that network for Data Telemetry, with Telemetry radios ( Ubiquit… ☆11 Dec 18, 2018 Updated 7 years ago
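- A comprehensive Chinese-language list of Python resources, covering web frameworks, web crawlers, web content extraction, template engines, databases, data visualization, image processing, text processing, natural language processing, machine learning, logging, code analysis, and more ☆11 May 24, 2016 Updated 9 years ago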
- ☆10 Oct 13, 2024 Updated last year
- Material for my lectures at the University of Oslo, Dec 2014 ☆16 Jul 25, 2015 Updated 10 years ago
- Project focused on enhancing the quality of low-fidelity endoscopy images using Generative Adversarial Networks (GANs) implemented in PyT… ☆17 Jun 5, 2025 Updated 9 months ago
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs ☆69 Jul 1, 2025 Updated 8 months ago
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024) ☆186 Jul 5, 2024 Updated last year
- ☆41 Sep 21, 2023 Updated 2 years ago
- The official code for ICCV 2023 paper "Reconstructing Groups of People with Hypergraph Relational Reasoning" ☆12 Jul 4, 2025 Updated 8 months ago
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca. ☆34 Dec 16, 2025 Updated 2 months ago
- Demo of using Airflow ☆11 Jun 24, 2022 Updated 3 years ago
- Chapter-wise notebooks for the book 'Practical Natural Language Processing' ☆10 Apr 21, 2020 Updated 5 years ago