AI-confused / arxiv_auto_crawlerLinks
auto scrawl for arrive data
☆16Updated 3 years ago
Alternatives and similar repositories for arxiv_auto_crawler
Users that are interested in arxiv_auto_crawler are comparing it to the libraries listed below
Sorting:
- ☆62Updated 6 months ago
- Keras implement of Finite Scalar Quantization☆83Updated 2 years ago
- pytorch单精度、半 精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆127Updated last year
- ☆118Updated 2 years ago
- ☆37Updated last year
- ☆33Updated last month
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆189Updated 2 years ago
- LMM solved catastrophic forgetting, AAAI2025☆44Updated 8 months ago
- The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". And this paper is unde…☆74Updated 4 months ago
- ☆70Updated 6 months ago
- Research Code for Multimodal-Cognition Team in Ant Group☆169Updated 2 months ago
- ☆42Updated 10 months ago
- ChineseCLIP using online learning☆13Updated 3 years ago
- ☆21Updated 2 months ago
- Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original c…☆45Updated last year
- Narrative movie understanding benchmark☆77Updated 6 months ago
- This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy☆49Updated last year
- ☆30Updated 8 months ago
- ☆91Updated 2 years ago
- Lion: Kindling Vision Intelligence within Large Language Models☆51Updated last year
- ☆72Updated 2 years ago
- Video dataset dedicated to portrait-mode video recognition.☆55Updated 2 months ago
- Official repository of MMDU dataset☆98Updated last year
- [ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition"☆29Updated 3 years ago
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆39Updated 5 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆112Updated 2 months ago
- A light-weight script for maintaining a LOT of machine learning experiments.☆92Updated 3 years ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆89Updated last year
- [COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources☆289Updated 3 months ago
- This is for ACL 2025 Findings Paper: From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalitiesModels☆78Updated last month