AI-confused / arxiv_auto_crawlerLinks
auto scrawl for arrive data
☆16Updated 3 years ago
Alternatives and similar repositories for arxiv_auto_crawler
Users that are interested in arxiv_auto_crawler are comparing it to the libraries listed below
Sorting:
- ☆37Updated last year
- [ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition"☆26Updated 3 years ago
- ☆30Updated 6 months ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆55Updated 4 months ago
- The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". And this paper is unde…☆65Updated 2 months ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆191Updated last year
- ☆91Updated last year
- ☆43Updated 8 months ago
- Keras implement of Finite Scalar Quantization☆83Updated last year
- Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model☆273Updated last year
- ☆48Updated last year
- [COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources☆274Updated last month
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆31Updated 4 months ago
- ☆56Updated 4 months ago
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆45Updated 5 months ago
- ☆72Updated 2 years ago
- ☆28Updated 4 months ago
- Product1M☆88Updated 3 years ago
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆35Updated 3 months ago
- WuDaoMM this is a data project☆74Updated 3 years ago
- LMM solved catastrophic forgetting, AAAI2025☆44Updated 6 months ago
- A light-weight script for maintaining a LOT of machine learning experiments.☆92Updated 2 years ago
- A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).☆20Updated 11 months ago
- [MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models☆288Updated 3 months ago
- The SAIL-VL2 series model developed by the BytedanceDouyinContent Group☆64Updated last month
- ☆69Updated 4 months ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆122Updated last year
- ☆21Updated last year
- ChineseCLIP using online learning☆13Updated 2 years ago
- The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…☆19Updated last year