AI-confused / arxiv_auto_crawler
Automatic crawler for arXiv data
☆15Updated 3 years ago
Alternatives and similar repositories for arxiv_auto_crawler:
Users who are interested in arxiv_auto_crawler are comparing it to the repositories listed below
- GPU task queuing☆2Updated last year
- ChineseCLIP using online learning☆13Updated 2 years ago
- A light-weight script for maintaining a LOT of machine learning experiments.☆91Updated 2 years ago
- ☆20Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆80Updated 9 months ago
- The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…☆19Updated last year
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆44Updated 4 months ago
- [ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition"☆23Updated 2 years ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated 7 months ago
- Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".☆57Updated last year
- Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original c…☆34Updated 5 months ago
- Product1M☆87Updated 2 years ago
- A curated list of resources about long-context in large-language models and video understanding.☆30Updated last year
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆43Updated 3 years ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆56Updated last month
- Server GPU monitoring program that sends a notification via WeChat when GPU properties meet preset conditions☆31Updated 3 years ago
- Official repository of MMDU dataset☆89Updated 6 months ago
- ☆36Updated 9 months ago
- WuDaoMM: a data project☆73Updated 2 years ago
- ☆45Updated 7 months ago
- ☆69Updated last year
- Efficient Mixture of Experts for LLM Paper List☆62Updated 4 months ago
- ☆91Updated last year
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆22Updated last year
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆39Updated 6 months ago
- ☆22Updated last year
- [ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)☆121Updated last year
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆31Updated last month
- ☆66Updated last year
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search"☆24Updated last month