CoreJT / NLPPapersSpiderLinks
☆10Updated 5 years ago
Alternatives and similar repositories for NLPPapersSpider
Users that are interested in NLPPapersSpider are comparing it to the libraries listed below
Sorting:
- ☆43Updated 7 months ago
- a thin wrapper of chatgpt for improving paper writing.☆253Updated 2 years ago
- ☆35Updated 4 years ago
- This is for ACL 2025 Findings Paper: From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalitiesModels☆80Updated last week
- ChatGPT - Review & Rebuttal: A browser extension for generating reviews and rebuttals, powered by ChatGPT. 利用 ChatGPT 生成审稿意见和回复的浏览器插件☆251Updated 2 years ago
- [MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models☆290Updated 5 months ago
- Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"☆26Updated last year
- Watch for idle GPUs and run your jobs: launches jobs in tmux, keeps logs/status and sends start/finish emails..☆79Updated 3 months ago
- ☆20Updated 5 months ago
- MLNLP社区用来帮助缩短参考文献的工具。A tool for simplifying bibtex with official info☆462Updated last year
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Updated last year
- Update 2020☆75Updated 3 years ago
- 一款便捷的抢占显卡脚本☆387Updated 2 weeks ago
- Modified LLaVA framework for MOSS2, and makes MOSS2 a multimodal model.☆13Updated last year
- Latest Papers, Codes and Datasets on VTG-LLMs.☆63Updated last month
- A python implement for Certifiable Robust Multi-modal Training☆19Updated 6 months ago
- [CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Lan…☆63Updated 9 months ago
- EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning [🔥The Exploration of R1 for General Audio-Vi…☆70Updated 7 months ago
- The official repo of our work "Pensieve: Retrospect-then-Compare mitigates Visual Hallucination"☆16Updated last year
- ☆23Updated 2 years ago
- Collection of awesome Continual Test-Time Adaptation methods☆23Updated last year
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆56Updated last year
- A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…☆76Updated 4 months ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆127Updated last year
- The offical implemention of JM3D.☆31Updated 4 months ago
- Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025☆93Updated 9 months ago
- [NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents☆318Updated last year
- [CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds☆56Updated 2 years ago
- ☆264Updated last year
- ☆46Updated 4 years ago