CharlieDDDD/AISurveyPapers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CharlieDDDD/AISurveyPapers)

CharlieDDDD / AISurveyPapers

Large Visual Language Model(LVLM), Large Language Model(LLM), Multimodal Large Language Model(MLLM), Alignment, Agent, AI System, Survey

☆21

Alternatives and similar repositories for AISurveyPapers

Users that are interested in AISurveyPapers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lixsh6 / GraRetrieval-CIKM2020
View on GitHub
☆13Nov 9, 2021Updated 4 years ago
chuzhumin98 / ConvSearch-Dataset
View on GitHub
The homepage for ConvSearch Dataset.
☆14May 31, 2022Updated 4 years ago
EternityYW / Gemini-Commonsense-Evaluation
View on GitHub
Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"
☆38Jan 3, 2024Updated 2 years ago
hectorcarrion / FEDD
View on GitHub
Data & Code for FEDD published @ MICCAI 23
☆12Oct 11, 2023Updated 2 years ago
catid / minigpt4
View on GitHub
MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code
☆15Jun 12, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
swordlidev / Evaluation-Multimodal-LLMs-Survey
View on GitHub
A Survey on Benchmarks of Multimodal Large Language Models
☆156Jul 13, 2026Updated last week
marslanm / Multimodality-Representation-Learning
View on GitHub
This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…
☆85Jun 16, 2025Updated last year
LeeYN-43 / Clover
View on GitHub
Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)
☆39Feb 15, 2023Updated 3 years ago
bounverif / openx-assets
View on GitHub
Vehicle and traffic simulation assets using ASAM OpenX standards
☆15Sep 11, 2025Updated 10 months ago
noahzn / LISU
View on GitHub
Low-light Indoor Scene Understanding
☆16Dec 14, 2022Updated 3 years ago
jingtaozhan / IntelligenceTest
View on GitHub
An evaluation framework to test AI in a trial-and-error process. It is a simplified Natural Selection test.
☆22Mar 11, 2025Updated last year
zhenyuw16 / CompAgent_code
View on GitHub
Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".
☆18Jan 30, 2024Updated 2 years ago
Shadow-Dream / Reaction-Graph
View on GitHub
[ICML 2025] Official implementation of Reaction Graph: Towards Reaction-Level Modeling for Chemical Reactions with 3D Structures
☆16Sep 4, 2025Updated 10 months ago
hehefan / PST-Transformer
View on GitHub
☆23Mar 22, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
tommyMessi / text_render_pos
View on GitHub
带有位置信息的中文文本识别数据生成器
☆11Jan 28, 2021Updated 5 years ago
WangWenhao0716 / Awesome-Diffusion-Replication
View on GitHub
Replication in Visual Diffusion Models: A Survey and Outlook
☆32Apr 5, 2026Updated 3 months ago
kyegomez / PaLM2-VAdapter
View on GitHub
Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…
☆17Nov 11, 2024Updated last year
zhenwang9102 / X-MedRELA
View on GitHub
Source Code for ACL 2020 paper, "Rationalizing Medical Relation Prediction from Corpus-level Statistics"
☆11Sep 6, 2020Updated 5 years ago
WeiminXiong / RationaleCL
View on GitHub
Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)
☆12Oct 11, 2023Updated 2 years ago
TArdelean / AnomalyLocalizationFCA
View on GitHub
Official implementation of High-Fidelity Zero-Shot Texture Anomaly Localization Using Feature Correspondence Analysis.
☆12Dec 18, 2023Updated 2 years ago
cyzus / thoughtsculpt
View on GitHub
THOUGHTSCULPT, a general reasoning and search method for complex tasks
☆13Dec 13, 2024Updated last year
danaesavi / ImageChain
View on GitHub
This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…
☆15Jun 4, 2025Updated last year
ellenmellon / DIALKI
View on GitHub
DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization
☆10Aug 3, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kaishxu / DFMed
View on GitHub
Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)
☆14Nov 22, 2023Updated 2 years ago
iwangjian / pyloader
View on GitHub
🐳 PyLoader: An asynchronous Python dataloader for loading big datasets, supporting PyTorch and TensorFlow 2.x.
☆11Aug 29, 2021Updated 4 years ago
adityagilra / archibrain
View on GitHub
Synthesize bio-plausible neural networks for cognitive tasks, mimicking brain architecture
☆11Apr 14, 2021Updated 5 years ago
nowazrabbani / pMoE_CNN
View on GitHub
The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…
☆14Feb 12, 2026Updated 5 months ago
swordlidev / Efficient-Multimodal-LLMs-Survey
View on GitHub
Efficient Multimodal Large Language Models: A Survey
☆386Apr 29, 2025Updated last year
cxzhou35 / notion-arxiv-enhancer
View on GitHub
This project is my attempt at automating work in Notion.
☆17Aug 28, 2025Updated 10 months ago
Meteor-han / ReaMVP
View on GitHub
☆16Aug 5, 2024Updated last year
JunMa11 / PETCTSeg
View on GitHub
Automatic segmentation models for PET and CT scans
☆17Oct 13, 2022Updated 3 years ago
microsoft / MM-WebAgent
View on GitHub
Build coherent and visually polished multimodal webpages with hierarchical planning, AIGC tools, and iterative reflection.
☆15May 17, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
iwangjian / TRIP
View on GitHub
[TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
☆14Oct 18, 2025Updated 9 months ago
GraphPKU / LIFT
View on GitHub
The official implementation of LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning
☆15Mar 14, 2025Updated last year
AYOMITIDE-OAJ / OpenAI-mobile
View on GitHub
Mobile App Interface to interact with OpenAI (DALLE 2 and ChatGPT) open source tools
☆13Jan 16, 2023Updated 3 years ago
XTCHDU / anti_jamming
View on GitHub
☆12Jan 12, 2019Updated 7 years ago
fusiming3 / MARS
View on GitHub
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
☆86Jul 16, 2024Updated 2 years ago
hkust-nlp / model-task-align-rl
View on GitHub
[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆18Feb 9, 2026Updated 5 months ago
SupstarZh / WhitenedCSE
View on GitHub
[ACL2023] WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings
☆18Sep 12, 2023Updated 2 years ago