junchen14 / Awesome_ChatGPT_papersLinks
This repository will collect and share awesome ChatGPT related papers and useful tools
☆18Updated 2 years ago
Alternatives and similar repositories for Awesome_ChatGPT_papers
Users that are interested in Awesome_ChatGPT_papers are comparing it to the libraries listed below
Sorting:
- Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…☆124Updated 3 months ago
- Multimodal-Procedural-Planning☆92Updated 2 years ago
- A curated list of the papers, repositories, tutorials, and anythings related to the large language models for tools☆68Updated 2 years ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆74Updated 9 months ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Updated last year
- Reading list for Multimodal Large Language Models☆68Updated 2 years ago
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Updated 2 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark☆54Updated 2 years ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆46Updated 6 months ago
- LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation☆132Updated last year
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Updated last year
- A curated list of vision-and-language pre-training (VLP). :-)☆59Updated 3 years ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Updated 3 years ago
- ☆30Updated 2 years ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆43Updated 2 years ago
- Residual Prompt Tuning: a method for faster and better prompt tuning.☆55Updated 2 years ago
- ☆50Updated last year
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding☆50Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated last year
- [ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents☆16Updated last year
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Updated 2 years ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆44Updated last year
- ☆65Updated last year
- 微信公众号:机器感知 | Tracking the Latest Arxiv Papers☆38Updated 3 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago
- Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆90Updated 2 years ago
- Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN☆85Updated last year
- ☆66Updated 2 years ago
- Open LLaMA Eyes to See the World☆174Updated 2 years ago
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆38Updated 2 years ago