onion-liu / arxiv_daily_aigc
An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC
☆61 Updated this week
Alternatives and similar repositories for arxiv_daily_aigc
Users who are interested in arxiv_daily_aigc are comparing it to the libraries listed below
- Customize your arXiv recommendation every day. ☆121 Updated 5 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization ☆217 Updated 4 months ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click. ☆81 Updated 5 months ago
- PresentAgent: Multimodal Agent for Presentation Video Generation ☆96 Updated 3 weeks ago
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance" ☆129 Updated 9 months ago
- AI video editing ☆201 Updated 2 weeks ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data. ☆249 Updated 2 weeks ago
- [ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs ☆466 Updated 7 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems". ☆181 Updated 5 months ago
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision ☆216 Updated this week
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt ☆285 Updated 2 months ago
- ☆166 Updated 9 months ago
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts ☆40 Updated 9 months ago
- Fetch arxiv data to LLM-friendly text ☆124 Updated 6 months ago
- ☆266 Updated last year
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content ☆154 Updated 5 months ago
- Cookbook for Crafting Good Code ☆56 Updated last year
- ☆77 Updated 4 months ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning ☆247 Updated 2 months ago
- AI-Powered Video Retrieval & Clipping Tool ☆325 Updated last week
- Paper-reading tool: one-click screenshot plus AI translation, math formula support, and pinned multi-window management ☆114 Updated this week
- [ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models ☆73 Updated 3 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables" ☆67 Updated 2 weeks ago
- ☆121 Updated 3 weeks ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts ☆45 Updated 7 months ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning ☆234 Updated 5 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s… ☆81 Updated 8 months ago
- ☆55 Updated 9 months ago
- Awesome Instruction Editing. Image and Media Editing with Human Instructions. Instruction-Guided Image and Media Editing. ☆83 Updated this week
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents ☆113 Updated 2 weeks ago