onion-liu / arxiv_daily_aigc
An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC
☆62Updated this week
Alternatives and similar repositories for arxiv_daily_aigc
Users who are interested in arxiv_daily_aigc are comparing it to the repositories listed below
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆218Updated 5 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆99Updated last week
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆82Updated 6 months ago
- Customize your arXiv recommendation every day.☆123Updated 5 months ago
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆129Updated 10 months ago
- A curated collection of stunning images generated with Seedream 4.0, along with their prompts☆74Updated this week
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆291Updated 3 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆251Updated last month
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆184Updated 6 months ago
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts☆39Updated 9 months ago
- AI video editing☆218Updated last month
- ☆168Updated 10 months ago
- Fetch arxiv data to LLM-friendly text☆125Updated 6 months ago
- ☆275Updated last year
- Paper-reading tool: one-click screenshot plus AI translation, with math formula support and pinned multi-window management☆120Updated 3 weeks ago
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision☆227Updated 3 weeks ago
- ☆123Updated last month
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content☆154Updated 6 months ago
- Open Image Curation Tools☆48Updated 4 months ago
- Convert Everything to PDF☆164Updated 4 months ago
- ☆77Updated 5 months ago
- [ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs☆472Updated 7 months ago
- Learning records for building a large language model from scratch☆57Updated 8 months ago
- Cookbook for Crafting Good Code☆56Updated last year
- AI-Powered Video Retrieval & Clipping Tool☆337Updated 3 weeks ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆253Updated 3 weeks ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆240Updated 5 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆81Updated 8 months ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)☆300Updated this week
- coze api to openai☆15Updated last year