onion-liu / arxiv_daily_aigc
An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC
☆35Updated this week
Alternatives and similar repositories for arxiv_daily_aigc
Users that are interested in arxiv_daily_aigc are comparing it to the libraries listed below
Sorting:
- Customize your arXiv recommendation every day.☆101Updated last month
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆78Updated last month
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆211Updated last month
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆130Updated 5 months ago
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆257Updated 2 weeks ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆33Updated this week
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆191Updated last month
- ☆156Updated 6 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆232Updated 2 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆162Updated 2 months ago
- Comfyui-workflow☆41Updated 5 months ago
- Fetch arxiv data to LLM-friendly text☆116Updated 2 months ago
- ☆76Updated 3 weeks ago
- 快速分享大模型生成的HTML、Markdown、SVG、Mermaid代码☆83Updated 3 weeks ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆189Updated 2 weeks ago
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content☆144Updated 2 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆142Updated last week
- Cookbook for Crafting Good Code☆54Updated last year
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision☆147Updated last week
- ☆240Updated 8 months ago
- The development and future prospects of multimodal reasoning models.☆88Updated this week
- ☆19Updated last week
- The project page of Diffutoon☆26Updated last year
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆44Updated 4 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆117Updated 2 months ago
- Googles NotebookLM but local☆229Updated 3 weeks ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆79Updated 4 months ago
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆355Updated 7 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆145Updated 2 weeks ago
- ☆53Updated 6 months ago