Arthurzhangsheng / video_utils
Code for splitting, decomposing, and compositing video
☆11Updated 6 years ago
Alternatives and similar repositories for video_utils
Users who are interested in video_utils are comparing it to the libraries listed below
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- Modify-Anything is based on yolov5 and yolov8 for video and image detection. Segment-anything and lama_cleaner are applied to segment, modify, era…☆17Updated 2 years ago
- Chinese text generation; the news and prose models and code are now open source☆24Updated 2 years ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Updated 5 months ago
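- Chat interface for the Coze (扣子) API☆17Updated last year
- Chinese real-time voice cloning☆38Updated 6 years ago
- AI virtual anchor 1.0: an LSTM-based bot that completes couplets in real time☆12Updated 3 years ago
- AI developer platform. The goal is to build a platform that captures video and images, calls APIs for intelligent data annotation, and runs automated tests once training is complete.☆33Updated 7 years ago
- 万国置地: intelligent customer service bot☆20Updated 9 years ago
- Automatically cuts video to match the beat of the music☆17Updated 4 years ago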
- ☆15Updated 5 years ago
- Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”☆22Updated 3 months ago
- DINet-based inference service for video streams and videos☆16Updated 2 years ago
- Python image processing, reverse image search, lossless compression☆11Updated 6 years ago
- ☆12Updated 3 years ago
- ☆77Updated 2 years ago
- Automatically tracks hot GitHub repos, updated daily☆49Updated this week
- Uses the Wave2lip model from the PaddleGAN suite to give the person in a photo a voice and matching lip movements☆26Updated 4 years ago
- Video editing using AI☆48Updated 6 years ago
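- My frequently used scripts☆91Updated last year
- Node.js-based Zhihu crawler (x-zse-96); supports downloading articles, comments, and images locally☆16Updated 2 years ago
- AIGC application integrating an LLM with SDXL☆29Updated last year
- Real-time video understanding and interaction through text, audio, image, and video with a large multi-modal model. A real-time video understanding and interaction framework using large multi-modal models, via text…☆25Updated last year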
- Convert ppt to video with audio track, using text to speech synthesis☆67Updated 7 years ago
- A music retrieval demo in Python☆36Updated 7 years ago
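- FastAPI-based speech service system integrating text-to-speech (TTS) and speech-to-text (STT). Uses CosyVoice2 as the TTS engine and FunASR as the STT engine, with advanced features such as zero-shot voice cloning, streaming output, and multilingual recognition.☆18Updated 7 months ago
- Character name statistics and relationship extraction for novels (based on HanLP)☆46Updated 6 years ago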
- Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generati…☆87Updated this week
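- LeseNet is an ImageNet-like hierarchical image classification dataset. The project collects and annotates images of household waste to help automate waste classification and sorting. It is for public-benefit purposes only; please do not use it commercially.☆55Updated 6 years ago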
- AI Short Video Engine: Where AI Meets Video Rendering. Transform Articles into Viral-Ready Video Reels with AI-Powered Precision. AI Shor…☆89Updated 5 months ago