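Fully local speech-to-speech interaction built on LLM + ASR + TTS, with API integration also supported; the long-term goal is to move to a VLM and provide companionship and care across fully intelligent scenarios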
☆37Aug 26, 2025Updated 9 months ago
Alternatives and similar repositories for AITTS
Users that are interested in AITTS are comparing it to the libraries listed below.
- A dialogue corpus drawn from 120 Chinese web novels☆17Feb 14, 2017Updated 9 years ago
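- 盘搜搜 (Pansoso), a web application for searching cloud-drive shares. An earlier crawler project collected more than 1.6 million cloud-drive resources from one provider, more than the database could hold, so only part of the data (roughly 320,000 entries) is published. Paired with this front end, it can be used to search them. For technical learning and exchange only. http://pan.dyboy.cn/☆12Jul 22, 2018Updated 7 years ago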
- This project explores zero-shot emotional speech synthesis using EMOD, a novel approach combining emotion and content embeddings for mult…☆18Dec 22, 2025Updated 5 months ago
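- The first industrial-grade, full-pipeline, controllable, collaborative professional AI-agent platform for film and video production, covering short films, comic dramas, and live-action-quality film and TV in one place; it follows the workflow of professional Hollywood production teams to give you a virtual production studio☆52Mar 2, 2026Updated 2 months ago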
- Codex / Claude Skill for editable thesis-defense PPTX from PDF or LaTeX while preserving a PowerPoint template. From a thesis PDF / LaTeX, generates an editable defense P…☆109May 15, 2026Updated last week
- ☆10Apr 7, 2025Updated last year
- Remote control AI coding assistants (Claude Code/Codex) via Telegram☆28Oct 22, 2025Updated 7 months ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆16Jul 31, 2025Updated 9 months ago
- This repository provides a system for generating explanations in autonomous robots (ROS 2) based on log analysis using LLMs.☆12Nov 25, 2025Updated 6 months ago
- An open-source voice input application.☆34May 14, 2026Updated last week
- Code for "Reconstructing 3D Human Pose from RGB-D Data with Occlusions" (PG 2023)☆13Nov 5, 2023Updated 2 years ago
- Community patch collection for Hermes Agent☆121Updated this week
- ☆16Jul 21, 2022Updated 3 years ago
- LLM-based robot that intervenes only when needed☆37Aug 1, 2025Updated 9 months ago
- Open-source 88code desktop☆52Oct 20, 2025Updated 7 months ago
- ☆20Sep 11, 2024Updated last year
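- E-commerce AI image-generation pipeline for best sellers: given a flat-lay photo of a new product, it automatically retrieves similar best-selling items, analyzes their style, and generates professional promotional images. ## Project overview In cross-border e-commerce, merchants need promotional images (photos of models wearing the product) for every new item. The traditional approach requires hiring models, building sets, shooting, and retouching, which is costly and slow. This project implements a fully automated AI…☆154May 6, 2026Updated 2 weeks ago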
- ☆19May 23, 2025Updated last year
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS.☆25Dec 3, 2024Updated last year
- 小风车 (Little Pinwheel): an ultra-compact live wallpaper app☆22Nov 11, 2025Updated 6 months ago
- ONNX implementation of YOLOv5 and Siamese Network (ResNet100) with ArcFace loss for Face Detection and Recognition☆24Feb 17, 2023Updated 3 years ago
- NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants☆12Mar 12, 2023Updated 3 years ago
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (…☆21Apr 19, 2024Updated 2 years ago
- ☆13Mar 30, 2022Updated 4 years ago
- A visual-servo implementation based on IBVS by RealMan Robotics.☆21Oct 14, 2024Updated last year
- Helm charts for Gen3 Deployments☆13Updated this week
- TCSVT'26 & ICASSP'24☆17Mar 15, 2026Updated 2 months ago
- A ROS 2 package for creating a robot arm in RViz using robot_state_publisher and joint_state_publisher gui☆32Sep 20, 2022Updated 3 years ago
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆24Oct 31, 2025Updated 6 months ago
- [IROS 2022] Transporters with Visual Foresight (TVF)☆11Jul 25, 2022Updated 3 years ago
- Text-to-Gesture Generation Model Using Convolutional Neural Network☆12Nov 21, 2022Updated 3 years ago
- An image fusion technique presented in “Poisson image editing”, P. Pérez, M. Gangnet, and A. Blake, SIGGRAPH 2003.☆14Jan 13, 2020Updated 6 years ago
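- A lightweight Bilibili automation bot framework built on an asynchronous architecture / bot development framework | plugin-based | automatic favorites sync | uploader video subscriptions☆32Updated this week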
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Aug 8, 2025Updated 9 months ago
- [CVPR 2025] Official Implementation of "MixerMDM: Learnable Composition of Human Motion Diffusion Models".☆25Sep 8, 2025Updated 8 months ago
- [NeurIPS 2023] Official Code for "Towards Robust and Expressive Whole-body Human Pose and Shape Estimation"☆50Feb 13, 2026Updated 3 months ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Feb 7, 2025Updated last year
- ☆11Jul 18, 2024Updated last year
- ROS wrapper for the NVIDIA Contact-GraspNet model.☆18Jul 3, 2023Updated 2 years ago