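Fully local speech-to-speech interaction built on LLM + ASR + TTS, with support for API integration; the eventual goal is to move to a VLM for companion care in fully intelligent scenarios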
☆35 · Aug 26, 2025 · Updated 7 months ago
Alternatives and similar repositories for AITTS
Users that are interested in AITTS are comparing it to the libraries listed below.
- Official repository of PAFUSE ☆15 · Dec 10, 2024 · Updated last year
- ☆26 · Dec 20, 2024 · Updated last year
- Code for GHA (ACCV2018) ☆13 · Oct 31, 2018 · Updated 7 years ago
- ☆10 · Apr 7, 2025 · Updated 11 months ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents ☆14 · Jul 31, 2025 · Updated 7 months ago
- About Demo of CR/Nova; Dobot V3 version ROS2 ☆34 · Aug 25, 2025 · Updated 7 months ago
- Official implementation of 'P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering'. (Accepted by ICLR 2024) ☆18 · Jan 19, 2024 · Updated 2 years ago
- ☆19 · Sep 19, 2023 · Updated 2 years ago
- Artifact evaluation for "E2Usd: Efficient-yet-effective Unsupervised State Detection for Multivariate Time Series" accepted by WWW'24 ☆13 · Jul 29, 2024 · Updated last year
- This repository contains the code for our paper "Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive … ☆14 · Nov 23, 2023 · Updated 2 years ago
- This repository provides a system for generating explanations in autonomous robots (ROS 2) based on log analysis using LLMs. ☆12 · Nov 25, 2025 · Updated 4 months ago
- Original implementation of SmartRAG: Jointly Learn RAG-Related Tasks From the Environment Feedback (ICLR 2025) ☆17 · Feb 17, 2025 · Updated last year
- Code for "Reconstructing 3D Human Pose from RGB-D Data with Occlusions" (PG 2023) ☆13 · Nov 5, 2023 · Updated 2 years ago
- ☆16 · Jul 21, 2022 · Updated 3 years ago
- LLM-based robot that intervenes only when needed ☆36 · Aug 1, 2025 · Updated 7 months ago
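- A distributed Linux performance monitor implemented in C++ that uses the gRPC framework to monitor CPU status, system load, soft interrupts, memory information, and network interface status; data refreshes every three seconds, matching the default refresh interval of top. The whole project environment is built from a Dockerfile, and simulated load tests with the stress tool are used to analyze the server's c… ☆17 · Sep 26, 2024 · Updated last year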
- The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision… ☆25 · Apr 6, 2025 · Updated 11 months ago
- [NeurIPS 2023 Spotlight] The Pursuit of Human Labeling: A New Perspective on Unsupervised Learning ☆19 · Nov 7, 2023 · Updated 2 years ago
- [ICIP 2022 oral] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning ☆28 · Jun 28, 2023 · Updated 2 years ago
- ☆20 · Sep 11, 2024 · Updated last year
- q2r ros code ☆37 · May 20, 2024 · Updated last year
- Graduation project code for a comprehensive computer vision system based on gesture recognition ☆18 · Mar 23, 2024 · Updated 2 years ago
- KMM: Key Frame Mask Mamba for Extended Motion Generation ☆19 · Sep 22, 2025 · Updated 6 months ago
- ☆19 · May 23, 2025 · Updated 10 months ago
- ONNX implementation of YOLOv5 and Siamese Network (ResNet100) with ArcFace loss for Face Detection and Recognition ☆24 · Feb 17, 2023 · Updated 3 years ago
- NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants ☆12 · Mar 12, 2023 · Updated 3 years ago
- ☆72 · Mar 18, 2026 · Updated last week
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (… ☆21 · Apr 19, 2024 · Updated last year
- ☆13 · Mar 30, 2022 · Updated 3 years ago
- A visual-servo implementation based on IBVS by RealMan Robotics. ☆18 · Oct 14, 2024 · Updated last year
- Helm charts for Gen3 Deployments ☆14 · Updated this week
- TCSVT'26 & ICASSP'24 ☆16 · Mar 15, 2026 · Updated last week
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts ☆42 · Jun 12, 2025 · Updated 9 months ago
- This is a ros2 package to create an arm robot in rviz using robot_state_publisher and joint_state_publisher gui ☆31 · Sep 20, 2022 · Updated 3 years ago
- ☆22 · Aug 30, 2021 · Updated 4 years ago
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information ☆32 · Dec 26, 2024 · Updated last year
- Official code for the paper "Understanding Co-speech Gestures in-the-wild" ☆20 · Oct 31, 2025 · Updated 4 months ago
- [IROS 2022] Transporters with Visual Foresight (TVF) ☆11 · Jul 25, 2022 · Updated 3 years ago
- Text-to-Gesture Generation Model Using Convolutional Neural Network ☆12 · Nov 21, 2022 · Updated 3 years ago