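Fully local speech-to-speech interaction built on LLM + ASR + TTS, with API integration also supported; the long-term goal is to move to a VLM to provide companionship and care across all smart scenarios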
☆36Aug 26, 2025Updated 8 months ago
Alternatives and similar repositories for AITTS
Users that are interested in AITTS are comparing it to the libraries listed below.
- Deep Think Agent based on IMO25 and Deep Research☆48Oct 30, 2025Updated 6 months ago
- Playground for a hand-eye calibration with easy_handeye2, no hardware required.☆16Feb 6, 2025Updated last year
- Official repository of PAFUSE☆16Dec 10, 2024Updated last year
- helps people beautify their bookmarks and lists their growing! 🐕☆25Apr 14, 2026Updated 3 weeks ago
- Detects when a dynamic-IP VPS is blocked, automatically changes its IP, and notifies a Telegram bot☆11Mar 22, 2024Updated 2 years ago
- A WAN port forwarder written in PHP☆12Jan 4, 2017Updated 9 years ago
- ☆27Dec 20, 2024Updated last year
- A self-adaptive and class-balanced approach to improve deep neural network performance in the presence of noisy labels☆19Jul 2, 2024Updated last year
- Remote control AI coding assistants (Claude Code/Codex) via Telegram☆28Oct 22, 2025Updated 6 months ago
- Code for GHA (ACCV2018)☆13Oct 31, 2018Updated 7 years ago
- ☆10Apr 7, 2025Updated last year
- ☆17Dec 7, 2025Updated 4 months ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆15Jul 31, 2025Updated 9 months ago
- EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important tempo…☆23Mar 8, 2024Updated 2 years ago
- An open-source voice input application.☆34Apr 11, 2026Updated 3 weeks ago
- This repository provides a system for generating explanations in autonomous robots (ROS 2) based on log analysis using LLMs.☆12Nov 25, 2025Updated 5 months ago
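- NetCore-Go is a feature-rich, high-performance networking library for Go that provides a complete network programming solution, including TCP/UDP servers, WebSocket, an HTTP server, RPC, gRPC, and KCP protocol support, along with enterprise-grade features such as service discovery, load balancing, configuration management, logging, and monitoring metrics.☆40Sep 20, 2025Updated 7 months ago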
- Code for "Reconstructing 3D Human Pose from RGB-D Data with Occlusions" (PG 2023)☆13Nov 5, 2023Updated 2 years ago
- Heartfelt companionship, warmth by your side 💕 Built with ❤️ using AgentScope☆50Apr 27, 2026Updated last week
- ☆16Jul 21, 2022Updated 3 years ago
- LLM-based robot that intervenes only when needed☆36Aug 1, 2025Updated 9 months ago
- The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision…☆24Apr 6, 2025Updated last year
- [ICIP 2022 oral] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning☆28Jun 28, 2023Updated 2 years ago
- ROS2 beta-package for Elfin robot☆39Dec 6, 2025Updated 5 months ago
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 7 months ago
- ☆19May 23, 2025Updated 11 months ago
- NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants☆12Mar 12, 2023Updated 3 years ago
- ☆25Jan 20, 2025Updated last year
- A lightweight web console for managing multiple One API compatible sites☆48Aug 9, 2025Updated 8 months ago
- One-click Python environment installation script for Linux☆23Nov 20, 2025Updated 5 months ago
- A visual-servo implementation based on IBVS by RealMan Robotics.☆19Oct 14, 2024Updated last year
- Multiple traffic entities detection and tracking from bird-view drone stationary videos https://engyasin.github.io/Offline_MOT/☆14Mar 27, 2023Updated 3 years ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆20Aug 5, 2025Updated 9 months ago
- Helm charts for Gen3 Deployments☆13Updated this week
- TCSVT'26 & ICASSP'24☆17Mar 15, 2026Updated last month
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆35Dec 5, 2022Updated 3 years ago
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information☆33Dec 26, 2024Updated last year
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆24Oct 31, 2025Updated 6 months ago
- [IROS 2022] Transporters with Visual Foresight (TVF)☆11Jul 25, 2022Updated 3 years ago