HWTeng-Teaching / 202409-Stat
☆10 Updated last month
Alternatives and similar repositories for 202409-Stat:
Users who are interested in 202409-Stat are comparing it to the libraries listed below.
- ☆12 Updated 7 months ago
- AWS AI Stack – A ready-to-use, full-stack boilerplate project for building serverless AI applications on AWS ☆956 Updated 2 months ago
- SOTA Open Source TTS ☆18,677 Updated this week
- We've moved beyond the limitations of traditional text length-based segmentation, opting for a smarter approach—semantic segmentation. Th… ☆12 Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆3,692 Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆9,242 Updated this week
- A fast multimodal LLM for real-time voice ☆3,298 Updated last week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆10,018 Updated this week
- Agent Zero AI framework ☆5,655 Updated last week
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio… ☆8,394 Updated last week
- ☆7,296 Updated this week
- Inference and training library for high-quality TTS models. ☆4,950 Updated last month
- Build real-time multimodal AI applications 🤖🎙️📹 ☆4,912 Updated this week
- Agentic components of the Llama Stack APIs ☆4,097 Updated this week
- Composable building blocks to build Llama Apps ☆6,885 Updated this week
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud. ☆4,011 Updated last week
- Fast and accurate automatic speech recognition (ASR) for edge devices ☆2,520 Updated this week
- ☆5,158 Updated this week
- Text-to-Music Generation with Rectified Flow Transformers ☆1,656 Updated last month
- Open source Claude Artifacts – built with Llama 3.1 405B ☆5,353 Updated last week
- TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time c… ☆4,034 Updated this week
- first base model for full-duplex conversational audio ☆1,691 Updated 3 weeks ago
- My first repository on GitHub! ☆10 Updated 3 months ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents ☆3,388 Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ☆2,773 Updated 2 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆892 Updated this week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights. ☆5,354 Updated 3 weeks ago
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains ☆4,166 Updated this week
- This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available… ☆719 Updated this week
- 📃 A better UX for chat, writing content, and coding with LLMs. ☆3,546 Updated this week