chumingqian / CycleGuardianLinks
A lightwight Framework for the Respiratory Sound Classification
☆11Updated last year
Alternatives and similar repositories for CycleGuardian
Users that are interested in CycleGuardian are comparing it to the libraries listed below
Sorting:
- LLM Powered Social Media Simulator☆10Updated 10 months ago
- A walk through HuggingFace smolagents☆48Updated 11 months ago
- ☆11Updated last year
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated this week
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Updated last year
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆19Updated last week
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated this week
- Feature extraction from sound signals along with complete CNN model and evaluations using tensorflow, keras and, librosa for MFCC generat…☆10Updated 4 years ago
- ☆18Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated 2 years ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆150Updated 2 weeks ago
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a…☆46Updated 2 years ago
- Train your own speech AI model from scratch☆146Updated last week
- This repository contains a Multimodal Retrieval-Augmented Generation (RAG) Pipeline that integrates images, audio, and text for advanced …☆24Updated last year
- Web Interface for Vision Language Models Including InternVLM2☆25Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Updated 3 months ago
- ☆60Updated last month
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆22Updated last year
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆245Updated 11 months ago
- Open TTS models, built for streaming on the edge☆45Updated 10 months ago
- A curated list of awesome voice activity detection☆71Updated last year
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.☆46Updated 3 years ago
- Onnx compatible styletts2 code☆17Updated 8 months ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Updated last year
- Dual Bayesian ResNet: A Deep Learning Approach to Heart Murmur Detection (Physionet Challenge 2022)☆23Updated 4 months ago
- ☆30Updated last week
- An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"☆14Updated this week
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Updated last year
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆167Updated last year