This project use PANNs for audio tagging and sound event detection, and finally get audio embeddings. Then Milvus is used to search the similarity audio items.
☆29Aug 10, 2021Updated 4 years ago
Alternatives and similar repositories for audio_search
Users that are interested in audio_search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A ffmpeg rtmp push demo which support h264 hardware encode and software encode☆11Apr 8, 2021Updated 5 years ago
- 基于rknn的yolov5的cpp实现,包含各种依赖库,是一个完整工程,可直接编译运行☆20Feb 10, 2022Updated 4 years ago
- ☆16Nov 23, 2022Updated 3 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- ☆10Aug 3, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT.☆12May 22, 2023Updated 3 years ago
- Music production for silent film clips.☆32Apr 30, 2025Updated last year
- Codebase, data and models for the Headline Grouping paper at NAACL2021☆12Oct 2, 2022Updated 3 years ago
- Reading comprehension based question-answering model for news articles.☆11Jun 22, 2022Updated 3 years ago
- 达摩fsmn vad c++推理服务☆18Apr 17, 2023Updated 3 years ago
- Newspaper Segmentation into images and text☆12Jan 11, 2019Updated 7 years ago
- ☆16Apr 24, 2021Updated 5 years ago
- An offline CPU-first low-resource chat application to perform RAG on your corpus of data. Powered by OpenChat and CTranslate2.☆15May 14, 2025Updated last year
- This tool can convert picture format(NV12/YUYV/UYVY...) to (png/jpg/bmp)☆10Jul 14, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Feb 9, 2018Updated 8 years ago
- Unofficial Tensorflow/Keras implementation of Google AI VoiceFilter☆16Mar 25, 2023Updated 3 years ago
- This repo contains the code for the tutorial for using the CrewAI agent framework to generate Sales Reports based on Salesforce data☆13Mar 16, 2024Updated 2 years ago
- ☆11Dec 19, 2020Updated 5 years ago
- repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commo…☆19May 20, 2025Updated last year
- This AI tool leverages different LLM services to generate product information from a given image. Simply upload an image of a product and…☆15Jun 25, 2024Updated last year
- transformer的 encoder-decoder结构基于tensorflow实现的中文语音识别项目☆34Feb 24, 2021Updated 5 years ago
- The code repo for Youtube tutorial series about using Python asyncio with OpenCV to grab frames from video cameras concurrently☆16Oct 3, 2021Updated 4 years ago
- A simple package of face detection☆14Nov 27, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- detecting the meotions using by analysing the sound of the person unsing python☆11Oct 7, 2019Updated 6 years ago
- a rust crate for easily implementing faster-whisper stt into your rust programs.☆24Oct 20, 2025Updated 7 months ago
- ☆12Apr 9, 2021Updated 5 years ago
- Detect paper corners in picture / video☆17Nov 25, 2019Updated 6 years ago
- Bitcoin Hourly OHLCV with 70+ Technical Indicators | Daily Updated Dataset for ML & Trading Analysis☆26Updated this week
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆21Aug 15, 2024Updated last year
- ☆15Nov 28, 2023Updated 2 years ago
- ☆21May 8, 2026Updated last month
- ☆12Jul 27, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- ☆16Jun 8, 2021Updated 5 years ago
- Time-domain Audio Separation Network☆24Aug 3, 2018Updated 7 years ago
- ☆13Aug 20, 2021Updated 4 years ago
- Audio Classification with machine learning☆18Jun 8, 2026Updated last week
- implementation of DLDL-v2☆10Jul 17, 2019Updated 6 years ago
- 用户行为分析-用户关联☆14Nov 18, 2020Updated 5 years ago