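🚀 An intelligent video note-taking tool built on ASR + VLM that can "devour" any video and generate a structured notes report containing text-and-image content and video clips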
☆60Oct 18, 2025Updated 5 months ago
Alternatives and similar repositories for video-devour
Users that are interested in video-devour are comparing it to the libraries listed below.
- An open-source fund valuation framework☆23Updated this week
- Classic articles by 碧树西风☆10Dec 17, 2021Updated 4 years ago
- something for paper agent☆11Dec 18, 2024Updated last year
- ♨︎ Chat with ChatGPT, the advanced language AI model, right from your iOS device! Get answers, have a conversation, or improve your langu…☆22Feb 4, 2023Updated 3 years ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- A collection of papers, code, blog posts, and related resources on AEC☆15Jul 26, 2022Updated 3 years ago
- Zhipu GLM realtime API SDK for Python/Golang/TS, including a low-level WebSocket client wrapper and usage examples for various scenarios☆24May 27, 2025Updated 9 months ago
- a version tools: face detector, face landmark detector, face parsing, and so on☆12Jul 30, 2022Updated 3 years ago
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- The CPP version of Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆23May 11, 2024Updated last year
- MVDR beamformer written in python☆10Jul 2, 2021Updated 4 years ago
- A complete implementation based on omlsa.m☆14Nov 26, 2021Updated 4 years ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- PASE: Phonologically Anchored Speech Enhancer☆44Dec 10, 2025Updated 3 months ago
- demos using speex☆12Apr 20, 2018Updated 7 years ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- This project's issues hold all of my blog posts☆19Sep 12, 2025Updated 6 months ago
- Understanding Deep Learning☆11Jul 23, 2024Updated last year
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 9 months ago
- Feedforward Sequential Memory Networks☆16Aug 2, 2022Updated 3 years ago
- Image-based algorithms and software for retrieving, matching, and recommending home decoration and furniture styles☆11Jun 19, 2021Updated 4 years ago
- llm langchain quick start☆16Jun 14, 2023Updated 2 years ago
- Python code to show basic sound separation using Ideal Binary Masks☆13Oct 13, 2018Updated 7 years ago
- ☆14Oct 12, 2023Updated 2 years ago
- Approaching GIS step by step, starting from Vue☆17Feb 10, 2022Updated 4 years ago
- Ambisonic Blind Reverberation Time Estimation☆12Jun 14, 2020Updated 5 years ago
- Communication-Cost Aware Microphone Selection For Neural Speech Enhancement with Ad-hoc Microphone Arrays☆18Nov 20, 2020Updated 5 years ago
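- A tutorial for beginner developers on building an AI voice-controlled car, designed around the Datawhale little whale mascot. It comes with a complete bundled hardware kit so anyone can get started with no barrier to entry. The tutorial is updated continuously. Project discussion group: scan the QR code to join☆51Jan 27, 2026Updated last month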
- 6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid☆17Aug 31, 2023Updated 2 years ago
- 📚 A from-scratch tutorial on vector database principles and practice; read online at https://easy-vecdb.datawhale.cc/ ☆236Feb 12, 2026Updated last month
- A Tensorflow attempt to reimplement the IEEE VTC2019-Fall paper "DL-CFAR: A Novel CFAR Target Detection Method Based on Deep Learning"☆15Jul 29, 2023Updated 2 years ago
- Official repo of ICASSP 2022 paper - Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization☆20Jan 7, 2025Updated last year
- ☆20Nov 22, 2020Updated 5 years ago
- Python script for performing depth completion from sparse depth and RGB images using the msg_chn_wacv20 model in ONNX☆25Oct 4, 2021Updated 4 years ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆24Jun 9, 2025Updated 9 months ago
- This is the official repository of ``Scalable Neural Vocoder from Range-Null Space Decomposition'', which is submitted to TPAMI.☆47Oct 11, 2025Updated 5 months ago
- DistantSpeech☆22Oct 9, 2023Updated 2 years ago
- Codes and Solutions of "Numerical Linear Algebra by Trefethen"☆15Nov 16, 2023Updated 2 years ago
- Toolbox for Evaluation of AEC/AES Systems☆35Feb 18, 2026Updated last month