🚀 基于 ASR + VLM 技术的智能视频笔记工具,能够将任何视频"吞噬"并生成包含图文内容和视频剪影的结构化笔记报告
☆107Oct 18, 2025Updated 7 months ago
Alternatives and similar repositories for video-devour
Users that are interested in video-devour are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reusable AI coding agent skills for building voice AI with LiveKit☆52Feb 25, 2026Updated 3 months ago
- 使用langraph构建Agentic-RAG☆23Jul 30, 2025Updated 9 months ago
- 一款开源的基金估值框架☆29Mar 30, 2026Updated last month
- 碧树西风经典文章☆11Dec 17, 2021Updated 4 years ago
- something for paper agent☆11Dec 18, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- 记录关于AEC的论文和代码、博客以及相关资料☆15Jul 26, 2022Updated 3 years ago
- ♨︎ Chat with ChatGPT, the advanced language AI model, right from your iOS device! Get answers, have a conversation, or improve your langu…☆22Feb 4, 2023Updated 3 years ago
- 智谱 glm realtime api python/golang/ts sdk, 包括 low level 的 websocket client 封装以及各个场景的调用样例☆28May 27, 2025Updated last year
- MVDR beamformer written in python☆10Jul 2, 2021Updated 4 years ago
- The CPP version of Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆23May 11, 2024Updated 2 years ago
- 完整基于omlsa.m实现☆14Nov 26, 2021Updated 4 years ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- demos using speex☆12Apr 20, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Jul 29, 2025Updated 9 months ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- Understanding Deep Learning☆11Jul 23, 2024Updated last year
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 11 months ago
- Feedforward Sequential Memory Networks☆17Aug 2, 2022Updated 3 years ago
- WayLog - Save & Export AI Chat History. A local-first extension that turns your fleeting AI conversations into a permanent, git-friendly …☆64Jan 17, 2026Updated 4 months ago
- A ChatGPT implementation with support for Bing's GPT-4 version of ChatGPT, plus the official ChatGPT model via OpenAI's API. Available as…☆11Feb 12, 2023Updated 3 years ago
- llm langchain quick start☆16Jun 14, 2023Updated 2 years ago
- Python code to show basic sound separation using Ideal Binary Masks☆13Oct 13, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 从vue出发一步一步靠近gis☆17Feb 10, 2022Updated 4 years ago
- PASE: Phonologically Anchored Speech Enhancer☆60Apr 9, 2026Updated last month
- Ambisonic Blind Reverberation Time Estimation☆12Jun 14, 2020Updated 5 years ago
- Communication-Cost Aware Microphone Selection For Neural Speech Enhancement with Ad-hoc Microphone Arrays☆17Nov 20, 2020Updated 5 years ago
- 6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid☆17Aug 31, 2023Updated 2 years ago
- A Tensorflow attempt to reimplement the IEEE VTC2019-Fall paper "DL-CFAR: A Novel CFAR Target Detection Method Based on Deep Learning"☆14Jul 29, 2023Updated 2 years ago
- 本项目是一款以datawhale小鲸鱼为原型设计的面向新人开发者的ai语音小车教程,附带整套打包硬件,帮助大家零门槛上手。教程正在持续更新中,项目交流群:请扫码进群☆55Jan 27, 2026Updated 3 months ago
- Official repo of ICASSP 2022 paper - Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization☆20Jan 7, 2025Updated last year
- ☆20Nov 22, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in ONNX☆25Oct 4, 2021Updated 4 years ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆26May 14, 2026Updated last week
- DistantSpeech☆22Oct 9, 2023Updated 2 years ago
- 📚 从零开始的向量数据库原理与实践教程,在线阅读地址:https://easy-vecdb.datawhale.cc/☆303May 12, 2026Updated 2 weeks ago
- Toolbox for Evaluation of AEC/AES Systems☆39Feb 18, 2026Updated 3 months ago
- Codes and Solutions of "Numerical Linear Algebra by Trefethen"☆15Nov 16, 2023Updated 2 years ago
- 几种VAD算法的测评☆25Jul 31, 2020Updated 5 years ago