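🚀 An intelligent video note-taking tool based on ASR + VLM technology that can "devour" any video and generate a structured note report containing text-and-image content and video clips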
☆112Oct 18, 2025Updated 7 months ago
Alternatives and similar repositories for video-devour
Users that are interested in video-devour are comparing it to the libraries listed below.
- Building Agentic-RAG with LangGraph☆23Jul 30, 2025Updated 10 months ago
- An open-source Agent framework for fund valuation and intelligent financial management☆33Updated this week
- Classic articles by 碧树西风☆11Dec 17, 2021Updated 4 years ago
- something for paper agent☆11Dec 18, 2024Updated last year
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- A collection of papers, code, blog posts, and related materials on AEC☆15Jul 26, 2022Updated 3 years ago
- ♨︎ Chat with ChatGPT, the advanced language AI model, right from your iOS device! Get answers, have a conversation, or improve your langu…☆22Feb 4, 2023Updated 3 years ago
- a version tools: face detector, face landmark detector, face parsing, and so on☆12Jul 30, 2022Updated 3 years ago
- [WIP] Direction-based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- MVDR beamformer written in python☆10Jul 2, 2021Updated 4 years ago
- A complete implementation based on omlsa.m☆14Nov 26, 2021Updated 4 years ago
- ☆13Jan 14, 2025Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- demos using speex☆12Apr 20, 2018Updated 8 years ago
- ☆16Jul 29, 2025Updated 10 months ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- This project's issues hold all of my blog posts☆20Sep 12, 2025Updated 9 months ago
- Understanding Deep Learning☆11Jul 23, 2024Updated last year
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated last year
- Feedforward Sequential Memory Networks☆17Aug 2, 2022Updated 3 years ago
- WayLog - Save & Export AI Chat History. A local-first extension that turns your fleeting AI conversations into a permanent, git-friendly …☆68Jan 17, 2026Updated 4 months ago
- A ChatGPT implementation with support for Bing's GPT-4 version of ChatGPT, plus the official ChatGPT model via OpenAI's API. Available as…☆11Feb 12, 2023Updated 3 years ago
- llm langchain quick start☆16Jun 14, 2023Updated 3 years ago
- Python code to show basic sound separation using Ideal Binary Masks☆13Oct 13, 2018Updated 7 years ago
- ☆14Oct 12, 2023Updated 2 years ago
- Starting from Vue and approaching GIS step by step☆17Feb 10, 2022Updated 4 years ago
- Ambisonic Blind Reverberation Time Estimation☆12Jun 14, 2020Updated 6 years ago
- Communication-Cost Aware Microphone Selection For Neural Speech Enhancement with Ad-hoc Microphone Arrays☆17Nov 20, 2020Updated 5 years ago
- 6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid☆17Aug 31, 2023Updated 2 years ago
- A Tensorflow attempt to reimplement the IEEE VTC2019-Fall paper "DL-CFAR: A Novel CFAR Target Detection Method Based on Deep Learning"☆14Jul 29, 2023Updated 2 years ago
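- This project is an AI voice-controlled car tutorial for novice developers, designed around the Datawhale little whale mascot. It comes with a complete bundled hardware kit to help everyone get started with no barrier to entry. The tutorial is continuously updated. Project discussion group: scan the QR code to join☆55Jan 27, 2026Updated 4 months ago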
- PASE: Phonologically Anchored Speech Enhancer☆67Apr 9, 2026Updated 2 months ago
- Official repo of ICASSP 2022 paper - Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization☆20Jan 7, 2025Updated last year
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆26May 14, 2026Updated last month
- DistantSpeech☆22Oct 9, 2023Updated 2 years ago
- Toolbox for Evaluation of AEC/AES Systems☆39Feb 18, 2026Updated 3 months ago
- An evaluation of several VAD algorithms☆25Jul 31, 2020Updated 5 years ago
- 📚 A from-scratch tutorial on the principles and practice of vector databases; read online at https://easy-vecdb.datawhale.cc/☆334Jun 1, 2026Updated 2 weeks ago
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation☆29Oct 14, 2024Updated last year