🚀 基于 ASR + VLM 技术的智能视频笔记工具,能够将任何视频"吞噬"并生成包含图文内容和视频剪影的结构化笔记报告
☆67Oct 18, 2025Updated 5 months ago
Alternatives and similar repositories for video-devour
Users that are interested in video-devour are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用langraph构建Agentic-RAG☆23Jul 30, 2025Updated 8 months ago
- 一款开源的基金估值框架☆25Mar 30, 2026Updated 2 weeks ago
- 碧树西风经典文章☆10Dec 17, 2021Updated 4 years ago
- something for paper agent☆11Dec 18, 2024Updated last year
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 记录关于AEC的论文和代码、博客以及相关资料☆15Jul 26, 2022Updated 3 years ago
- 智谱 glm realtime api python/golang/ts sdk, 包括 low level 的 websocket client 封装以及各个场景的调用样例☆25May 27, 2025Updated 10 months ago
- a version tools. face detector,face landmark detector,face parsing and so on☆12Jul 30, 2022Updated 3 years ago
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- The CPP version of Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆23May 11, 2024Updated last year
- MVDR beamformer written in python☆10Jul 2, 2021Updated 4 years ago
- 完整基于omlsa.m实现☆14Nov 26, 2021Updated 4 years ago
- ☆13Jan 14, 2025Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- PASE: Phonologically Anchored Speech Enhancer☆46Apr 9, 2026Updated last week
- demos using speex☆12Apr 20, 2018Updated 7 years ago
- ☆16Jul 29, 2025Updated 8 months ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- WayLog - Save & Export AI Chat History. A local-first extension that turns your fleeting AI conversations into a permanent, git-friendly …☆55Jan 17, 2026Updated 2 months ago
- 项目的issue会存放我的所有blog☆19Sep 12, 2025Updated 7 months ago
- Understanding Deep Learning☆11Jul 23, 2024Updated last year
- Feedforward Sequential Memory Networks☆16Aug 2, 2022Updated 3 years ago
- A ChatGPT implementation with support for Bing's GPT-4 version of ChatGPT, plus the official ChatGPT model via OpenAI's API. Available as…☆11Feb 12, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- llm langchain quick start☆16Jun 14, 2023Updated 2 years ago
- Python code to show basic sound separation using Ideal Binary Masks☆13Oct 13, 2018Updated 7 years ago
- ☆14Oct 12, 2023Updated 2 years ago
- 从vue出发一步一步靠近gis☆17Feb 10, 2022Updated 4 years ago
- Ambisonic Blind Reverberation Time Estimation☆12Jun 14, 2020Updated 5 years ago
- 本项目是一款以datawhale小鲸鱼为原型设计的面向新人开发者的ai语音小车教程,附带整套打包硬件,帮助大家零门槛上手。教程正在持续更新中,项目交流群:请扫码进群☆53Jan 27, 2026Updated 2 months ago
- 6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid☆17Aug 31, 2023Updated 2 years ago
- 基于Eino实现数据分析处理智能体,目标尽可能的综合常见llm技术使该项目成为入门教程☆77Mar 11, 2026Updated last month
- A Tensorflow attempt to reimplement the IEEE VTC2019-Fall paper "DL-CFAR: A Novel CFAR Target Detection Method Based on Deep Learning"☆14Jul 29, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 📚 从零开始的向量数据库原理与实践教程,在线阅读地址:https://easy-vecdb.datawhale.cc/☆254Feb 12, 2026Updated 2 months ago
- Official repo of ICASSP 2022 paper - Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization☆20Jan 7, 2025Updated last year
- ☆20Nov 22, 2020Updated 5 years ago
- Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in ONNX☆25Oct 4, 2021Updated 4 years ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆25Jun 9, 2025Updated 10 months ago
- DistantSpeech☆22Oct 9, 2023Updated 2 years ago
- Codes and Solutions of "Numerical Linear Algebra by Trefethen"☆15Nov 16, 2023Updated 2 years ago