JokingXie / meeting-minutesLinks
基于语音识别和自然语言处理技术,自动完成会议录音的说话人分离、内容转译,并智能生成会议纪要。
☆19Updated 4 months ago
Alternatives and similar repositories for meeting-minutes
Users that are interested in meeting-minutes are comparing it to the libraries listed below
Sorting:
- Programming with local large language model.☆24Updated last month
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Updated last year
- 文本语料转训练集工具,txt转dataset☆94Updated last year
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆49Updated last week
- 视频理解:千问视频多模态模型 & Dify☆65Updated last year
- 基于ultralytics训练的行人跌倒检测模型☆19Updated 2 years ago
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆81Updated 6 months ago
- DH-Live-Web-UI☆19Updated last year
- 研究GOT-OCR-项目落地加速,不限语言☆64Updated last year
- 集成了LLM与SDXL的AIGC应用程序☆29Updated last year
- Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源…☆116Updated 4 months ago
- 一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models…☆73Updated 2 months ago
- Sample GLM4V + ChatTTS AI assistant☆85Updated last year
- 使用FastAPI+vLLM部署Qwen2.5☆24Updated last year
- 在DH_live项目基础上修改,添加webui界面☆72Updated 7 months ago
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆166Updated last year
- 基于Linly-Talker数字人改版的教育系统,包含网课总结、数字人对话、Chatbot对话,项目可在autodl部署☆34Updated last year
- 使用 FastAPI、Streamlit本地部署ChatTTS文本转语音模型,并通过 Docker Compose 进行容器化部署。☆27Updated last year
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆206Updated last year
- A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.☆39Updated last year
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆425Updated 11 months ago
- generate ppt with llm☆104Updated last year
- Простое WebUI на Flask для EasyWav2Lip☆27Updated last year
- Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generati…☆87Updated this week
- Qwen 提示词工程 & 最佳实践☆37Updated last year
- 基于 faster-whisper 的伪实时语音转写服务☆232Updated 7 months ago
- GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能☆182Updated 2 weeks ago
- Just a suturing monster project.☆42Updated 2 years ago
- 优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs☆80Updated 11 months ago
- 视频分类标注、视频时空标注☆44Updated 2 years ago