一个讯飞智能语音平台 MSC 的第三方 Python SDK,支持语音唤醒、语音识别、语音合成、语音评测等功能。A third-party Python SDK for a iFLYTEK MSC. Using for ASR, TSS, KWS.
☆23Jan 27, 2024Updated 2 years ago
Alternatives and similar repositories for iFLYTEK-MSC-Python-SDK
Users that are interested in iFLYTEK-MSC-Python-SDK are comparing it to the libraries listed below
Sorting:
- Official repository for the WACV 2024 paper "Multi-view Classification with Hybrid Fusion and Mutual Distillation"☆15Jan 16, 2024Updated 2 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Applescripts for controlling Spotify☆23Oct 20, 2016Updated 9 years ago
- 智能控制结课作业实验代码实现部分,包括模糊控制器和PID控制器实现以及控制器参数优化整定,PID参数采用Nelder-Mead优化,模糊控制器参数采用遗传算法优化。☆10Dec 2, 2024Updated last year
- Multiple correspondence analysis☆10Apr 2, 2015Updated 10 years ago
- [ACL 2025] Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL☆12Oct 9, 2025Updated 5 months ago
- Face detection using Multi-scale Block Local Binary Pattern algorithm - optimized with OpenCL/OpenMP - Depreciated - pls use convolutiona…☆11Jul 16, 2017Updated 8 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Helper methods for Pandas Series and DataFrames to calculate numerically derivative and integral☆11Jun 7, 2019Updated 6 years ago
- 大数据监控面板☆13Oct 9, 2020Updated 5 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year
- ☆11Sep 1, 2024Updated last year
- ☆10Apr 12, 2023Updated 2 years ago
- 微信小程序:导航定位 录音 瀑布流 三级地址联动☆13Feb 27, 2019Updated 7 years ago
- Code used in the project "Leveraging data assimilation and monitoring data for improvement of crop growth estimates in protected environm…☆13Aug 11, 2024Updated last year
- ☆10Sep 24, 2017Updated 8 years ago
- Evaluation repository of wikipedia index with Dria☆10Mar 14, 2024Updated last year
- course project for ECE 9603B Data Analytics Foundations☆10Aug 30, 2019Updated 6 years ago
- Code for the paper "KG-Adapter: Enabling Knowledge Graph Integration in Large Language Models through Parameter-Efficient Fine-Tuning"☆14Oct 21, 2025Updated 4 months ago
- python opencv 文档照片与证件照片的仿射变换的矫正☆11Nov 3, 2020Updated 5 years ago
- 分析优酷、土豆等视频网站的播放页面,获取视频标题、视频截图、M3U8地址以及插入页面的swf地址。☆11May 5, 2014Updated 11 years ago
- ☆12Apr 9, 2025Updated 11 months ago
- ☆13May 12, 2025Updated 9 months ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Fast and memory-efficient exact attention ported to rocm☆13Dec 1, 2023Updated 2 years ago
- 一个文心千帆平台的第三方 Python SDK。A third-party Python SDK for a WenxinWorkshop.☆12Aug 23, 2023Updated 2 years ago
- bp神经网络调节PID☆10Apr 19, 2023Updated 2 years ago
- Gem to allow easy access to data from the WIPO PATENTSCOPE Web Service☆18Jan 11, 2021Updated 5 years ago
- Repository of the Tranferlab Practical Anomaly Detection workshop☆13Jun 14, 2024Updated last year
- Comparing performance of different InfoNCE type losses used in contrastive learning.☆14Jun 12, 2024Updated last year
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- Codes for ACL2023 paper: Knowledgeable Parameter Efficient Tuning Network for Commonsense Question Answering.☆11Sep 23, 2023Updated 2 years ago
- Deep Variational Information Bottleneck (DVIB) in PyTorch.☆10Apr 25, 2020Updated 5 years ago
- ☆25Aug 7, 2025Updated 7 months ago
- BASE-SQL: A powerful open source Text-To-SQL baseline approach☆13Feb 18, 2025Updated last year
- A Chinese Modular Speech Robot Framework Using Single-Wheel Dialogue Design | 一个采用单轮对话设计的中文模块化语音机器人框架☆13Feb 2, 2025Updated last year
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Apr 29, 2025Updated 10 months ago
- Huggingface Implementation of AV-HuBERT on the MuAViC Dataset☆18Mar 6, 2025Updated last year
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year