Official PyTorch+CUDA Full-functional Web Demo for MiniCPM-o 4.5
☆203May 12, 2026Updated 2 weeks ago
Alternatives and similar repositories for MiniCPM-o-Demo
Users that are interested in MiniCPM-o-Demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [KDD 2026] Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe☆32Aug 10, 2025Updated 9 months ago
- 一个实时交互的语音项目☆40May 20, 2026Updated last week
- ☆18Mar 31, 2022Updated 4 years ago
- ☆15Apr 4, 2025Updated last year
- ☆23Oct 17, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- Utilized attention incorporated UNet model for conditional image generation using Flow Matching with Conditional Optimal Transport Object…☆14Dec 29, 2023Updated 2 years ago
- Simplistic Implementation of Zipformer:A faster and better encoder for automatic speech recognition in PyTorch☆21Jun 3, 2024Updated last year
- ☆184Aug 25, 2025Updated 9 months ago
- OpenSFEDS, a near-eye gaze estimation dataset containing approximately 2M synthetic camera-photosensor image pairs sampled at 500 Hz unde…☆13Apr 18, 2024Updated 2 years ago
- Copied from official repo of VITS. Added some comments.☆19Sep 24, 2024Updated last year
- ☆10Apr 26, 2017Updated 9 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- C code to extract mfcc or fbank features from wav files☆17Oct 25, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Dec 8, 2024Updated last year
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 6 months ago
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Op…☆31Dec 11, 2025Updated 5 months ago
- Source code for the MICCAI 2019 MF-TAPNet paper☆23Nov 13, 2019Updated 6 years ago
- ☆26Dec 23, 2021Updated 4 years ago
- Towards a general language-audio model for computational paralinguistic tasks☆24Dec 14, 2024Updated last year
- ☆14Oct 3, 2025Updated 7 months ago
- ☆17Aug 5, 2025Updated 9 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- proof of concept conversation orchestrator with a speech-language model☆20Oct 19, 2024Updated last year
- Code for Detecting language from text in python using fasttext☆13May 25, 2020Updated 6 years ago
- Experiments for TwinNet paper☆13Apr 9, 2018Updated 8 years ago
- 使用onnxruntime部署LivePortrait人像动画生成,包含C++和Python两个版本的程序☆33Aug 5, 2024Updated last year
- Speech Signal Processing project with different types of filters.☆10Aug 7, 2017Updated 8 years ago
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆13Feb 1, 2022Updated 4 years ago
- Large Language Models as Evaluators for Recommendation Explanations (RecSys 2024 Reproducibility)☆20Aug 13, 2025Updated 9 months ago
- ☆20Apr 21, 2026Updated last month
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…☆64Apr 8, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The Tensorflow implement of the DHCNet☆11Aug 4, 2018Updated 7 years ago
- French-English translator using word embeddings, bi-directional encoder, and decoder with attention☆15Jun 20, 2018Updated 7 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Nov 27, 2023Updated 2 years ago
- A simple generate script utils using fastchat conv template for generation of Large Language Models☆21Jun 21, 2023Updated 2 years ago
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆25Jan 30, 2024Updated 2 years ago
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance☆23Jul 25, 2024Updated last year
- ☆29Aug 19, 2024Updated last year