Official PyTorch+CUDA Full-functional Web Demo for MiniCPM-o 4.5
☆169May 1, 2026Updated last week
Alternatives and similar repositories for MiniCPM-o-Demo
Users who are interested in MiniCPM-o-Demo are comparing it to the libraries listed below.
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 7 months ago
- [KDD 2026] Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe☆32Aug 10, 2025Updated 8 months ago
- A real-time interactive voice project☆37Jan 29, 2026Updated 3 months ago
- ☆11Jan 15, 2020Updated 6 years ago
- ☆11May 4, 2020Updated 6 years ago
- Render terminal ANSI output into images!☆16Mar 6, 2024Updated 2 years ago
- ☆18Mar 31, 2022Updated 4 years ago
- ☆16Oct 9, 2020Updated 5 years ago
- ☆15Apr 4, 2025Updated last year
- Economic Indicators data System.☆14Dec 22, 2022Updated 3 years ago
- ☆23Oct 17, 2024Updated last year
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- YOLOv7 Live Detection Application on Node.js☆14Dec 3, 2023Updated 2 years ago
- A collection of modular and reusable libraries and tools for semantic analysis of ink! smart contracts.☆14Nov 13, 2025Updated 5 months ago
- Utilized attention incorporated UNet model for conditional image generation using Flow Matching with Conditional Optimal Transport Object…☆14Dec 29, 2023Updated 2 years ago
- Simplistic implementation of Zipformer: A faster and better encoder for automatic speech recognition in PyTorch☆20Jun 3, 2024Updated last year
- A Rust-based tool that calls large-model APIs to run task workflows☆18Sep 8, 2024Updated last year
- `google-research/slot-attention-video` but in PyTorch.☆18Aug 10, 2022Updated 3 years ago
- WebGPU MSM implementation☆16Oct 28, 2025Updated 6 months ago
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆32Feb 6, 2026Updated 3 months ago
- ☆181Aug 25, 2025Updated 8 months ago
- A dynamic proxy middleware for webpack-dev-server that enables hot-swapping proxy configurations and mock data without restarting your de…☆17Feb 2, 2026Updated 3 months ago
- ☆10Apr 26, 2017Updated 9 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- Convert English text from written expressions into spoken forms☆28Jun 22, 2022Updated 3 years ago
- C code to extract mfcc or fbank features from wav files☆17Oct 25, 2019Updated 6 years ago
- ☆17Dec 8, 2024Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Op…☆31Dec 11, 2025Updated 4 months ago
- generate git commits with custom sha prefixes☆18Nov 1, 2022Updated 3 years ago
- Source code for the MICCAI 2019 MF-TAPNet paper☆23Nov 13, 2019Updated 6 years ago
- Memory-augmented Attention Modelling for Videos☆10Apr 24, 2017Updated 9 years ago
- ETH, BSC, Polygon compatible. (May require changes in the code and the info of DEXs accordingly.)☆12Jul 11, 2022Updated 3 years ago
- The official repository of the Eesen project☆13Jun 20, 2018Updated 7 years ago
- ☆26Dec 23, 2021Updated 4 years ago
- ☆44Jun 25, 2025Updated 10 months ago
- Towards a general language-audio model for computational paralinguistic tasks☆24Dec 14, 2024Updated last year
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆24May 29, 2025Updated 11 months ago
- Rust Developer Tools for Solana and Anchor☆18Sep 16, 2024Updated last year