aimclub / OCEANAI
Algorithms for Intelligent Assessment of Human Personality Traits based on His Multimodal Data for ranking potential candidates to perform professional responsibilities
★35 · Updated 3 months ago
Alternatives and similar repositories for OCEANAI:
Users who are interested in OCEANAI are comparing it to the libraries listed below:
- Awesome lists about Speech Emotion Recognition ★83 · Updated 3 months ago
- Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ★15 · Updated last year
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems ★81 · Updated last year
- ★39 · Updated last week
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ★143 · Updated last year
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization ★48 · Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ★61 · Updated 2 weeks ago
- [Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation ★145 · Updated last month
- Mirror of hf.co/pyannote/speaker-diarization-3.1 ★20 · Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ★18 · Updated 5 months ago
- Multi-modal Human Emotion Recognition of speech clips (audio + video) contained in RAVDESS dataset using a two stream architecture ★29 · Updated 2 years ago
- VoiceBench: Benchmarking LLM-Based Voice Assistants ★159 · Updated this week
- [Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units ★31 · Updated 5 months ago
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (… ★59 · Updated 9 months ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models ★43 · Updated 9 months ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%) ★27 · Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ★33 · Updated this week
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition" ★10 · Updated 2 years ago
- Speaker diarization service ★21 · Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction ★93 · Updated 11 months ago
- ★121 · Updated 7 months ago
- [Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition ★100 · Updated 5 months ago
- [INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark ★217 · Updated 9 months ago
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances … ★63 · Updated 3 months ago
- Use quantized versions of Whisper to speed up inference ★12 · Updated 5 months ago
- Create a knowledge graph out of unstructured legal text - use said knowledge graph in a graph augmented retrieval augmented generation pipe… ★39 · Updated 6 months ago
- ★15 · Updated 6 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ★62 · Updated 10 months ago
- The official implementation of EmoSphere++ ★80 · Updated 2 weeks ago
- [WACV 2023] Audio-Visual Efficient Conformer (AVEC) for Robust Speech Recognition ★93 · Updated 2 years ago