[ACL 2025] Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
☆31Aug 11, 2025Updated 6 months ago
Alternatives and similar repositories for MM-F2F
Users that are interested in MM-F2F are comparing it to the libraries listed below
Sorting:
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- ☆17Apr 12, 2021Updated 4 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 11 months ago
- StyleTTS 2 Optimized Training Fork☆33Feb 2, 2025Updated last year
- ☆38Apr 3, 2025Updated 11 months ago
- ☆34Jun 15, 2021Updated 4 years ago
- ☆41May 15, 2023Updated 2 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆22Jan 19, 2026Updated last month
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 7 months ago
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆56Feb 20, 2026Updated 2 weeks ago
- ☆43Aug 17, 2024Updated last year
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer☆38Feb 17, 2025Updated last year
- ☆12Feb 16, 2024Updated 2 years ago
- Project Gold ✨☆11Jan 29, 2026Updated last month
- Terrier's desktop search demo product☆13Aug 2, 2018Updated 7 years ago
- Extrinsic calibration for the color and depth cameras of Pepper robot☆10Dec 1, 2017Updated 8 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- A virtual musical instrument built using Google MediaPipe.☆12Oct 10, 2022Updated 3 years ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆49Aug 15, 2025Updated 6 months ago
- A curated collection of my agent-skills☆25Jan 25, 2026Updated last month
- Real-time MIDI fuzzy chord and scale identification☆15Nov 8, 2023Updated 2 years ago
- The resources for the paper "User Modeling with Click Preference and Reading Satisfaction for News Recommendation"☆11Jan 17, 2021Updated 5 years ago
- Unofficial repo for SubTab with additional code and data for Adult Income and BlogFeedback datasets. BlogFeedback data is attached as zip…☆10Jun 24, 2022Updated 3 years ago
- ☆13Sep 25, 2024Updated last year
- SimEc code relying on the theano library - check out the simec repo instead for keras based code!☆10Feb 28, 2018Updated 8 years ago
- ☆12Jan 11, 2018Updated 8 years ago
- ☆12Nov 5, 2024Updated last year
- ☆10Oct 17, 2021Updated 4 years ago
- 一个基于原生浏览器书签的知识库:用 GitHub Gist 跨浏览器同步书签,并用 AI 为书签生成摘要、标签和封面,提供一个简洁的 Web 端浏览体验。☆30Jan 5, 2026Updated 2 months ago
- ☆11Dec 11, 2024Updated last year
- eSNN - Learning similarity measure from data☆12Nov 28, 2019Updated 6 years ago
- xState-based validation tool for OCF files☆15Apr 10, 2025Updated 10 months ago
- Official implementation of INTERSPECCH 2022 Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals☆16Sep 19, 2025Updated 5 months ago
- Simulation and Visualization tool for the Robot Raconteur robotics middleware☆11May 6, 2020Updated 5 years ago
- ☆13Oct 25, 2024Updated last year
- Exploratory search engine based on hierarchical topic models from BigARTM☆13Mar 8, 2022Updated 3 years ago
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…☆11Jul 10, 2020Updated 5 years ago