JusperLee / DolphinLinks
☆94Updated 2 weeks ago
Alternatives and similar repositories for Dolphin
Users that are interested in Dolphin are comparing it to the libraries listed below
Sorting:
- Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"☆239Updated last year
- PyTorch Implementation of VersBand(EMNLP 2025): Versatile Framework for Song Generation with Prompt-based Control☆218Updated last month
- Mini-Omni-Reasoner: a real-time speech reasoning framework that interleaves silent reasoning tokens with spoken response tokens (“thinkin…☆158Updated last month
- ☆12Updated 4 months ago
- Official code of ICML 2025 paper "NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Predicti…☆127Updated last month
- Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"☆138Updated last year
- Flexible RAG tools, Features semantic search, document indexing, and intelligent reranking with minimal intrusion design.☆89Updated last month
- GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling☆151Updated 7 months ago
- This is the demo of our paper "IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation".☆104Updated 7 months ago
- [DCASE 2023] Official Implementation for "Low-Complexity Acoustic Scene Classification Using Deep Space Separable Distillation And Mutil-…☆25Updated 10 months ago
- 一个基于多个大语言模型的智能学术范文写作系统,能够根据输入的开题报告或研究设计文档,自动生成包含引用的学术范文的各章节内容。☆225Updated 3 months ago
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆121Updated 2 months ago
- Official repo for the paper "Multimodal Phased Transformer for Sentiment Analysis".☆182Updated 3 weeks ago
- Voice-Face Association Learning Evaluation☆49Updated last year
- linkedin, seek job information crawler☆104Updated 6 months ago
- DExter: Learning and Controlling Performance Expression through Diffusion models☆92Updated last year
- CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech☆363Updated 2 months ago
- An AI-powered conversational agent for recommending over-the-counter medications based on user symptoms and needs. Built with Python and …☆200Updated 2 months ago
- ☆343Updated 3 months ago
- 采集管家☆314Updated 4 months ago
- This repository implements Yolo functionality using TensorRT and CUDA acceleration on Nvidia Jetson devices and the ROS framework.☆203Updated 2 months ago
- a small MIPS Assembly with a graphical user Interface☆32Updated 7 years ago
- Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …☆204Updated 3 months ago
- Butter is a novel 2D object detection framework designed to enhance hierarchical feature representations for improved detection robustnes…☆83Updated 2 months ago
- [ICML 2025] PyTorch Implementation of "OmniAudio: Generating Spatial Audio from 360-Degree Video"☆330Updated 3 months ago
- Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition☆308Updated last year
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆422Updated 3 weeks ago
- This is the project for the paper at ICCV 2025☆29Updated this week
- 记录我的编程学习与成长之旅。涵盖技术笔记、项目实践、心得体会与日常思考☆85Updated last week
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆60Updated 7 months ago