amazon-science / slang-llm-benchmarkLinks
☆18Updated last year
Alternatives and similar repositories for slang-llm-benchmark
Users that are interested in slang-llm-benchmark are comparing it to the libraries listed below
Sorting:
- Train Station Computer Vision demo☆13Updated 3 years ago
- This repository contains the metadata and data of different databases that we use for testing☆15Updated 7 months ago
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".☆193Updated last week
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆451Updated last year
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆595Updated last year
- Lots of PRO courses Forever FREE☆10Updated 2 years ago
- Testing OpenAi Whisper models on a Raspberry PI 5☆24Updated last year
- ☆12Updated 4 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆294Updated 2 months ago
- The Open Source Code of UniAudio☆574Updated last year
- 🤗 R1-AQA Model: mispeech/r1-aqa☆296Updated 5 months ago
- Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.☆691Updated last year
- PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models☆727Updated last week
- A Survey of Spoken Dialogue Models (60 pages)☆308Updated 9 months ago
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊☆267Updated 7 months ago
- Audio Large Language Models☆690Updated 2 months ago
- Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.☆179Updated 9 months ago
- Keep track of big models in audio domain, including speech, singing, music etc.☆493Updated 11 months ago
- A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.☆388Updated 3 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆419Updated 11 months ago
- Speech, Language, Audio, Music Processing with Large Language Model☆885Updated 3 weeks ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆297Updated last month
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch☆495Updated 5 months ago
- An Open-source Streaming High-fidelity Neural Audio Codec☆487Updated 6 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆293Updated 3 months ago
- ☆222Updated 3 months ago
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) i…☆64Updated last year
- ☆17Updated this week
- ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM☆338Updated 3 months ago
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆37Updated 11 months ago