danny911kr / REALTALKLinks
Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.
☆36Updated 6 months ago
Alternatives and similar repositories for REALTALK
Users that are interested in REALTALK are comparing it to the libraries listed below
Sorting:
- ☆77Updated 4 months ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆54Updated 11 months ago
- A project for tri-modal LLM benchmarking and instruction tuning.☆56Updated 10 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆86Updated 3 weeks ago
- ☆51Updated last month
- MIO: A Foundation Model on Multimodal Tokens☆33Updated last year
- ☆19Updated last year
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆182Updated 7 months ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆95Updated last week
- [AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning☆15Updated last year
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆40Updated this week
- ☆42Updated last year
- EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning [🔥The Exploration of R1 for General Audio-Vi…☆72Updated 8 months ago
- PyTorch implementation of StableMask (ICML'24)☆15Updated last year
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆19Updated 8 months ago
- Code for paper "Patch-Level Training for Large Language Models"☆97Updated 2 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆84Updated 2 years ago
- A comprehensive framework to test audio comprehension of Large Audio Language Models.☆58Updated last week
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆89Updated last month
- ☆19Updated 5 months ago
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…☆123Updated 2 months ago
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆131Updated 4 months ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆49Updated 7 months ago
- ☆53Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆46Updated this week
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆84Updated 2 years ago
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context☆41Updated last year
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Updated 10 months ago
- [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization☆106Updated 7 months ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Updated 7 months ago