danny911kr / REALTALK
Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.
☆36Updated 7 months ago
Alternatives and similar repositories for REALTALK
Users who are interested in REALTALK are comparing it to the libraries listed below.
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆54Updated 11 months ago
- ☆77Updated 4 months ago
- A project for tri-modal LLM benchmarking and instruction tuning.☆56Updated 10 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆86Updated last month
- ☆51Updated 2 months ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆98Updated 2 weeks ago
- Code for paper "Patch-Level Training for Large Language Models"☆97Updated 3 months ago
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆45Updated 2 weeks ago
- ☆42Updated last year
- [AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning☆15Updated last year
- MIO: A Foundation Model on Multimodal Tokens☆33Updated last year
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆132Updated 4 months ago
- PyTorch implementation of StableMask (ICML'24)☆15Updated last year
- [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens☆25Updated 2 years ago
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024)☆25Updated 3 months ago
- Code for the ICML 2025 paper "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆23Updated this week
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆39Updated 2 years ago
- ☆131Updated last week
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Updated 2 months ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Updated last year
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…☆124Updated 3 months ago
- A spoken version of the textual story cloze benchmark☆20Updated 2 years ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆60Updated last year
- ☆84Updated 3 months ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Updated last year
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆84Updated 2 years ago
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆25Updated 8 months ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Updated 7 months ago
- ☆19Updated last year
- [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization☆108Updated 8 months ago