MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation
☆28Apr 18, 2024Updated 2 years ago
Alternatives and similar repositories for MemoChat
Users that are interested in MemoChat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-Controlled Memory System for LLMs☆50Apr 26, 2024Updated 2 years ago
- The git repository of Modular Prompted Chatbot paper☆35May 24, 2023Updated 2 years ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆25Nov 18, 2024Updated last year
- [NAACL 2025] The implementation of paper "Hello Again! LLM-powered Personalized Agent for Long-term Dialogue".☆78May 2, 2025Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 3 years ago
- ☆12Nov 5, 2024Updated last year
- [ICML‘2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen☆17Sep 7, 2024Updated last year
- ☆20Jul 24, 2024Updated last year
- ☆12Dec 13, 2023Updated 2 years ago
- A Benchmark for Multi-Stage Legal Case Documents Generation☆16Feb 24, 2025Updated last year
- SECOM: On Memory Construction and Retrieval for Personalized Conversational Agents, ICLR 2025☆55Mar 1, 2025Updated last year
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆17Oct 12, 2024Updated last year
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured …☆63Apr 21, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official repository for "Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory" accepted at EMNLP Find…☆39Oct 1, 2024Updated last year
- This repository contains the code for our paper "Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive …☆14Nov 23, 2023Updated 2 years ago
- ☆11Aug 13, 2024Updated last year
- ☆20Aug 14, 2025Updated 8 months ago
- PaddlePaddle Course☆12Mar 4, 2021Updated 5 years ago
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Aug 10, 2024Updated last year
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆36Nov 17, 2024Updated last year
- ☆18Jul 25, 2025Updated 9 months ago
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆148Jul 24, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- Do Large Language Models Know What They Don’t Know?☆103Nov 8, 2024Updated last year
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆45Apr 22, 2026Updated last week
- ☆28May 29, 2024Updated last year
- Source Code for Paper "Large Language Models are Few-Shot Summarizers: Multi-Intent Comment Generation via In-Context Learning"☆19Jun 9, 2023Updated 2 years ago
- ☆34Jun 10, 2025Updated 10 months ago
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- ☆15Jun 2, 2019Updated 6 years ago
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆25May 10, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Synthetic data generation for TODs☆23Jul 17, 2024Updated last year
- ARI (Abstract Reasoning Induction) is an innovative framework designed to enhance the temporal reasoning capabilities of Large Language M…☆13Dec 29, 2024Updated last year
- ☆42Apr 13, 2026Updated 2 weeks ago
- [Remote Sensing 2022] PGNet: Positioning Guidance Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Images☆13Dec 9, 2022Updated 3 years ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆55Apr 7, 2026Updated 3 weeks ago
- Constructing community of LLM-based Agent in the minecraft☆17Nov 27, 2025Updated 5 months ago
- Code for the COLING 2022 paper "DoubleMix: Simple Interpolation-Based Data Augmentation for Text Classification"☆19Oct 19, 2022Updated 3 years ago