MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation
☆28Apr 18, 2024Updated 2 years ago
Alternatives and similar repositories for MemoChat
Users that are interested in MemoChat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-Controlled Memory System for LLMs☆50Apr 26, 2024Updated 2 years ago
- Source code and demo for memory bank and SiliconFriend☆429May 24, 2023Updated 2 years ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆25Nov 18, 2024Updated last year
- [NAACL 2025] The implementation of paper "Hello Again! LLM-powered Personalized Agent for Long-term Dialogue".☆80May 2, 2025Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 3 years ago
- ☆12Nov 5, 2024Updated last year
- ☆12Dec 13, 2023Updated 2 years ago
- SECOM: On Memory Construction and Retrieval for Personalized Conversational Agents, ICLR 2025☆55Mar 1, 2025Updated last year
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆11Nov 19, 2024Updated last year
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆17Oct 12, 2024Updated last year
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured …☆63Apr 21, 2026Updated last month
- This repository contains the code for our paper "Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive …☆14Nov 23, 2023Updated 2 years ago
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆34Sep 20, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆12Aug 13, 2024Updated last year
- ☆20Aug 14, 2025Updated 9 months ago
- ☆11Feb 28, 2024Updated 2 years ago
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆37Nov 17, 2024Updated last year
- Official repository of DialSim☆32Oct 31, 2025Updated 6 months ago
- ☆18Jul 25, 2025Updated 9 months ago
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆149Jul 24, 2024Updated last year
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- [IEEE TMI 2024] Prototype-Guided Graph Reasoning Network for Few-Shot Medical Image Segmentation☆12Jun 13, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Do Large Language Models Know What They Don’t Know?☆102Nov 8, 2024Updated last year
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆47Apr 22, 2026Updated last month
- ☆14Dec 9, 2021Updated 4 years ago
- ☆28May 29, 2024Updated last year
- ☆34Jun 10, 2025Updated 11 months ago
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…☆21Jun 13, 2025Updated 11 months ago
- AI model designed to test the effectiveness in handling external ethical attacks.☆11Feb 9, 2026Updated 3 months ago
- ☆15Jun 2, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Synthetic data generation for TODs☆23Jul 17, 2024Updated last year
- ARI (Abstract Reasoning Induction) is an innovative framework designed to enhance the temporal reasoning capabilities of Large Language M…☆13Dec 29, 2024Updated last year
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims☆28May 30, 2023Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆57Apr 7, 2026Updated last month
- SalesBot 2.0☆30Aug 14, 2025Updated 9 months ago
- ☆17Nov 29, 2023Updated 2 years ago