MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation
☆29Apr 18, 2024Updated 2 years ago
Alternatives and similar repositories for MemoChat
Users that are interested in MemoChat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-Controlled Memory System for LLMs☆50Apr 26, 2024Updated 2 years ago
- Source code and demo for memory bank and SiliconFriend☆435May 24, 2023Updated 3 years ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆24Nov 18, 2024Updated last year
- [NAACL 2025] The implementation of paper "Hello Again! LLM-powered Personalized Agent for Long-term Dialogue".☆80May 2, 2025Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 3 years ago
- ☆12Nov 5, 2024Updated last year
- [ICML‘2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen☆17Sep 7, 2024Updated last year
- ☆20Jul 24, 2024Updated last year
- SECOM: On Memory Construction and Retrieval for Personalized Conversational Agents, ICLR 2025☆59Mar 1, 2025Updated last year
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆11Nov 19, 2024Updated last year
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆17Oct 12, 2024Updated last year
- ☆18Oct 26, 2024Updated last year
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured …☆64Apr 21, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official repository for "Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory" accepted at EMNLP Find…☆41Oct 1, 2024Updated last year
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its…☆21Sep 10, 2024Updated last year
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆34Sep 20, 2024Updated last year
- ☆20Aug 14, 2025Updated 10 months ago
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆37Nov 17, 2024Updated last year
- ☆18Jul 25, 2025Updated 11 months ago
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆152Jul 24, 2024Updated last year
- [IEEE TMI 2024] Prototype-Guided Graph Reasoning Network for Few-Shot Medical Image Segmentation☆13Jun 13, 2025Updated last year
- Do Large Language Models Know What They Don’t Know?☆102Nov 8, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆50Apr 22, 2026Updated 2 months ago
- ☆14Dec 9, 2021Updated 4 years ago
- Source Code for Paper "Large Language Models are Few-Shot Summarizers: Multi-Intent Comment Generation via In-Context Learning"☆19Jun 9, 2023Updated 3 years ago
- Matrix Iteratively Reweighted Least Squares for low-rank matrix completion and estimation☆12Dec 30, 2020Updated 5 years ago
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- AI model designed to test the effectiveness in handling external ethical attacks.☆11Feb 9, 2026Updated 4 months ago
- ☆15Jun 2, 2019Updated 7 years ago
- Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding (CVPR 2025 Oral)☆42Nov 28, 2025Updated 7 months ago
- Starter for building with CopilotKit and LangGraph☆15Mar 12, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆25May 10, 2024Updated 2 years ago
- Synthetic data generation for TODs☆23Jul 17, 2024Updated last year
- ARI (Abstract Reasoning Induction) is an innovative framework designed to enhance the temporal reasoning capabilities of Large Language M…☆13Dec 29, 2024Updated last year
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims☆28May 30, 2023Updated 3 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆84Jul 18, 2025Updated 11 months ago
- BERT score for text generation☆12Jan 15, 2025Updated last year