The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Poster).
☆70Sep 29, 2025Updated 5 months ago
Alternatives and similar repositories for MemoryDecoder
Users that are interested in MemoryDecoder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The resources for the paper "User Modeling with Click Preference and Reading Satisfaction for News Recommendation"☆11Jan 17, 2021Updated 5 years ago
- Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding☆96Dec 2, 2025Updated 3 months ago
- ☆15May 17, 2022Updated 3 years ago
- Source codes for our paper "Neural Temporality Adaptation for Document Classification: Diachronic Word Embeddings and Domain Adaptation M…☆12Apr 20, 2021Updated 4 years ago
- ☆12Nov 7, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- text to speech☆10Mar 19, 2024Updated 2 years ago
- ☆15Apr 29, 2025Updated 10 months ago
- A Collection of Papers about Memory for Language Agents☆393Jan 21, 2026Updated 2 months ago
- BERT系列模型、搜搜、剪枝、蒸馏☆13Sep 10, 2020Updated 5 years ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆55Dec 25, 2025Updated 3 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 5 months ago
- This repository is the official implementation of Bidirectional Learning for Offline Infinite-width Model-based Optimization (NeurIPS 202…☆14Jan 19, 2023Updated 3 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An official pytorch implementation of "MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts"☆35Nov 21, 2024Updated last year
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆43Oct 28, 2025Updated 4 months ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- EDA toolchain for processing-in-memory architectures, including an architecture synthesizer, a compiler, and a simulator☆19Jun 12, 2025Updated 9 months ago
- ☆19May 11, 2023Updated 2 years ago
- ☆10Jul 13, 2024Updated last year
- 算法原理讲解及Python实现☆12Nov 2, 2021Updated 4 years ago
- Implementation of Materials Discovery with Extreme properties via AI-Driven Combinatorial Chemistry☆10May 8, 2024Updated last year
- End-to-End Super Resolution Object Detection Networks☆12Jun 8, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This project is an implementation of two-step object detection (super-resolution and object detection) to address degradation of object d…☆10May 29, 2021Updated 4 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 8 months ago
- Model implementation for the contextual embeddings project☆43Jun 2, 2025Updated 9 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- Code for "MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification"☆10Aug 26, 2024Updated last year
- 通过维基百科构建的一个中文同义词库,每一行(每一个\n分隔)为一组同义词。☆16Jul 19, 2023Updated 2 years ago
- Public filtered data sets of AIS Trajectories from Danish Waters. Data sets vary in ROI size, time period, included ship types ect. Some …☆14Oct 23, 2023Updated 2 years ago
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers☆14Feb 7, 2025Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆15Mar 12, 2024Updated 2 years ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆41May 30, 2025Updated 9 months ago
- Data for evaluating GPT-4V☆11Oct 26, 2023Updated 2 years ago
- The implementation of g2pL with a new open dataset.☆16May 14, 2023Updated 2 years ago
- Anomaly detection from ships' Automatic Identification System (AIS) data☆10Nov 2, 2024Updated last year
- Official Repository of the Deep Diacritization Paper☆17Dec 16, 2020Updated 5 years ago
- A geometric-driven semi-supervised approach for fishing activity detection from AIS data.☆12Aug 24, 2022Updated 3 years ago