Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024)
☆25Oct 22, 2024Updated last year
Alternatives and similar repositories for FastMem
Users that are interested in FastMem are comparing it to the libraries listed below
- ☆16Jun 25, 2025Updated 8 months ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆19Jun 13, 2025Updated 8 months ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆28Dec 18, 2024Updated last year
- ☆23Dec 17, 2024Updated last year
- A feature-rich concurrency kit, yet another DAG framework☆10Jan 18, 2026Updated last month
- Paster core module using KiteX☆10Aug 30, 2023Updated 2 years ago
- ☆38Jan 17, 2025Updated last year
- GRPO for training long-form QA and instruction following with a long-form reward model☆17Jul 17, 2025Updated 7 months ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆25Jan 5, 2026Updated last month
- A job management system for python☆10Jan 16, 2026Updated last month
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur…☆11Aug 5, 2019Updated 6 years ago
- Concurrent map implementation using a bucket list, similar to a skip list.☆10May 29, 2022Updated 3 years ago
- Test equality between a black-box LLM API and a reference distribution☆12Oct 29, 2024Updated last year
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- An alternative to the Elasticsearch engine, written in Go for small sets of documents, that uses an inverted index to build the index and utilizes …☆15Jun 14, 2020Updated 5 years ago
- My dotfiles config... Feel free to use☆10Jan 23, 2026Updated last month
- A/B test knowledge system.☆12Sep 24, 2020Updated 5 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- Accurate counters with Kafka & RocksDB.☆15Jan 22, 2021Updated 5 years ago
- ☆16May 16, 2025Updated 9 months ago
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹☆23Aug 2, 2025Updated 7 months ago
- Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference☆10Jul 10, 2023Updated 2 years ago
- ☆12Apr 25, 2025Updated 10 months ago
- Simple, non-authoritative benchmarks for embedded databases running in GitHub Actions☆11Jul 11, 2024Updated last year
- go-superviser for restarting a Go service☆18Dec 14, 2012Updated 13 years ago
- ☆17Dec 22, 2021Updated 4 years ago
- (deprecated) Sentinel Go data-source modules☆11Dec 9, 2020Updated 5 years ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- Kernel Library Wheel for SGLang☆16Updated this week
- ☆26Jan 4, 2026Updated last month
- Dynamic Traffic Shaper☆12Jun 4, 2018Updated 7 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- ☆14Jun 19, 2022Updated 3 years ago
- An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark.☆10Dec 11, 2024Updated last year
- Train Chinese Wikipedia word vectors, character vectors, and pinyin vectors with Python 3☆12Jan 2, 2022Updated 4 years ago
- A sophisticated web application designed to revolutionize the resume screening process by harnessing the power of multiple state-of-the-a…☆12Mar 13, 2025Updated 11 months ago