XinshuangL/SELF-PARAM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/XinshuangL/SELF-PARAM)

XinshuangL / SELF-PARAM

The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"

☆15

Alternatives and similar repositories for SELF-PARAM

Users that are interested in SELF-PARAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FranxYao / Complexity-Based-Prompting
View on GitHub
Complexity Based Prompting for Multi-Step Reasoning
☆17Mar 10, 2023Updated 3 years ago
SimengSun / ChapterBreak
View on GitHub
☆12Jun 5, 2024Updated 2 years ago
nsfzyzz / Generalization_metrics_for_NLP
View on GitHub
[KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…
☆12Oct 17, 2022Updated 3 years ago
McGill-NLP / retriever-lm-reasoning
View on GitHub
Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…
☆28Nov 2, 2023Updated 2 years ago
swj0419 / kNN_prompt
View on GitHub
TBC
☆28Nov 2, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
wangyu-ustc / Mem-alpha
View on GitHub
The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"
☆218Dec 25, 2025Updated 7 months ago
leezythu / FocusLLM
View on GitHub
FocusLLM: Scaling LLM’s Context by Parallel Decoding
☆45Dec 8, 2024Updated last year
Trae1ounG / DyPRAG
View on GitHub
[arxiv: 2503.23895] Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement
☆182Aug 14, 2025Updated 11 months ago
IBM / larimar
View on GitHub
Code for ICML 2024 paper
☆34Sep 18, 2025Updated 10 months ago
icip-cas / OmniBehavior
View on GitHub
Code for "Towards Real-world Human Behavior Simulation: Benchmarking Large Language Models on Long-horizon, Cross-scenario, Heterogeneous…
☆46May 18, 2026Updated 2 months ago
xiusic / MinPrompt
View on GitHub
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
☆14May 3, 2024Updated 2 years ago
Zce1112zslx / IKE
View on GitHub
☆41Nov 30, 2023Updated 2 years ago
VanillaCreamer / CoRA
View on GitHub
Official source code for AAAI 2025 paper: CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendatio…
☆18Dec 11, 2024Updated last year
leezythu / Awesome-Harness-Self-Improvement
View on GitHub
A curated reading list on harness engineering for recursive self-improvement of LLM agents (EN/ZH).
☆19Jul 9, 2026Updated 2 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TrustedLLM / UnKE
View on GitHub
☆24Feb 18, 2025Updated last year
aNOnWhyMooS / connectivity
View on GitHub
☆18Jan 17, 2024Updated 2 years ago
lorelupo / divide-and-rule
View on GitHub
☆12Oct 17, 2022Updated 3 years ago
gingasan / interactive-drama
View on GitHub
☆26Mar 4, 2025Updated last year
leezythu / UCTR-ST
View on GitHub
☆17Jun 21, 2024Updated 2 years ago
zjunlp / Kformer
View on GitHub
[NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers
☆39Oct 20, 2022Updated 3 years ago
MozerWang / Loong
View on GitHub
[EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
☆155Dec 22, 2025Updated 7 months ago
nguyentthong / CrossSummOptimalTransport
View on GitHub
☆24May 19, 2023Updated 3 years ago
HYU-NLP / Hyper-CL
View on GitHub
Official Repository for "Hyper-CL: Conditioning Sentence Representations with Hypernetworks"
☆16Jun 3, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
12wang3 / mllp
View on GitHub
The code of AAAI 2020 paper "Transparent Classification with Multilayer Logical Perceptrons and Random Binarization".
☆23Mar 10, 2024Updated 2 years ago
DanielSc4 / Dynamic-Activation-Composition
View on GitHub
Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"
☆14Nov 22, 2024Updated last year
CalculatedContent / ww-trends-2020
View on GitHub
☆18Mar 25, 2021Updated 5 years ago
yasumasaonoe / ET4EL
View on GitHub
☆22Feb 14, 2023Updated 3 years ago
Slide-extractor-beta / slide-extractor
View on GitHub
An easy tool to extract slides from presentations ( lectures 😉 )
☆14Dec 10, 2023Updated 2 years ago
seitlab / Glint
View on GitHub
☆10Oct 25, 2024Updated last year
catlover627 / neko-atsume-analysis
View on GitHub
☆22Jun 16, 2025Updated last year
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12Mar 18, 2023Updated 3 years ago
johnnie193 / Open-Theatre
View on GitHub
Open-Theatre: An Open-Source Toolkit for LLM-based Interactive Drama
☆28Oct 20, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Zhudongsheng75 / Divide-Then-Aggregate
View on GitHub
(ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation
☆12May 21, 2025Updated last year
Red-Hat-AI-Innovation-Team / mini_trainer
View on GitHub
fast trainer for educational purposes
☆26Updated this week
kaistAI / factual-knowledge-acquisition
View on GitHub
☆25Dec 12, 2025Updated 7 months ago
TsinghuaAI / TDS
View on GitHub
A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline
☆25Apr 16, 2021Updated 5 years ago
nsfzyzz / loss_landscape_taxonomy
View on GitHub
[NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228
☆20Jan 7, 2022Updated 4 years ago
zx1239856 / UndergradProjects
View on GitHub
Collections of Undergraduate Course Projects
☆22Jul 17, 2026Updated last week
Glaciohound / LM-Infinite
View on GitHub
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆153Mar 13, 2025Updated last year