Must-read papers and blogs about parametric knowledge mechanism in LLMs.
☆39May 9, 2025Updated 11 months ago
Alternatives and similar repositories for Awesome-Parametric-Knowledge-in-LLMs
Users that are interested in Awesome-Parametric-Knowledge-in-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Mar 1, 2025Updated last year
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Apr 30, 2025Updated 11 months ago
- ☆35Jun 13, 2025Updated 10 months ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- fast trainer for educational purposes☆26Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆24May 27, 2025Updated 10 months ago
- The implementation of ICASSP 2024 lecture presentation paper "Hypergraph Transformer for Semi-Supervised Classification"☆20Nov 23, 2024Updated last year
- [NeurIPS2022] Where to Pay Attention in Sparse Training for Feature Selection?☆12Feb 10, 2023Updated 3 years ago
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- [arxiv: 2503.23895] Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement☆177Aug 14, 2025Updated 8 months ago
- Zulip-based bot to respond users by ChatGPT☆18Feb 1, 2026Updated 2 months ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 9 months ago
- 中国科学院大学2023Fall-随机过程课程作业☆12Jan 21, 2024Updated 2 years ago
- Facilitating selective network routing for Ivanti-connected devices to a school's network, using port forwarding for enhanced access cont…☆16Mar 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆41Jul 1, 2025Updated 9 months ago
- ☆15Apr 29, 2025Updated 11 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]☆65Apr 11, 2026Updated last week
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 11 months ago
- ☆27Oct 20, 2021Updated 4 years ago
- ☆22Aug 8, 2025Updated 8 months ago
- 支持多种爬取方式,下载用户相册,爬取用户帖子,爬取实时搜索帖子等,欢迎下载使用和补充功能☆13Sep 24, 2023Updated 2 years ago
- [ICLR 2025] RGB-Event ISP: The Dataset and Benchmark☆21Mar 4, 2026Updated last month
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆52Nov 17, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- SimKO: Simple Pass@K Policy Optimization☆30Oct 24, 2025Updated 5 months ago
- ☆19May 3, 2025Updated 11 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆39May 28, 2024Updated last year
- 一个面向中国学生(尤其受10043政策影响)的香港、澳门、新加坡等地区导师信息库。An open-source database of professors in HK/MO/SG/etc. for Chinese students (esp. those affected…☆40Nov 26, 2025Updated 4 months ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Latex Template for Undergraduate Thesis at School of EECS, Peking University☆42Jun 3, 2022Updated 3 years ago
- ☆17Jun 10, 2025Updated 10 months ago
- A python tool help to interact with chatgpt.☆10Dec 11, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- SePer is an accurate / fast / free-of-API metric to measure document quality via information gain☆31Feb 22, 2026Updated last month
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Oct 25, 2024Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 5 months ago
- Automated pipeline for generating, verifying, and preparing high-quality function calling datasets for model fine-tuning. Reduces manual …☆33Apr 9, 2026Updated last week
- Released codes of the RETIA model.☆19Mar 20, 2024Updated 2 years ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Awesome paper lists for "A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions""☆32Apr 25, 2025Updated 11 months ago