TaiMingLu/know-dont-tell

☆19

Alternatives and similar repositories for know-dont-tell

Users who are interested in know-dont-tell are comparing it to the repositories listed below.

PositionalHidden / PositionalHidden
To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …
☆12 · Jun 18, 2024 · Updated 2 years ago
p1nksnow / MoICE
Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (accepted at NeurIPS 2024)
☆14 · Jan 7, 2025 · Updated last year
tau-nlp / zero_scrolls
Running inference on the ZeroSCROLLS benchmark
☆22 · Apr 18, 2024 · Updated 2 years ago
RickySkywalker / LeanOfThought-Official
This is the official implementation for MA-LoT.
☆20 · Aug 4, 2025 · Updated 11 months ago
princeton-nlp / ProLong
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
☆261 · Sep 12, 2025 · Updated 10 months ago
tianyi-lab / R2-T2
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19 · Mar 10, 2025 · Updated last year
HKUST-KnowComp / IntentionQA
Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …
☆12 · Apr 27, 2024 · Updated 2 years ago
TalnUPF / ConceptExtraction
☆11 · Aug 15, 2023 · Updated 2 years ago
Zhaoyi-Li21 / creme
[ACL 2024] "Understanding and Patching Compositional Reasoning in LLMs"
☆14 · Aug 28, 2024 · Updated last year
matchten / LoRA-Models-for-SAEs
Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"
☆17 · Mar 31, 2025 · Updated last year
princeton-pli / LongProc
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
☆36 · Feb 26, 2026 · Updated 5 months ago
BryceZhuo / HybridNorm
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19 · Mar 7, 2025 · Updated last year
October2001 / ProLong
[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆61 · Jul 23, 2024 · Updated 2 years ago
uservan / ThinkPO
☆17 · Aug 1, 2025 · Updated 11 months ago
RL10x / RetNet
An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf
☆11 · Jul 25, 2023 · Updated 3 years ago
GaoxiangLuo / LLM-BioMed-NER-RE
[npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction
☆13 · May 1, 2024 · Updated 2 years ago
adam-younes / calculator
A graphing calculator written in C.
☆15 · Oct 17, 2023 · Updated 2 years ago
fedebotu / NeurIPS2022-OpenReviewData
Crawl & Visualize NeurIPS 2022 Data from OpenReview
☆14 · Nov 8, 2022 · Updated 3 years ago
WowCZ / LongMIT
LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets
☆43 · Sep 30, 2024 · Updated last year
MiniXC / opensubtitles-dataloader
Loads the OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with PyTorch.
☆13 · Aug 26, 2020 · Updated 5 years ago
smilelight / lightSpider
lightsmile's personal mini general-purpose crawler framework for scraping publicly available corpus data from the web.
☆13 · Sep 30, 2020 · Updated 5 years ago
zhiyuanhubj / LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
☆79 · Oct 16, 2024 · Updated last year
CityU-AIM-Group / PRR-Imbalance
[TMI'22] Personalized Retrogress-Resilient Federated Learning Towards Imbalanced Medical Data
☆15 · Jul 20, 2022 · Updated 4 years ago
wesg52 / llm-context-neurons
Find context neurons in Pythia models.
☆13 · Jun 13, 2023 · Updated 3 years ago
Pi-Star-Lab / csce642-deepRL
Assignments of CSCE-642: Deep Reinforcement Learning offered at Texas A&M University.
☆10 · Aug 31, 2025 · Updated 10 months ago
zdou0830 / crosslingual_summarization_semantic
☆10 · Jun 13, 2020 · Updated 6 years ago
Shen-Lab / Bayesian-L2O
[ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…
☆14 · Aug 19, 2022 · Updated 3 years ago
assafbk / DeciMamba
DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)
☆32 · Apr 9, 2025 · Updated last year
GDPlumb / ExpO
Explanation Optimization
☆13 · Oct 16, 2020 · Updated 5 years ago
cordercorder / nmt-multi
Codebase for multilingual neural machine translation
☆13 · Nov 24, 2022 · Updated 3 years ago
greg-kennedy / p5-NRL-TextToPhoneme
Perl implementation of the Naval Research Laboratory text-to-phoneme algorithm, described by Elovitz et al. (1976)
☆17 · May 7, 2020 · Updated 6 years ago
declare-lab / safety-arithmetic
☆13 · Jan 14, 2025 · Updated last year
wzq016 / PINE
Official repo of the paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"
☆23 · Jun 13, 2025 · Updated last year
yangarbiter / rare-spurious-correlation
Understanding Rare Spurious Correlations in Neural Network
☆12 · Jun 5, 2022 · Updated 4 years ago
zlin7 / LVD
Locally Valid and Discriminative Prediction Intervals for Deep Learning Models
☆13 · May 22, 2023 · Updated 3 years ago
IGITUGraz / SparseAdversarialTraining
Code for "Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling" [ICML 2021]
☆10 · Mar 14, 2022 · Updated 4 years ago
nikvaessen / Rethinking-Binarized-Neural-Network-Optimization
Reproduction of "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization" for the Reproducibility challenge@NeurIPS…
☆11 · Jan 14, 2020 · Updated 6 years ago
OpenMOSS / Lorsa
☆30 · Nov 9, 2025 · Updated 8 months ago
Puhao / cc98
cc98 crawler
☆15 · Sep 1, 2013 · Updated 12 years ago