[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs
☆52May 26, 2025Updated 9 months ago
Alternatives and similar repositories for mink-plus-plus
Users that are interested in mink-plus-plus are comparing it to the libraries listed below
Sorting:
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆242Nov 3, 2023Updated 2 years ago
- Python package for measuring memorization in LLMs.☆183Jul 16, 2025Updated 7 months ago
- This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat…☆22May 21, 2025Updated 9 months ago
- This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…☆17Oct 25, 2024Updated last year
- The official implementation of the paper "Data Contamination Calibration for Black-box LLMs" (ACL 2024)☆16May 21, 2024Updated last year
- ☆22Dec 22, 2024Updated last year
- HippoMM: Hippocampal-inspired Multimodal Memory☆15May 22, 2025Updated 9 months ago
- ☆13Oct 20, 2022Updated 3 years ago
- ☆18Feb 20, 2024Updated 2 years ago
- Counterexample-Guided Learning of Monotonic Networks☆18May 19, 2022Updated 3 years ago
- The Source Code for OmniVideoBench @ICLR 2026☆61Feb 12, 2026Updated 3 weeks ago
- Code for paper "Membership Inference Attacks Against Vision-Language Models"☆26Jan 25, 2025Updated last year
- Official code for the ICCV2023 paper ``One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training''☆20Aug 9, 2023Updated 2 years ago
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated 10 months ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆20Dec 4, 2024Updated last year
- Official code for the paper "PERL: Pivot-based Domain Adaptation for Pre-trained Deep Contextualized Embedding Models".☆15Dec 8, 2022Updated 3 years ago
- Knowledge distillation (KD) from a decision-based black-box (DB3) teacher without training data.☆22May 3, 2022Updated 3 years ago
- The official codes for the ICCV2021 presentation "Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counti…☆25Oct 25, 2021Updated 4 years ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆29Jul 23, 2021Updated 4 years ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆33Nov 1, 2025Updated 4 months ago
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"☆66Sep 30, 2024Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆128Mar 30, 2024Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆35Mar 12, 2024Updated last year
- ☆25Nov 14, 2022Updated 3 years ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- ☆32Apr 14, 2023Updated 2 years ago
- Code and Data for Our NeurIPS 2024 paper "AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback"☆34Nov 5, 2024Updated last year
- Code for "Taxonomy Adaptive Cross-Domain Adaptation in Medical Imaging via Optimization Trajectory Distillation", ICCV 2023☆16Aug 31, 2023Updated 2 years ago
- ☆38Dec 19, 2024Updated last year
- Official code link for ''Iterative Document Representation Learning Towards Summarization with Polishing''☆60Nov 26, 2018Updated 7 years ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆36Jun 8, 2023Updated 2 years ago
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks☆12Jul 29, 2023Updated 2 years ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆16Feb 15, 2023Updated 3 years ago
- Implementation of a simple linear regression algorithm in MAMBA☆10Feb 12, 2020Updated 6 years ago
- LoFiT: Localized Fine-tuning on LLM Representations☆44Jan 15, 2025Updated last year
- This is the official GDSC repo with all of the source code presented in the video tutorials☆14Jun 27, 2023Updated 2 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- ☆16Feb 27, 2026Updated last week
- ☆10Oct 2, 2024Updated last year