☆42May 23, 2023Updated 2 years ago
Alternatives and similar repositories for LM-Extraction
Users that are interested in LM-Extraction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆304May 11, 2026Updated last week
- A toolkit to assess data privacy in LLMs (under development)☆73Jan 2, 2025Updated last year
- ☆13Oct 20, 2022Updated 3 years ago
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Feb 9, 2023Updated 3 years ago
- A human-annotated, fine-grained dataset for Vision-and-Language Navigation☆17Jan 20, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Baseline for REVERIE-Challenge using HOP☆10Jul 4, 2022Updated 3 years ago
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆85Oct 23, 2024Updated last year
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)☆65Jan 11, 2025Updated last year
- ☆13Jul 25, 2023Updated 2 years ago
- [ArXiv 2025] Denial-of-Service Poisoning Attacks on Large Language Models☆23Oct 22, 2024Updated last year
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated last year
- End-to-end codebase for finetuning LLMs (LLaMA 2, 3, etc.) with or without DP☆17Sep 23, 2024Updated last year
- Official code for "On Calibrating Diffusion Probabilistic Models"☆30Feb 22, 2023Updated 3 years ago
- interesting & promising & widely adopted tricks for SOTA performance in machine learning community.☆15Apr 13, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.☆91May 19, 2024Updated 2 years ago
- Benchmarking MIAs against LLMs.☆28Oct 8, 2024Updated last year
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆27Oct 31, 2022Updated 3 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated last year
- Training data extraction on GPT-2☆194Feb 4, 2023Updated 3 years ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 4 months ago
- Graph Diffusion Policy Optimization☆43Mar 17, 2024Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆23May 1, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888☆37Jun 10, 2024Updated last year
- ☆20Oct 28, 2025Updated 6 months ago
- [ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models☆49Dec 4, 2024Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms☆59Aug 15, 2023Updated 2 years ago
- ☆41Dec 19, 2024Updated last year
- ☆19Mar 19, 2023Updated 3 years ago
- ☆20Apr 16, 2025Updated last year
- Official implementation of AdvPrompter https//arxiv.org/abs/2404.16873☆181May 6, 2024Updated 2 years ago
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆12Dec 13, 2023Updated 2 years ago
- Codebase for decoding compressed trust.☆27May 7, 2024Updated 2 years ago
- https://icml.cc/virtual/2023/poster/24354☆10Aug 15, 2023Updated 2 years ago
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆243Nov 3, 2023Updated 2 years ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23May 8, 2023Updated 3 years ago
- Code for a research paper "Part-Based Models Improve Adversarial Robustness" (ICLR 2023)☆20Sep 16, 2023Updated 2 years ago
- Download, parse, and filter data PubMed, data-ready for The-Pile☆23Dec 16, 2021Updated 4 years ago