thu-coai / Backdoor-Data-ExtractionLinks
☆21Updated last month
Alternatives and similar repositories for Backdoor-Data-Extraction
Users that are interested in Backdoor-Data-Extraction are comparing it to the libraries listed below
Sorting:
- ☆9Updated last year
- Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique☆17Updated 10 months ago
- ☆13Updated 6 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆12Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 11 months ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆32Updated last year
- ☆46Updated 9 months ago
- ☆34Updated 7 months ago
- ☆45Updated last month
- ☆50Updated 3 weeks ago
- ☆24Updated 5 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆45Updated 7 months ago
- Verifiers for LLM Reinforcement Learning☆61Updated 2 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 2 months ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆12Updated 6 months ago
- ☆20Updated 3 months ago
- ☆51Updated 7 months ago
- Automated Safety Testing of Large Language Models☆15Updated 4 months ago
- The official implementation of Preference Data Reward-Augmentation.☆17Updated last month
- ☆41Updated 6 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Official Code Release for "Training a Generally Curious Agent"☆25Updated last month
- ☆31Updated 3 months ago
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"☆109Updated 3 weeks ago
- Reproducible Language Agent Research☆27Updated 3 months ago
- ☆24Updated 9 months ago
- ☆65Updated 2 months ago
- A curated list of materials on AI guardails☆38Updated 3 weeks ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆35Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 9 months ago