mddunlap924 / PII-Detection
Personally Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation
☆31, updated last year
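As context for the listing above, a minimal regex-based sketch of the kind of PII entity detection this repository describes. The patterns, labels, and `detect_pii` function here are illustrative assumptions, not the repository's actual implementation, which pairs entity detection with synthetic data generation:

```python
import re

# Illustrative regex patterns for a few common PII entity types.
# These are simplified assumptions, not the repository's actual rules.
PII_PATTERNS = {
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
    "PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def detect_pii(text):
    """Return (label, start, end, matched_text) tuples for each PII span found."""
    entities = []
    for label, pattern in PII_PATTERNS.items():
        for m in pattern.finditer(text):
            entities.append((label, m.start(), m.end(), m.group()))
    # Sort by position in the text so output reads left to right.
    return sorted(entities, key=lambda e: e[1])

sample = "Contact jane.doe@example.com or 555-123-4567."
for label, start, end, span in detect_pii(sample):
    print(label, span)
```

Production systems typically replace the regexes with a trained NER model; the synthetic-data angle of the repository is about generating labeled examples to improve such a model.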
Alternatives and similar repositories for PII-Detection
Users interested in PII-Detection are comparing it to the repositories listed below.
- A method for steering LLMs to better follow instructions (☆56, updated 3 months ago)
- ☆48, updated last year
- ☆43, updated last year
- ☆96, updated 7 months ago
- ☆79, updated 9 months ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems (☆133, updated 2 weeks ago)
- Collection of resources for fine-tuning Large Language Models (LLMs) (☆103, updated 9 months ago)
- Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs (☆94, updated 11 months ago)
- LangFair is a Python library for conducting use-case-level LLM bias and fairness assessments (☆240, updated 2 weeks ago)
- Ranking LLMs on agentic tasks (☆198, updated 2 months ago)
- A curated list of materials on AI guardrails (☆42, updated 5 months ago)
- All code examples in the blog posts (☆21, updated 9 months ago)
- ☆64, updated 7 months ago
- Security threats related to MCP (Model Context Protocol), MCP servers, and more (☆38, updated 6 months ago)
- This is the repo for the LegalBench-RAG paper: https://arxiv.org/abs/2408.10343 (☆136, updated 5 months ago)
- Blueprint for federated fine-tuning, enabling multiple data owners to collaboratively fine-tune models without sharing raw data. Developed… (☆36, updated 3 months ago)
- Codebase accompanying the Summary of a Haystack paper (☆79, updated last year)
- AlphaXIV open-source alternative: chat with any arXiv paper (☆88, updated 5 months ago)
- Official repo for the paper "PHUDGE: Phi-3 as Scalable Judge". Evaluate your LLMs with or without custom rubric, reference answer, absolute… (☆50, updated last year)
- MCP-based Agent Deep Evaluation System (☆136, updated last month)
- Official repo for CRMArena and CRMArena-Pro (☆121, updated 4 months ago)
- LangChain, Llama2-Chat, and zero- and few-shot prompting are used to generate synthetic datasets for IR and RAG system evaluation (☆37, updated last year)
- Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs (☆295, updated last year)
- ☆101, updated last year
- Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming" (☆49, updated last year)
- Official code repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools" (☆172, updated 2 weeks ago)
- ☆62, updated 2 weeks ago
- Test LLMs against jailbreaks and unprecedented harms (☆36, updated last year)
- A simple evaluation of generative language models and safety classifiers (☆72, updated 2 weeks ago)
- An open-source compliance-centered evaluation framework for generative AI models (☆170, updated this week)