Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation
β36Sep 4, 2024Updated last year
Alternatives and similar repositories for PII-Detection
Users that are interested in PII-Detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β22Jan 13, 2025Updated last year
- π€« Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Conβ¦β54Dec 20, 2023Updated 2 years ago
- β21Jun 12, 2024Updated last year
- EmbedDB is an ultra-lightweight vector database designed for rapid prototyping of semantic search and RAG applications. The entire implemβ¦β21Mar 24, 2025Updated last year
- HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Modelsβ13Mar 6, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β10Oct 1, 2021Updated 4 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numbaβ38Oct 16, 2025Updated 6 months ago
- β30Apr 14, 2025Updated last year
- β12Feb 22, 2023Updated 3 years ago
- β10Nov 12, 2024Updated last year
- An end to end ML project. Using MLflow for experiment tracking and model registry. Prefect for workflow orchestration. S3 for artifacts sβ¦β12Sep 11, 2022Updated 3 years ago
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectivβ¦β76Aug 25, 2025Updated 8 months ago
- Toonification of real face images using PyTorch, Stylegan2 and Image-to-Image translationβ13Jun 14, 2022Updated 3 years ago
- Experiment with NVIDIA Triton and Whisperβ15Apr 29, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code from Chris Valasek @nudehaberdasher and Charlie Miller @0xcharlie car hack: http://blog.ioactive.com/2013/08/car-hacking-content.htβ¦β15Oct 1, 2020Updated 5 years ago
- β14May 12, 2025Updated 11 months ago
- A parser combinator in Ruby, with a pretty DSLβ11Jun 25, 2017Updated 8 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific wayβ18Nov 4, 2025Updated 6 months ago
- Evidence-based tools and community collaboration to end algorithmic bias, one data scientist at a time.β35Oct 29, 2023Updated 2 years ago
- Traffic Light recognition using FasterRCNN in Pytorchβ11Jul 23, 2023Updated 2 years ago
- Training PyTorch Faster-RCNN on custom datasetβ14Jun 2, 2021Updated 4 years ago
- Data Science & Machine Learning Project applied to Healthcareβ16Dec 1, 2021Updated 4 years ago
- A TensorFlow 2.0 with eager execution implementation of Pytorch OpenAI few-shot regression toy exampleβ16Jun 24, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Image Segmentation On Custom Dataset Using YOLOv8β19Jan 12, 2023Updated 3 years ago
- DEFCON 30 Car Hacking Village Presentationβ11Sep 11, 2022Updated 3 years ago
- π§ KoBART summarization using pytorchβ13Jun 7, 2023Updated 2 years ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeedβ21May 27, 2024Updated last year
- encrypt files with golang and also decrypt themβ11Oct 9, 2023Updated 2 years ago
- Python port to the normalizer in https://github.com/twitter/twitter-korean-textβ12Apr 26, 2016Updated 10 years ago
- The project aims to detect ships in million pixels satellite images using different object detection algorithms. This makes use of variouβ¦β15Jun 28, 2020Updated 5 years ago
- [NeurIPS'24] Grammar-Aligned Decoding: An algorithm to constrain LLMs' outputs without distorting its original distributionβ28Feb 10, 2025Updated last year
- LUMIN: Your data analysis companion that turns natural language questions into powerful insights through AI-driven visualizations and cleβ¦β17Nov 11, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- π A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT2 (~95M params). Fast, creative text generation traβ¦β17Apr 17, 2026Updated 2 weeks ago
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization datasetβ14Feb 2, 2025Updated last year
- An AI-powered literature review assistant for researchersβ32Apr 18, 2025Updated last year
- A killer example of how Golang works with Kafka. Used the Sarama (by Shopify) library here.β11Jan 30, 2024Updated 2 years ago
- β24May 7, 2024Updated last year
- β16Mar 23, 2025Updated last year
- This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Modelsβ19Sep 11, 2024Updated last year