MWiechmann / enron_spam_dataLinks
The Enron-Spam dataset preprocessed in a single, clean csv file.
☆60Updated 4 years ago
Alternatives and similar repositories for enron_spam_data
Users that are interested in enron_spam_data are comparing it to the libraries listed below
Sorting:
- Phishing dataset with more than 88,000 instances and 111 features. Web application available at. https://gregavrbancic.github.io/Phishing…☆68Updated 2 years ago
- This project consists of advanced phishing detection using the BERT masked language model.☆27Updated 2 years ago
- LLM for Email Spam Detection☆122Updated 2 years ago
- Training and testing of linguistic passwords models.☆27Updated last year
- Fine-tuning of Flan-5T LLM for text classification 🤖 focuses on adapting a state-of-the-art language model to enhance its ability to cla…☆44Updated last year
- An environment simulation for networks security tasks for development and testing AI based agents. Part of AI Dojo project☆57Updated 2 weeks ago
- PyTorch/HuggingFace Implementation of URLTran: Improving Phishing URL Detection Using Transformers☆37Updated 3 years ago
- Classify data instantly using an LLM☆278Updated last year
- NLP model and tech for cyber security tasks☆86Updated 2 years ago
- LangChain, Llama2-Chat, and zero- and few-shot prompting are used to generate synthetic datasets for IR and RAG system evaluation☆39Updated 2 years ago
- Sentiment analysis in Pytorch on an IMDb dataset.☆68Updated 3 years ago
- Official implementation of the paper "Deep Learning for Hate Speech Detection -A Comparative Study"☆39Updated 4 years ago
- ☆110Updated 2 years ago
- Experiments for automated personality detection using Language Models and psycholinguistic features on various famous personality dataset…☆205Updated 11 months ago
- ☆33Updated last year
- URL phishing detection using Generative Adversarial Network (GAN)☆16Updated 3 years ago
- ☆21Updated 4 years ago
- Creating class-based TF-IDF matrices☆91Updated 3 years ago
- This PyTorch implementation of LayoutLM paper by Microsoft demonstrate the SequenceClassfication task using HuggingFaceTransformers to cl…☆35Updated 3 years ago
- Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with L…☆40Updated 2 years ago
- Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, D…☆97Updated 2 months ago
- pretrained BERT model for cyber security text, learned CyberSecurity Knowledge☆206Updated 2 years ago
- This repository is dedicated to summarizing papers related to large language models with the field of law☆281Updated 3 weeks ago
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆95Updated last month
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆146Updated last year
- A collection of the the best ML and AI news every week (research, news, resources)☆174Updated 6 months ago
- Social Media Mining Toolkit (SMMT) main repository☆136Updated 3 years ago
- Text Summarization for Research Papers☆79Updated 3 years ago
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆376Updated 10 months ago
- Code for the paper URLNet - Learning a URL Representation with Deep Learning for Malicious URL Detection☆173Updated 5 years ago