nkthiebaut/zeugma

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nkthiebaut/zeugma)

nkthiebaut / zeugma

📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible with scikit-learn Pipelines. 🛠

☆63

Alternatives and similar repositories for zeugma

Users that are interested in zeugma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AntoineSimoulin / pytree
View on GitHub
Implementation of tree-structured neural networks in PyTorch.
☆15Nov 15, 2021Updated 4 years ago
swabhs / notebooks_for_aflite
View on GitHub
IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".
☆16Aug 14, 2020Updated 5 years ago
stephantul / piecelearn
View on GitHub
Learning BPE embeddings by first learning a segmentation model and then training word2vec
☆19Dec 18, 2022Updated 3 years ago
sabithsn / APPDIA-Discourse-Style-Transfer
View on GitHub
Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…
☆13Sep 8, 2022Updated 3 years ago
sjtuprog / fox-news-comments
View on GitHub
annotated hateful speech
☆24Apr 6, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SharlSherif / Reddit-Profile-Scraper
View on GitHub
This project scrapes the entire public history of a Reddit user given their username
☆15Dec 8, 2022Updated 3 years ago
kelichiu / GPT3-hate-speech-detection
View on GitHub
Using GPT-3 to detect hate speech that contains sexist and racist content
☆24Nov 11, 2025Updated 8 months ago
Quantmetry / grand-debat
View on GitHub
Initier la mise à disposition, pour tout citoyen, de techniques d’Intelligence Artificielle destinées à appréhender le nombre important d…
☆11Aug 20, 2024Updated last year
spyysalo / wiki-bert-pipeline
View on GitHub
Generate BERT vocabularies and pretraining examples from Wikipedias
☆17May 11, 2020Updated 6 years ago
Zinxira / tlvmc-parkinsons-fog-prediction-4th-place-solution
View on GitHub
☆11Aug 3, 2023Updated 2 years ago
Kanaries / gw-dsl-parser
View on GitHub
Generate SQL from Graphic Walker visualization DSL
☆13Feb 23, 2024Updated 2 years ago
Cheukting / Style-mimicking-text-generator
View on GitHub
Using word embedding and LSTM to train a Neural Network to generate text mimicking style of the training text
☆13Jun 9, 2018Updated 8 years ago
nstawfik / MedSentEval
View on GitHub
☆11Nov 19, 2020Updated 5 years ago
Yinghao-Li / CHMM-ALT
View on GitHub
Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"
☆32Jun 20, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OrigamiDream / lion-tf
View on GitHub
Lion - EvoLved Sign Momentum w/ New Optimizer API in TensorFlow 2.11+
☆10Feb 16, 2023Updated 3 years ago
GateNLP / semeval2019-hyperpartisan-bertha-von-suttner
View on GitHub
SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution
☆23Aug 15, 2019Updated 6 years ago
BaderLab / Transfer-Learning-BNER-Bioinformatics-2018
View on GitHub
This repository contains supplementary data, and links to the model and corpora used for the paper: Transfer learning for biomedical name…
☆37Mar 5, 2019Updated 7 years ago
VenkteshV / ML_refresher_course_2022
View on GitHub
☆14Dec 30, 2022Updated 3 years ago
AbdualimovTP / datret
View on GitHub
Tensorflow implementation for structured tabular data
☆11Jan 21, 2023Updated 3 years ago
SALT-NLP / mic
View on GitHub
Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"
☆21Jul 18, 2023Updated 3 years ago
alexbrandsen / jsonl2bio
View on GitHub
Script that converts JSONL output from Doccano to the BIO format
☆10Jul 5, 2019Updated 7 years ago
tommasoc80 / AbuseEval
View on GitHub
Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"
☆19Sep 23, 2023Updated 2 years ago
ChristofHenkel / kaggle-birdclef24-3rd-place-solution-dieter
View on GitHub
☆13Jun 24, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
hadarishav / Ruddit
View on GitHub
This repo contains the dataset and description for Ruddit and its variants.
☆36Feb 13, 2022Updated 4 years ago
tomaarsen / SpanMarkerNER
View on GitHub
SpanMarker for Named Entity Recognition
☆477Apr 10, 2026Updated 3 months ago
ryanzhumich / AESLC
View on GitHub
Annotated Enron Subject Line Corpus (AESLC)
☆24Feb 2, 2023Updated 3 years ago
Beomi / exbert-transformers
View on GitHub
exBERT on Transformers🤗
☆10Jun 14, 2021Updated 5 years ago
argilla-io / biome-text
View on GitHub
Custom Natural Language Processing with big and small models 🌲🌱
☆66Sep 8, 2021Updated 4 years ago
HannahKirk / Hatemoji
View on GitHub
Testing and training detection models for emoji-based hate speech.
☆25May 15, 2022Updated 4 years ago
andrecedras / spatial-optimization
View on GitHub
This notebook illustrates the use of the Google Maps API to determine the optimum route given a list of addresses
☆11Nov 26, 2018Updated 7 years ago
Kevin-McIsaac / cmorlet-tensorflow
View on GitHub
A TensorFlow implementation of the Continous Wavelet Transform based on the complex Morlet wavelet.
☆13Aug 26, 2021Updated 4 years ago
shaoxia57 / Bias_in_Gendered_Languages
View on GitHub
This is a repo for the EMNLP 19 Paper on gender bias in gendered languages.
☆23Sep 6, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
brianyiktaktsui / DEEP_NLP
View on GitHub
☆11Aug 14, 2020Updated 5 years ago
vwoloszyn / diaa
View on GitHub
Inter-annotator agreement for Doccano
☆28May 3, 2020Updated 6 years ago
rednafi / fly-fastapi
View on GitHub
Deploying a simple FastAPI app to Fly.io >> https://fly-fastapi.fly.dev/docs <<
☆14Oct 2, 2023Updated 2 years ago
yandexdataschool / gumbel_lstm
View on GitHub
Experiments with binary LSTM using gumbel-sigmoid
☆32May 28, 2020Updated 6 years ago
nikitautiu / learnhtml
View on GitHub
Web content extraction using machine learning
☆34Mar 3, 2021Updated 5 years ago
taspinar / numerical-mooc
View on GitHub
A course in numerical methods with Python for engineers and scientists: currently 5 learning modules, with student assignments.
☆10Dec 6, 2017Updated 8 years ago
alexeyev / glyphnet-pytorch
View on GitHub
Сracking Egyptologist's MNIST: PyTorch implementation of the Glyphnet model introduced in "A Deep Learning Approach to Ancient Egyptian H…
☆17Sep 6, 2022Updated 3 years ago