MaLeLabTs/RegexGenerator

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MaLeLabTs/RegexGenerator)

MaLeLabTs / RegexGenerator

This project contains the source code of a tool for generating regular expressions for text extraction: 1. automatically, 2. based only on examples of the desired behavior, 3. without any external hint about how the target regex should look like

☆953

Alternatives and similar repositories for RegexGenerator

Users that are interested in RegexGenerator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nicholaslocascio / deep-regex
View on GitHub
Code for the paper Neural Generation of Regular Expressions from Natural Language with Minimal Domain Knowledge (EMNLP 2016). http://arxi…
☆428May 31, 2017Updated 9 years ago
CogComp / illinois-sl
View on GitHub
A general-purpose Java library for performing structured learning.
☆23Jul 5, 2022Updated 4 years ago
devongovett / regexgen
View on GitHub
Generate regular expressions that match a set of strings
☆3,426Feb 15, 2024Updated 2 years ago
sean-chester / generalised-brown
View on GitHub
C++ implementation of Generalised Brown clustering and python scripts for feature generation
☆41Apr 8, 2016Updated 10 years ago
julianthome / autorex
View on GitHub
A dk.brics FSM to regular-expression-string converter
☆10Jul 12, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
interrogator / risk
View on GitHub
NYT Risk Semantics Project
☆12Mar 5, 2016Updated 10 years ago
cs-au-dk / dk.brics.automaton
View on GitHub
dk.brics.automaton - finite-state automata and regular expressions for Java
☆246Sep 14, 2025Updated 10 months ago
nathanmerrill / wordsbysyllables
View on GitHub
Contains common english words categorized by syllables
☆14Mar 4, 2015Updated 11 years ago
chokkan / simstring
View on GitHub
SimString
☆114May 16, 2021Updated 5 years ago
bwagner / wordhierarchy
View on GitHub
This project provides a word hierarchy builder. It builds a tree out of a set of words which can then be navigated by a WordProcessor to …
☆20Nov 10, 2024Updated last year
semanticize / st
View on GitHub
Semanticizest: dump parser and client
☆20May 11, 2016Updated 10 years ago
libfirm / bytecode2firm
View on GitHub
Convert Java bytecode to firm IR
☆18Feb 20, 2017Updated 9 years ago
hohoCode / cgx
View on GitHub
UltraFast GPU Grammar eXtractor for Machine Translation (He et al., TACL 2015 & NAACL 2013)
☆12Jun 19, 2015Updated 11 years ago
KIZI / LinkedHypernymsDataset
View on GitHub
☆14Aug 24, 2021Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Data2Semantics / nodes
View on GitHub
A general purpose graph library
☆11Jun 21, 2018Updated 8 years ago
shilad / macademia
View on GitHub
The Macademia website visualizes connections between research interests: http://macademia.macalester.edu
☆15Sep 18, 2015Updated 10 years ago
vi3k6i5 / flashtext
View on GitHub
Extract Keywords from sentence or Replace keywords in sentences.
☆5,716Apr 13, 2025Updated last year
j-magnolia / datafsm
View on GitHub
Learning Finite State Machine Models from Data with a Genetic Algorithm
☆11Dec 1, 2025Updated 7 months ago
facebookresearch / fastText
View on GitHub
Library for fast text representation and classification.
☆26,552Mar 22, 2024Updated 2 years ago
google / re2j
View on GitHub
linear time regular expression matching in Java
☆1,255May 22, 2026Updated last month
OpenRefine / OpenRefine
View on GitHub
OpenRefine is a free, open source power tool for working with messy data and improving it
☆11,917Updated this week
LearnLib / automatalib
View on GitHub
A free, open-source Java library for automata, graphs, and transition systems
☆102Updated this week
adiyoss / StructED
View on GitHub
Risk Minimization Algorithms in Structured Prediction (JMLR 2016)
☆13Jan 26, 2017Updated 9 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ag-sc / lemon.dbpedia
View on GitHub
lemon lexicon for DBpedia
☆28Oct 13, 2015Updated 10 years ago
coastalcph / rungsted
View on GitHub
Fast structured perceptron sequential labeler
☆15Dec 8, 2015Updated 10 years ago
velvia / cassandra-gdelt
View on GitHub
Experiments with the GDELT dataset and Cassandra schemas.
☆25Feb 9, 2016Updated 10 years ago
edefazio / varcode
View on GitHub
Generate, compile and run .java source dynamically at runtime
☆11Apr 23, 2019Updated 7 years ago
dowobeha / Gale_and_Church_1993
View on GitHub
Bilingual sentence aligner (Gale & Church, 1993)
☆14Jan 8, 2026Updated 6 months ago
leondz / entity_recognition
View on GitHub
framework for doing NER and other types of entity recognition, in Python
☆68Jun 21, 2022Updated 4 years ago
knowitall / openregex
View on GitHub
An efficient and flexible token-based regular expression language and engine.
☆76Mar 20, 2014Updated 12 years ago
anuzzolese / oke-challenge
View on GitHub
☆18Jun 24, 2017Updated 9 years ago
lasigeBioTM / MER
View on GitHub
Minimal Named-Entity Recognizer (MER)
☆58Sep 18, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ahungry / sluglisp
View on GitHub
Like Quicklisp, only slower (a web GUI based Quicklisp for searching the projects)
☆14Aug 2, 2017Updated 8 years ago
nes1983 / tree-regex
View on GitHub
A linear regular expression engine that produces parse trees (ASTs).
☆33Dec 30, 2014Updated 11 years ago
doccano / doccano
View on GitHub
Open source annotation tool for machine learning practitioners.
☆10,709Apr 14, 2026Updated 3 months ago
clearnlp / clearnlp
View on GitHub
Fast and robust NLP components implemented in Java.
☆55Oct 13, 2020Updated 5 years ago
oir / deep-recursive
View on GitHub
Implementation of a deep recursive net over binary parse trees (code for NIPS2014 paper)
☆28Feb 6, 2015Updated 11 years ago
fnl / segtok
View on GitHub
Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…
☆170Dec 15, 2021Updated 4 years ago
JonathanRaiman / wikipedia_ner
View on GitHub
Labeled examples from wiki dumps in Python
☆67Aug 8, 2016Updated 9 years ago