MaLeLabTs / RegexGenerator
This project contains the source code of a tool for generating regular expressions for text extraction: 1. automatically, 2. based only on examples of the desired behavior, 3. without any external hint about what the target regex should look like.
☆953 Updated 5 years ago
Alternatives and similar repositories for RegexGenerator
Users who are interested in RegexGenerator are comparing it to the libraries listed below.
- Code for the paper Neural Generation of Regular Expressions from Natural Language with Minimal Domain Knowledge (EMNLP 2016). http://arxi… ☆430 Updated 8 years ago
- Official version of TextTeaser. ☆627 Updated 7 years ago
- Natural Language Engine on WikiData ☆436 Updated 9 years ago
- Autocomplete - an adult and kid friendly exercise in creating a predictive program ☆451 Updated 3 years ago
- A small program to detect gibberish using a Markov Chain ☆604 Updated last year
- Chrome extension: Gives Ctrl+F like find results which include non-exact (fuzzy) matches using string edit-distance and GloVe/Word2Vec. A… ☆137 Updated 5 years ago
- Creates github index for similar repositories discovery ☆192 Updated 9 years ago
- A python implementation of the Rapid Automatic Keyword Extraction ☆982 Updated 5 years ago
- Extract data from websites using basic statistical magic ☆505 Updated 5 years ago
- ☆185 Updated 6 years ago
- A toolkit for making domain-specific probabilistic parsers ☆806 Updated last year
- Record web requests as they happen and turn them into reusable code in any programming language. ☆511 Updated 9 years ago
- Fact Extraction from Wikipedia Text ☆538 Updated 9 years ago
- displaCy.js: An open-source NLP visualiser for the modern web ☆344 Updated 7 years ago
- Index URLs in Common Crawl ☆196 Updated 8 years ago
- TextTeaser is an automatic summarization algorithm. ☆1,979 Updated 7 years ago
- Language-agnostic pretty-printing through machine learning (uh, like, is this possible? YES, apparently). ☆473 Updated 4 months ago
- Accurately generate all possible forms of an English word, e.g. "election" --> "elect", "electoral", "electorate" etc. ☆632 Updated 4 years ago
- The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploit… ☆744 Updated 6 years ago
- Python implementation of TextRank algorithm for automatic keyword extraction and summarization using Levenshtein distance as relation bet… ☆791 Updated 3 years ago
- Automatic text summarization ☆243 Updated 7 years ago
- Just the facts -- web page content extraction ☆1,274 Updated 4 months ago
- Keshif - Data Made Explorable (Prototype) ☆457 Updated 8 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆378 Updated 2 years ago
- An exercise in unsupervised machine learning: Extract Article's Text in HTML documents. ☆432 Updated last year
- An interactive map of Stack Exchange tags for all sites. ☆126 Updated 2 years ago
- A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University. ☆363 Updated 2 years ago
- Heuristic based boilerplate removal tool ☆803 Updated 8 months ago
- Compact Language Detector 2 ☆882 Updated 4 years ago
- A fast and friendly PDF scraping library. ☆782 Updated 2 years ago