MaLeLabTs / RegexGenerator
This project contains the source code of a tool for generating regular expressions for text extraction: 1. automatically, 2. based only on examples of the desired behavior, 3. without any external hint about how the target regex should look like
☆948Updated 4 years ago
Alternatives and similar repositories for RegexGenerator:
Users that are interested in RegexGenerator are comparing it to the libraries listed below
- Code for the paper Neural Generation of Regular Expressions from Natural Language with Minimal Domain Knowledge (EMNLP 2016). http://arxi…☆429Updated 7 years ago
- SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm☆3,204Updated 2 weeks ago
- A toolkit for making domain-specific probabilistic parsers☆799Updated 4 months ago
- A python implementation of the Rapid Automatic Keyword Extraction☆971Updated 4 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web☆199Updated 6 years ago
- 🦆 Contextually-keyed word vectors☆1,638Updated 11 months ago
- extract text from any document. no muss. no fuss.☆3,972Updated 2 months ago
- Extract data from websites using basic statistical magic☆505Updated 4 years ago
- A small program to detect gibberish using a Markov Chain☆603Updated last year
- Python implementation of TextRank algorithm for automatic keyword extraction and summarization using Levenshtein distance as relation bet…☆774Updated 2 years ago
- Official version of TextTeaser.☆622Updated 6 years ago
- Learning framework for program property prediction☆217Updated 3 years ago
- The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploit…☆741Updated 5 years ago
- displaCy.js: An open-source NLP visualiser for the modern web☆344Updated 6 years ago
- Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.☆1,064Updated 2 years ago
- Just the facts -- web page content extraction☆1,258Updated 7 months ago
- Creates github index for similar repositories discovery☆192Updated 8 years ago
- Natural Language Engine on WikiData☆436Updated 8 years ago
- Fact Extraction from Wikipedia Text☆530Updated 8 years ago
- Generating Vectors for DBpedia Entities via Word2Vec and Wikipedia Dumps. Questions? https://gitter.im/idio-opensource/Lobby☆600Updated 7 years ago
- Train a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jo…☆257Updated 5 years ago
- Summarizes news articles☆1,165Updated 3 years ago
- Multilingual word vectors in 78 languages☆1,195Updated last year
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,097Updated last month
- Visualization Tool for Data Exploration☆1,457Updated last year
- a python library for parsing unstructured western names into name components.☆599Updated 3 months ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆1,842Updated 7 months ago
- Multilingual text (NLP) processing toolkit☆2,322Updated last year
- Automatic Web Article Summarizer☆414Updated 3 years ago
- MITIE: library and tools for information extraction☆2,932Updated last month