MaLeLabTs / RegexGeneratorLinks
This project contains the source code of a tool for generating regular expressions for text extraction: 1. automatically, 2. based only on examples of the desired behavior, 3. without any external hint about how the target regex should look like
☆951Updated 4 years ago
Alternatives and similar repositories for RegexGenerator
Users that are interested in RegexGenerator are comparing it to the libraries listed below
Sorting:
- Code for the paper Neural Generation of Regular Expressions from Natural Language with Minimal Domain Knowledge (EMNLP 2016). http://arxi…☆430Updated 8 years ago
- A small program to detect gibberish using a Markov Chain☆603Updated last year
- Java API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeen’s Department of Computing Sci…☆817Updated 6 months ago
- Learning framework for program property prediction☆217Updated 3 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump☆253Updated last year
- Work in progress transmit from Google Code☆1,116Updated 7 years ago
- DeepDive☆1,965Updated 2 years ago
- Fact Extraction from Wikipedia Text☆535Updated 9 years ago
- TextTeaser is an automatic summarization algorithm.☆1,979Updated 7 years ago
- DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.☆758Updated 7 years ago
- Semantic Parser with Execution☆835Updated 2 years ago
- Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages☆542Updated 3 years ago
- Just the facts -- web page content extraction☆1,266Updated 11 months ago
- The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploit…☆743Updated 6 years ago
- A toolkit for making domain-specific probabilistic parsers☆802Updated 8 months ago
- A library for reading text files over multiple cores.☆1,055Updated last year
- Chrome extension: Gives Ctrl+F like find results which include non-exact (fuzzy) matches using string edit-distance and GloVe/Word2Vec. A…☆137Updated 4 years ago
- MITIE: library and tools for information extraction☆2,942Updated 5 months ago
- SLING - A natural language frame semantics parser☆1,932Updated 4 years ago
- Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statisti…☆1,086Updated last year
- Web-Scale Open Information Extraction☆543Updated 6 years ago
- Log-based transactional graph engine☆1,146Updated 7 months ago
- A tool to create animated graph visualizations, based on graphviz.☆496Updated last year
- Official version of TextTeaser.☆624Updated 6 years ago
- Web Content Extraction Through Machine Learning☆185Updated 11 years ago
- Sandboxed Execution Environment☆819Updated 4 years ago
- Deprecated in favor of https://github.com/facebook/duckling☆1,325Updated 6 years ago
- Index URLs in Common Crawl☆194Updated 7 years ago
- Creates github index for similar repositories discovery☆192Updated 8 years ago
- Tree edit distance using the Zhang Shasha algorithm☆449Updated 4 years ago