☆91Jun 2, 2016Updated 9 years ago
Alternatives and similar repositories for TextMaps
Users that are interested in TextMaps are comparing it to the libraries listed below.
- Source code for the paper "Web2Text: Deep Structured Boilerplate Removal", full paper @ ECIR'18☆170Oct 28, 2021Updated 4 years ago
- Intelligent Web Data Extractor☆74Dec 5, 2022Updated 3 years ago
- An Information Extraction Framework with Deep Learning developed at New York University☆15Oct 27, 2016Updated 9 years ago
- Web content extraction using machine learning☆34Mar 3, 2021Updated 5 years ago
- AI based web-wrapper for web-content-extraction☆102Feb 6, 2023Updated 3 years ago
- Blog crawler for the blogforever project.☆23Jan 31, 2014Updated 12 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆11Jan 27, 2025Updated last year
- Amazon.com price check, item description & review, and more☆22Mar 2, 2011Updated 15 years ago
- ☆11Oct 10, 2017Updated 8 years ago
- NER toolkit for HTML data☆259May 3, 2024Updated last year
- A twitter bot that applies a random neural style transfer to a random featured photo from Unsplash.☆14Oct 6, 2019Updated 6 years ago
- Extract cyber security entities from unstructured text☆34Apr 24, 2017Updated 8 years ago
- RWA recurrent neural networks☆17Apr 14, 2017Updated 8 years ago
- Framework for evaluating text extraction algorithms implemented as web services☆42Jun 30, 2012Updated 13 years ago
- Python implementation of the Parsley language for extracting structured data from web pages☆92Oct 26, 2017Updated 8 years ago
- Machine Learning Course (INFO 697-03)☆18Dec 3, 2024Updated last year
- ☆18Jun 24, 2017Updated 8 years ago
- [[ ARCHIVED ]] gann(go-approximate-nearest-neighbor) is a library for Approximate Nearest Neighbor Search written in Go☆76Sep 15, 2020Updated 5 years ago
- ☆10Jun 12, 2023Updated 2 years ago
- Hierarchical Universal Modular ANotator☆12Mar 6, 2026Updated 2 weeks ago
- Web Service wrapper for accessing the AmbiverseNLU KG stored in Neo4j☆12Nov 16, 2022Updated 3 years ago
- My implementation of LASER architecture in Fairseq☆12Oct 6, 2020Updated 5 years ago
- Prodigy thing(z)☆12Mar 22, 2018Updated 8 years ago
- C++ Memory allocator for packet queues that free() in roughly the same order that they alloc().☆16Mar 15, 2018Updated 8 years ago
- API - extract a list of keywords from a text.☆18Jul 6, 2017Updated 8 years ago
- ☆22Dec 20, 2019Updated 6 years ago
- This project experiments with the Google NLP Algorithm to evaluate e-commerce product descriptions from an SEO perspective.☆18Jul 2, 2020Updated 5 years ago
- NMT based punctuation prediction system using lexical and acoustic features.☆14Mar 30, 2020Updated 5 years ago
- Library for fast text representation and classification.☆10Apr 17, 2022Updated 3 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 8 years ago
- The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)☆224Dec 22, 2022Updated 3 years ago
- Scrapy middleware for the autologin☆37Feb 10, 2026Updated last month
- Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution"☆655Apr 2, 2023Updated 2 years ago
- Detect duplicated items. A content deduplication framework.☆11Apr 30, 2015Updated 10 years ago
- Keras implementation of CoVe☆50Sep 17, 2018Updated 7 years ago
- Tianchi competition☆10Jul 4, 2021Updated 4 years ago
- Extract relationships between cyber security entities within unstructured text☆24Sep 28, 2018Updated 7 years ago
- A self-hostable rich text editor☆15Sep 23, 2024Updated last year
- Data and all☆14Sep 30, 2019Updated 6 years ago