carrot2 / folder2indexLinks
Creates a Lucene index out of files from a local folder
☆13Updated 11 years ago
Alternatives and similar repositories for folder2index
Users that are interested in folder2index are comparing it to the libraries listed below
Sorting:
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆11Updated last year
- Tools for creating DBpedia Spotlight Lucene Index☆10Updated 3 years ago
- ☆19Updated 3 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆26Updated 7 years ago
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache …☆26Updated last week
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format)☆65Updated 9 years ago
- ☆12Updated 4 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆86Updated 4 years ago
- Zulia Search Engine☆34Updated this week
- opennlp-solr-examples☆10Updated 3 years ago
- NLP framework for JVM languages.☆154Updated 4 years ago
- NEWS: JATE2.0 Beta.11 Released, see details below.☆84Updated 2 years ago
- Apache UIMA Java SDK☆66Updated 3 months ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- KnowledgeStore☆21Updated 7 years ago
- Extract statistics from Wikipedia Dump files.☆26Updated 4 years ago
- Mirror of Apache MetaModel Membrane☆16Updated 6 years ago
- SKOS Support for Apache Lucene and Solr☆56Updated 4 years ago
- A set of hacks to setup a dbpedia endpoint through neo4j☆44Updated 13 years ago
- Source code associated with "Apache Solr Essentials" book☆12Updated last year
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 6 years ago
- D3 and Play based visualization for entity-relation graphs, especially for NLP and information extraction☆30Updated 10 years ago
- Language models are open knowledge graphs ( non official implementation )☆13Updated 5 years ago
- An efficient and flexible token-based regular expression language and engine.☆75Updated 11 years ago
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.☆134Updated 2 years ago
- TeXoo – A Zoo of Text Extractors☆18Updated 5 years ago
- Semantic Web related concepts converted to Natural language☆44Updated 8 years ago
- ☆22Updated last year
- Collaborative Synchronized Corpus Annotation Tool☆11Updated 7 years ago