NLP4L / nlp4l
Provides a preprocessing platform for Lucene indexing and comprehensive Learning-to-Rank modules
☆13Updated 6 years ago
Related projects:
- A RankLib-based Solr Learning to Rank plugin☆29Updated 2 years ago
- An open relation extraction system☆46Updated 2 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆84Updated 3 years ago
- ☆18Updated 8 years ago
- Hadoop tools for manipulating ClueWeb collections☆26Updated 8 years ago
- Automatically exported from code.google.com/p/jforests☆67Updated 3 years ago
- DKPro C4CorpusTools is a collection of tools for processing the CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆49Updated 4 years ago
- Search relevance evaluation toolkit☆73Updated 2 years ago
- ☆16Updated 3 years ago
- Improving the effectiveness of Lucene's BM25 (and testing it using Yahoo! Answers and Stack Overflow collections)☆16Updated 2 years ago
- Vector search in Lucene-based search, attempting to use just the existing Lucene data structures (experimental)☆43Updated 4 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 8 years ago
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents☆14Updated 5 years ago
- Word and text similarity measures☆53Updated 2 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆70Updated 4 years ago
- Extractors whose input is a chunked sentence. Includes Relnoun, Nesty, and a scala interface for ReVerb.☆28Updated 6 years ago
- Labeled examples from wiki dumps in Python☆68Updated 8 years ago
- A Java framework to build a semantics-aware autoencoder neural network from a knowledge graph.☆13Updated 6 years ago
- Search a single field with different query time analyzers in Solr☆25Updated 4 years ago
- Automatically exported from code.google.com/p/deepsyntacticparsing☆23Updated 9 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆80Updated 6 years ago
- Hardened fork of the RankLib learning-to-rank library☆43Updated last year
- Will store links to known evaluation datasets alongside stats to characterize them☆24Updated 8 years ago
- A Java Wikipedia markup to plain text converter☆37Updated 2 years ago
- Implicit relation extractor using a natural language model.☆25Updated 6 years ago
- Framework for doing NER and other types of entity recognition, in Python☆68Updated 2 years ago
- ☆20Updated 6 years ago
- A tool for calculating semantic similarity between words from a text corpus based on lexico-syntactic patterns.☆28Updated 8 years ago
- Symmetrized word alignment models, based on mgizapp and GIZA++☆15Updated 10 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 3 years ago