o19s / tmdb_dumpLinks
Dump TheMovieDB
☆27Updated 4 years ago
Alternatives and similar repositories for tmdb_dump
Users that are interested in tmdb_dump are comparing it to the libraries listed below
Sorting:
- Demonstration of searching PDF document with Solr, Tika, and Tesseract☆32Updated last year
- A high performance "thin wrapper" HTTP REST server on top of Apache Lucene☆146Updated last year
- Common Crawl fork of Apache Nutch☆40Updated last week
- The LAW next generation crawler.☆90Updated 4 years ago
- Tools and other things for people who work on search relevance & information retrieval☆88Updated 2 years ago
- Zulia Search Engine☆35Updated this week
- API definition, resources and reference implementation of URL Frontiers☆57Updated 2 weeks ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆47Updated 8 years ago
- Entity resolution for Elasticsearch.☆166Updated last month
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache …☆26Updated 2 weeks ago
- TeXoo – A Zoo of Text Extractors☆18Updated 5 years ago
- Index Common Crawl archives in tabular format☆125Updated last month
- TheMovieDB in Solr☆22Updated last year
- Towards an open source stack for e-commerce search☆151Updated 4 months ago
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation.☆11Updated 7 years ago
- Geographic Place, Date/time, and Pattern entity extraction toolkit along with text extraction from unstructured data and GIS outputters.☆46Updated 2 weeks ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆67Updated 6 months ago
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppo…☆48Updated 2 years ago
- A fast and simple JavaScript library specifically targeted at collecting search and search related browser events.☆43Updated 2 months ago
- Query preprocessor for Java-based search engines (Querqy Core and Lucene implementation)☆189Updated last week
- SolrCloud HAFT is a High Availability and Fault Tolerant Framework for SolrCloud☆30Updated 9 years ago
- Examples of Solr configuration entries for Solr plugins and Conceptual Search\Semantic Search from Simon Hughes Dice.com☆26Updated 9 years ago
- Search relevance evaluation toolkit☆34Updated 3 years ago
- A high performance Apache Solr log reader / parser. I am often faced with many gigs of Solr logs to analyze. This is how I cope.☆34Updated 7 years ago
- Document Ingestion Framework for Search Systems☆37Updated 2 weeks ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 7 years ago
- SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex☆19Updated 3 years ago
- Advanced desktop search/corpus exploration prototype☆21Updated 4 years ago
- A curated list of Awesome Apache Solr links and resources.☆110Updated 4 years ago
- A natural language search microservice☆95Updated 5 years ago