o19s / tmdb_dump
Dump TheMovieDB
☆27 Updated 4 years ago
Alternatives and similar repositories for tmdb_dump
Users who are interested in tmdb_dump are comparing it to the libraries listed below.
- Tools and other things for people who work on search relevance & information retrieval ☆87 Updated 2 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆46 Updated 8 years ago
- Demonstration of searching PDF document with Solr, Tika, and Tesseract ☆32 Updated last year
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine ☆194 Updated 3 weeks ago
- Common Crawl fork of Apache Nutch ☆40 Updated last week
- A dataset of multinational first names and last names ☆27 Updated 2 years ago
- Index Common Crawl archives in tabular format ☆124 Updated last week
- Entity resolution for Elasticsearch. ☆164 Updated 2 months ago
- Search relevance evaluation toolkit ☆74 Updated 3 years ago
- Towards an open source stack for e-commerce search ☆150 Updated 2 months ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm ☆67 Updated 5 months ago
- A natural language search microservice ☆96 Updated 4 years ago
- TheMovieDB in Solr ☆22 Updated last year
- The LAW next generation crawler. ☆89 Updated 4 years ago
- Solr Query Segmenter for structuring unstructured queries ☆22 Updated 4 years ago
- Advanced desktop search/corpus exploration prototype ☆21 Updated 4 years ago
- Query preprocessor for Java-based search engines (Querqy Core and Lucene implementation) ☆189 Updated this week
- MagnetMagnet is a scraper that allows you to scrape torrent information, such as: magnet links, name, size, seeders and leechers, from Ki… ☆64 Updated 4 years ago
- Querqy for Elasticsearch ☆48 Updated this week
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppo… ☆47 Updated 2 years ago
- Tool to generate paraphrases of sentences in many languages. ☆85 Updated 3 years ago
- API definition, resources and reference implementation of URL Frontiers ☆54 Updated last month
- TeXoo – A Zoo of Text Extractors ☆18 Updated 5 years ago
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache … ☆26 Updated last month
- Converts Youtube URLs to Text with Speech Recognition ☆28 Updated 3 years ago
- Search relevance evaluation toolkit ☆34 Updated 3 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear… ☆86 Updated 4 years ago
- Machine Learning for Information Retrieval ☆86 Updated 6 months ago
- Common web archive utility code. ☆57 Updated last week
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents ☆20 Updated 5 years ago