hannesrabo / simple-search-engine
Indexing project where we index a portion of the web using Spark, Hadoop, and Cassandra.
☆21 Updated 5 years ago
Alternatives and similar repositories for simple-search-engine
Users who are interested in simple-search-engine are comparing it to the libraries listed below.
- Common crawl extractor ☆78 Updated last year
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, … ☆83 Updated 7 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine ☆242 Updated 2 years ago
- Index Common Crawl archives in tabular format ☆123 Updated 2 months ago
- Botpress messaging server ☆48 Updated 4 months ago
- Tweet Generation with Huggingface ☆431 Updated last year
- Shoonya - Platform to Annotate and label data at scale. ☆56 Updated 10 months ago
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations. ☆48 Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line ☆133 Updated 6 months ago
- Official Python bindings for the Axiom API ☆30 Updated 2 months ago
- 🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web. ☆303 Updated last year
- Transliteration models for 21 Indic languages ☆93 Updated last year
- GPU-Powered Topic Modelling ☆70 Updated 2 years ago
- A python utility for downloading Common Crawl data ☆242 Updated 2 years ago
- Generate product descriptions, blogs, ads and more using GPT architecture with a single request to TextCortex API a.k.a Hemingwai ☆40 Updated 2 years ago
- ☆81 Updated 2 years ago
- A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks. ☆355 Updated last year
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors. ☆315 Updated last year
- Dolores is a Python library designed to improve the developer experience when working with pretrained language models. Dolores provides p… ☆34 Updated 4 years ago
- Do everything from data collection from reddit to training a machine learning model in just two lines of python code! ☆82 Updated last year
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/… ☆60 Updated 4 years ago
- 🤖 Rox AI is a tool that connects Fonoster with Dialogflow ES/CX ☆5 Updated 2 years ago
- ☆363 Updated 8 months ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine ☆179 Updated 6 months ago
- A GitHub action to run easily rasa train and rasa test in the CIs. ☆34 Updated 7 months ago
- A Directory of Online Newspaper Sources for 70+ Languages ☆32 Updated 4 years ago
- openduplex uses speech-to-text, artificial intelligence and text-to-speech, to call businesses and make appointments for you ☆35 Updated 2 years ago
- Advanced playground for GPT-3 ☆409 Updated 2 years ago
- A study implementation of Gmail Smart Compose trained with Keras and used in browser with Tensorflow.js ☆28 Updated 2 years ago
- custom domain link shortener using cloudflare workers + kv ☆64 Updated 3 years ago