IlyasHabeeb / Machine_Learning_Focused_Crawler
A focused web crawler that uses machine learning to fetch more relevant results.
☆13 Updated 7 years ago
Alternatives and similar repositories for Machine_Learning_Focused_Crawler
Users who are interested in Machine_Learning_Focused_Crawler are comparing it to the libraries listed below
- This is an application that automates the process of text analysis with a user-friendly GUI. It has been implemented using Python and … ☆40 Updated 3 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆58 Updated 2 years ago
- Streamlit component like Microsoft Excel ☆24 Updated 3 years ago
- ☆13 Updated 3 years ago
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, … ☆87 Updated last year
- TrendsGPT: An AI-powered tool harnessing OpenAI's GPT-4 to automate market research and data analysis. Fetches trending topics from Re… ☆106 Updated 2 years ago
- Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal ☆100 Updated 2 years ago
- EcommerceTools is a Python data science toolkit for ecommerce, marketing science, and technical SEO analysis and modelling and was create… ☆259 Updated 2 years ago
- GPT-3.5-turbo + Harvard's Case Access Project ☆18 Updated 2 years ago
- Get data about companies from advanced search without the use of API ☆66 Updated 6 years ago
- Streamlit-based web app for Streamlit Hackathon ☆106 Updated 10 months ago
- Knowledge Graph for Legal Documents using Litigation Releases from the SEC website. Classifies into different crimes, extracts relevant i… ☆82 Updated 3 years ago
- A minimalistic web app to generate transcription for audio built using Python ☆32 Updated 2 years ago
- Article on Marqo + GPT3 for news summarisation ☆19 Updated 2 years ago
- A PoC for a location-aware legal assistant powered by GPT-4 and Gradio. ☆26 Updated 6 months ago
- Explore Multiple Vector Databases and chat with documents on Multiple LLM models, private LLM models ☆48 Updated 2 years ago
- PDF text data extraction web app with OCR for scanned documents ☆95 Updated last year
- Lobe is the world's first AI paralegal. ☆51 Updated 3 years ago
- URL articles text summarizer using Web Crawling and NLP (written in Python) ☆51 Updated 5 years ago
- Fully working applications that demonstrate how to use Haystack to implement various use cases ☆135 Updated 2 months ago
- The Selenium scraper that collected a million stories from Medium.com ☆82 Updated 7 years ago
- Document Search Engine Tool ☆77 Updated 3 years ago
- Build a small, 3-domain internet using GitHub Pages and Wikipedia and construct a crawler to crawl, render, and index. ☆75 Updated 2 years ago
- This is the starting template for a content generator website that uses AI in the background to generate the content. ☆57 Updated 4 years ago
- A LinkedIn lead generation web-scraper script. This project uses Selenium to automate the Chrome Browser & Beautiful Soup to parse the da… ☆11 Updated last year
- Python Streamlit web app utilizing OpenAI (GPT4) and LangChain LLM tools with access to Wikipedia, DuckDuckGo Search, and a ChromaDB with… ☆73 Updated 2 years ago
- Applying the latest advancements in AI and machine learning to solve complex business problems. ☆83 Updated last year
- Use GPT3 to generate SQL from text ☆15 Updated 10 months ago
- The HugChat Streamlit app is an LLM-powered chatbot built using Streamlit and HugChat. ☆98 Updated 2 years ago
- Go from company names / urls to full competitive analysis in minutes using a GPT-4 powered LLM Agent ☆36 Updated 2 years ago