IlyasHabeeb / Machine_Learning_Focused_Crawler
A focused web crawler that uses Machine Learning to fetch better relevant results.
β13Updated 5 years ago
Related projects β
Alternatives and complementary repositories for Machine_Learning_Focused_Crawler
- This is an application that automates the process of text analysis with a user-friendly GUI. π± It has been implemented using Python and β¦β34Updated 2 years ago
- Document Search Engine Toolβ71Updated last year
- GPT-3.5-trubo + Harvard's Case Access Projectβ16Updated last year
- Run streamlit web application, test and deploy to a cloud service (GCP, AWS, Heroku)β14Updated 2 years ago
- Python Script for Copywriters to Gather Data from Competing Content and Find Keyword Overlapβ11Updated 2 years ago
- AI models for automatic job application pipeline (user CV, job description analysis (customized NER/SpaCy) and artificial cover letter geβ¦β31Updated 5 months ago
- Google Search Results Pages Dashboardβ36Updated last year
- URL articles text summarizer using Web Crawling and NLP (written in Python)β43Updated 3 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trendsβ56Updated 9 months ago
- Project automates AI news gathering and Blog post Writing. Our AI agent collects insights and news about any topic From Internet and Wriβ¦β16Updated 7 months ago
- β23Updated last year
- Lobe is the world's first AI paralegal.β43Updated last year
- Streamlit application to keep GPT3 Experimentation saneβ23Updated 3 years ago
- Fully working applications that demonstrate how to use Haystack to implement common NLP use casesβ107Updated last week
- π Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!β19Updated last year
- π A contracts clause summarization system using LLM and vector databaseβ13Updated 8 months ago
- LegalLens is an AI legal assistant that delivers accurate legal information based on user queries and jurisdictions. Using OpenAI's GPT-4β¦β22Updated last year
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, Rβ¦β14Updated 2 years ago
- Building a Job Datasetβ21Updated 2 years ago
- β22Updated 3 years ago
- Go from company names / urls to full competitive analysis in minutes using a GPT-4 powered LLM Agentβ26Updated last year
- Python libraries for extracting from data sources like Rechtspraak, ECHR, Cellarβ12Updated 3 weeks ago
- Scraping Wikipedia by combining LangChain's agents and tools with OpenAI's LLMs and function callingβ26Updated 10 months ago
- PDF text data extraction web app with OCR for scanned documentsβ79Updated 5 months ago
- Toolkit to get the most out of your OpenAI Accountβ12Updated 11 months ago
- Zyte Automatic Extraction integration for Scrapyβ55Updated 2 years ago
- Integrate Watson Studio and Watson Campaign Automation to tailor your target audience for effective campaignsβ11Updated 2 years ago
- Component to create custom sidebar for streamlitβ12Updated 5 months ago
- A python package for finding e-mails, checking deliverability and more.β48Updated 6 months ago