An accurate scrapper to scrape popular persian websites, mostly intended to be used as a tool to create large corpora for Persian language.
☆37Jan 20, 2025Updated last year
Alternatives and similar repositories for PersianWebScraper
Users that are interested in PersianWebScraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆28Jul 20, 2021Updated 4 years ago
- ☆47Dec 9, 2023Updated 2 years ago
- ALBERT Persian Playground☆13Jun 12, 2023Updated 2 years ago
- A comprehensive suite of high-level NLP tasks for Persian language☆168Mar 29, 2021Updated 5 years ago
- Transformers, LLM, Prompt Engineering, In-Context Learning, RAG, SFT, RLHF☆10Nov 23, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Benchmarking ChatGPT for Persian: A Preliminary Study☆22Apr 6, 2024Updated 2 years ago
- Standardize your Persian text: Preprocessing, Embedding, and more!☆16Aug 21, 2023Updated 2 years ago
- Both client and server side of the “Vajehh” web application: the search engine for writers.☆47Sep 21, 2024Updated last year
- The all-in-one AI library for Persian, supporting a wide variety of tasks and modalities!☆1,002Mar 9, 2026Updated last month
- Persian Calendar in Python☆39May 24, 2025Updated 10 months ago
- A repo to introduce website that share Data and Dataset about Iran [Useful for Journalist and Researchers]☆46Apr 13, 2025Updated 11 months ago
- PCoQA: Persian Conversational Question Answering Dataset☆21Aug 11, 2024Updated last year
- Build AI agents with serverless architecture☆25Jun 3, 2025Updated 10 months ago
- A well-structured summarization dataset for the Persian language!☆52Jan 20, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Fast Neural Machine Translation in C++☆13Feb 21, 2018Updated 8 years ago
- A Python-based Data Analysis Tool for Examining and Visualizing Weather Patterns in Tehran. Utilizes Historical Weather Data to Provide I…☆15Jun 13, 2025Updated 9 months ago
- A simple Java/Kotlin field-level encryption library.☆13May 4, 2025Updated 11 months ago
- Android client for Mahadel project [Deprecated]☆24Aug 25, 2019Updated 6 years ago
- linkedin scraper 🔥 python3 linkedin jobs scraper | education jobs web scraper linkedin scraper profile data linkedin scraper linkedin sc…☆23Apr 27, 2025Updated 11 months ago
- Persian ASR dataset☆42Jul 15, 2023Updated 2 years ago
- Simplifying Persian NLP for Modern Applications☆61Mar 23, 2026Updated 2 weeks ago
- Persian/Farsi text to speech(TTS) training using coqui tts☆200Feb 15, 2025Updated last year
- In this repository, I will document my learning journey in data science. You’ll find resources, insights, and projects that reflect the k…☆13Dec 26, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- fontforge script adding opentype mark feature & Anchor placement for Arabic letters completely automatic☆15Apr 5, 2024Updated 2 years ago
- Persian NLP Toolkit☆1,382Apr 1, 2026Updated last week
- FaBERT: Pre-training BERT on Persian Blogs☆11Aug 6, 2025Updated 8 months ago
- ☆14Nov 2, 2018Updated 7 years ago
- MirasText☆75Aug 12, 2020Updated 5 years ago
- ☆13Mar 2, 2023Updated 3 years ago
- Link previews everywhere.☆13Dec 22, 2022Updated 3 years ago
- Python package to convert pdf to Farsi Word☆136Jun 30, 2024Updated last year
- This repository was created using w3schools training methods for my personal practice as well as general use for learning in the JupyterN…☆24May 29, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- POC of Digikala Semantic Search☆36May 10, 2024Updated last year
- Open information extraction from Persian web☆48Feb 11, 2018Updated 8 years ago
- در مورد قلم آزاد حرف میزنیم☆48Mar 27, 2016Updated 10 years ago
- Fine-Tuned Llama 3 Persian Large Language Model LLM / Persian Llama 3☆36Aug 17, 2025Updated 7 months ago
- ویرایشگر مارکداون برای متون فارسی☆22Feb 4, 2020Updated 6 years ago
- Persian (Farsi) Question Answering Dataset (+ Models)☆213Sep 8, 2021Updated 4 years ago
- List of text corpora (text dataset in Persian) that we used in FarsiYar text-mining tools☆18Jul 16, 2019Updated 6 years ago