Targoman / PersianWebScraperLinks
An accurate scrapper to scrape popular persian websites, mostly intended to be used as a tool to create large corpora for Persian language.
☆36Updated 5 months ago
Alternatives and similar repositories for PersianWebScraper
Users that are interested in PersianWebScraper are comparing it to the libraries listed below
Sorting:
- Persian ASR dataset☆40Updated last year
- A tool for translating Persian text to IPA (International Phonetic Alphabet).☆66Updated 2 years ago
- ParsBench provides toolkits for benchmarking LLMs based on the Persian language tasks.☆71Updated last month
- fully local RAG system using ollama and faiss☆44Updated 3 months ago
- ☆26Updated last year
- ☆28Updated 3 years ago
- اینجا نکات مهمی که برای رزومهنویسی لازمه رو به اشتراک میگذاریم.☆63Updated last year
- Persian OCR dateset☆76Updated 2 years ago
- A comprehensive dataset for determining gender based on Persian names, enriched with English representations.☆69Updated 2 months ago
- This repository was created using w3schools training methods for my personal practice as well as general use for learning in the JupyterN…☆21Updated 3 weeks ago
- awesome-persian-ai-cheaters☆22Updated 9 months ago
- A collection of Persian stopwords - فهرست کلمات ایست فارسی☆59Updated 3 years ago
- ☆27Updated 2 years ago
- Persian GPT2☆38Updated 4 years ago
- Persian sentiment analysis ( آناکاوی سهش های فارسی | تحلیل احساسات فارسی )☆55Updated 3 years ago
- In this repository, I will document my learning journey in data science. You’ll find resources, insights, and projects that reflect the k…☆15Updated 6 months ago
- Nakamology Website☆46Updated last year
- Iranian/Persian Datasets. دیتاستهای فارسی و ایرانی☆119Updated 2 months ago
- A Finglish to Persian converter.☆84Updated 3 years ago
- Discover optimal docker registry mirror speed for efficient network performance☆89Updated 5 months ago
- Free Software Projects of Tehlug Members☆80Updated 4 months ago
- A FLOSS software for Persian Optical Character Recognition☆90Updated last year
- دستیار واژهگزینیِ فارسی☆67Updated 4 months ago
- Persian raw text - حدود ۸۰ گیگابایت متن خام فارسی☆98Updated 4 years ago
- Simple word-level OCR program for the Persian language based on Recurrent Neural Networks using Pytorch and OpenCV☆19Updated 4 years ago
- Termustat is the online timetabling tool for university students in Iran.☆56Updated 2 weeks ago
- اینجا یه سری داستانک فرضی از تیم مهندسی اوبر میذاریم تا مفاهیم رو در قالب مکالمه این تیم یاد بگیریم.☆27Updated 3 weeks ago
- دیتاست 105هزار کتاب چاپی ایران + بهمراه جزئیات برای داده کاوی☆46Updated 6 years ago
- Persian Bert For Long-Range Sequences☆63Updated 3 years ago
- OCR on unsearchable and large PDF file☆65Updated 5 months ago