Extract structured data from HTML and XML documents like a boss.
☆51Dec 6, 2024Updated last year
Alternatives and similar repositories for python-xextract
Users that are interested in python-xextract are comparing it to the libraries listed below.
- Scrapy Tutorial☆11Feb 19, 2017Updated 9 years ago
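- scrapy-redis-expiredupefilter is a distributed Scrapy crawler framework adapted from scrapy-redis. It supports setting a lifetime for request fingerprints; when a fingerprint's lifetime ends, it is cleared automatically without affecting other fingerprints.☆10Aug 6, 2019Updated 6 years ago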
- Cleans corpora collected from DBpedia and online encyclopedias to obtain suitable triples☆15Jun 24, 2017Updated 9 years ago
- Organise your email inbox with rules from a YAML file☆12Mar 25, 2021Updated 5 years ago
- Add macros to your django templates☆34Aug 16, 2023Updated 2 years ago
- Install a working 'pythonw' into a virtualenv on Mac OS X☆51Jul 22, 2018Updated 7 years ago
- (Read-only) Generate n-grams☆27Aug 30, 2016Updated 9 years ago
- A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.☆15Feb 9, 2014Updated 12 years ago
- Treat XPath expressions as Python objects☆11Mar 31, 2021Updated 5 years ago
- Elastic Search Code☆23Aug 29, 2021Updated 4 years ago
- Collects categorized company information from Qichacha (企查查)☆43Apr 2, 2020Updated 6 years ago
- Powerful Python dict subclass(es) providing aliasing & attribute access☆34Oct 20, 2025Updated 8 months ago
- Distributed task redisqueue (the simplest Python distributed function scheduling framework)☆65Nov 17, 2025Updated 7 months ago
- Allowing an anonymous user to log in by only visiting a URL☆32Sep 9, 2017Updated 8 years ago
- Simple Python configuration utilities.☆18Jul 26, 2022Updated 3 years ago
- Will Search Various Platforms to Confirm An Email Exists.☆10May 29, 2020Updated 6 years ago
- Python3 & Flask connector for Rich Filemanager☆16Apr 30, 2018Updated 8 years ago
- A multi-user web-based CRM for freelancers with an emphasis on flow and momentum☆20Feb 21, 2017Updated 9 years ago
- Demo of JavaScript Obfuscate☆21May 7, 2023Updated 3 years ago
- Toolkit for storing files and attachments in web applications☆165Mar 25, 2026Updated 3 months ago
- Stores email header and body information in JSON format☆12Mar 10, 2016Updated 10 years ago
- Python dependency management via Poetry☆16Nov 28, 2022Updated 3 years ago
- Use pyppeteer from a Scrapy spider☆59Feb 5, 2020Updated 6 years ago
- Game 2048 bot☆11Feb 10, 2024Updated 2 years ago
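- pip install nb_http_client. nb_http_client is described by its author as the highest-performance HTTP client in Python's history, many times faster than any other request package☆36May 28, 2024Updated 2 years ago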
- A simple tool in Python that automates Gmail replies by using Selenium☆17Dec 19, 2016Updated 9 years ago
- GeoDjango buildpack☆18Oct 31, 2015Updated 10 years ago
- Simple, efficient and cross-platform TFIDF-based text summarizer in Rust☆13Apr 12, 2024Updated 2 years ago
- Unassisted clustering algorithms and data structures in Rust☆13Dec 9, 2019Updated 6 years ago
- simple C# portscanner - written for playing around with Metasploit's Execute-Assembly☆10Jul 1, 2023Updated 2 years ago
- Master 2 thesis at the ERG☆14Aug 16, 2017Updated 8 years ago
- Fork of https://sourceforge.net/p/dunelegacy/code/ci/master/tree/☆18Dec 15, 2023Updated 2 years ago
- DEPRECATED - name_tools for Open States and other projects☆19Feb 12, 2020Updated 6 years ago
- A whitelisting HTML filter. Allows only a well-defined subset of HTML to pass through, with URL filtering.☆35Oct 13, 2024Updated last year
- Rust cli tool for running multiple commands in parallel☆22Oct 21, 2024Updated last year
- Preparing DMOZ dataset for my n-Gram LM-based URL classification research☆31Aug 30, 2014Updated 11 years ago
- A Django app that displays pdf files in a grid and lets you read them as flipbooks☆14Mar 3, 2026Updated 3 months ago
- A handy tool for memory problems in Python☆14Nov 12, 2019Updated 6 years ago
- High-speed physical quantities and dimensions in Python☆15Updated this week