Python-based crawler that mines PDFs from websites and extracts useful features for data extraction
☆21 Oct 4, 2020 Updated 5 years ago
Alternatives and similar repositories for pdf-miner
Users that are interested in pdf-miner are comparing it to the libraries listed below.
- An offline IDE for C++, similar to ideone.com, that ensures your code doesn't fall into the wrong hands :p ☆16 Feb 18, 2016 Updated 10 years ago
- Network security scripts and tips... ☆14 Oct 23, 2018 Updated 7 years ago
- ☆22 May 5, 2020 Updated 6 years ago
- ☆20 Oct 26, 2020 Updated 5 years ago
- ☆25 Jan 6, 2016 Updated 10 years ago
- Flappy Bird Automation using RL and Servo ☆30 Sep 2, 2016 Updated 9 years ago
- A simple, system independent infrastructure for performing web scraping. Utilizes Vagrant VirtualBox interface and Puppet provisioning to… ☆24 Jul 30, 2014 Updated 11 years ago
- Command Line Utilities For iOS Development ☆33 Oct 26, 2021 Updated 4 years ago
- ☆10 Mar 31, 2017 Updated 9 years ago
- Writeups for different CTF challenges ☆71 Mar 25, 2025 Updated last year
- A list of papers, books and sites for various topics related to machine learning and deep learning along with various fields in w… ☆68 Feb 9, 2018 Updated 8 years ago
- Boilerplate to port your bookmarklet to a Firefox Add-on ☆12 Dec 6, 2015 Updated 10 years ago
- A Python script to download course contents (videos, ppt, pdf, etc.) from coursera.org ☆57 Jan 23, 2014 Updated 12 years ago
- Example of converting a bookmarklet to a Chrome extension package ☆21 Jul 1, 2010 Updated 15 years ago
- Tools of the trade ☆85 Sep 6, 2022 Updated 3 years ago
- you idiots ☆11 Oct 30, 2024 Updated last year
- An AWS S3 file manager. It supports keyword search, upload, video preview, and archiving files into a zip for download. ☆11 Mar 20, 2023 Updated 3 years ago
- AI That Does File Shit For You! ☆14 Jan 21, 2026 Updated 4 months ago
- Online version of the famous game "Chain Reaction"! ☆38 Oct 4, 2020 Updated 5 years ago
- Parse or build bookmark files ☆10 Mar 16, 2018 Updated 8 years ago
- Python scripts to download course videos off CDEEP ☆12 Oct 20, 2015 Updated 10 years ago
- Download music from Mixcloud ☆11 Aug 12, 2018 Updated 7 years ago
- This is the official code used for WAT 2017 Description Paper titled A Bag of Useful Tricks for Practical Neural Machine Translation: Emb… ☆12 Oct 24, 2017 Updated 8 years ago
- A basic peniscoin miner in Python ☆11 Jun 12, 2014 Updated 12 years ago
- ☆11 Nov 13, 2020 Updated 5 years ago
- Simple Python web crawler that looks through websites for media files (mp3, wma, aac) and extracts their metadata ☆20 Jul 11, 2011 Updated 14 years ago
- Exploiting cameras with a very distinctive HTTP Server header of "JAWS/1.0". ☆11 Jan 11, 2023 Updated 3 years ago
- ☆10 Mar 7, 2017 Updated 9 years ago
- Dockerfile for a machine learning environment (scikit-learn, chainer, gensim, tensorflow, jupyter) ☆10 Aug 16, 2018 Updated 7 years ago
- Companion repo for my blog post about Wolfram's elementary cellular automata ☆10 Feb 11, 2017 Updated 9 years ago
- A GUI for the scrypt-based cpuminer ☆29 Jun 28, 2019 Updated 6 years ago
- Crowdsourced event coverage system developed by Pittsburgh IndyMedia during the G20. Incorporates Twitter, Flickr, Google Maps, podcast… ☆25 Feb 15, 2010 Updated 16 years ago
- Portable version of the Pentaho Data Integration (Kettle) application, for Windows ☆13 Jun 7, 2024 Updated 2 years ago
- A Python interface to WolframAlpha. ☆15 Apr 4, 2010 Updated 16 years ago
- Chrome extension for revealing pages in the Scribd document viewer. ☆15 Apr 30, 2022 Updated 4 years ago
- YourDebrid is a fully open-source debrid service written in D ☆18 Sep 8, 2018 Updated 7 years ago
- Convert JavaScript files to bookmarklets ☆20 May 9, 2020 Updated 6 years ago
- Procedural world generation in Unity (for GEDE at Reykjavík University 2016) ☆10 Apr 3, 2016 Updated 10 years ago
- Junction Tree Variational AutoEncoder Implementation Attempt ☆11 Jun 21, 2018 Updated 7 years ago