SimFin / pdf-crawler
SimFin's open source PDF crawler
☆128Updated 6 years ago
Alternatives and similar repositories for pdf-crawler
Users that are interested in pdf-crawler are comparing it to the libraries listed below
- Scrapers from a project in 2018. Yelp, Spyfu, Similarweb, Morningstar, Linkedin, Instagram, Inside, Glassdoor, Facebook, Eat24, Doordash,…☆97Updated 6 years ago
- Case Studies on Forensic Accounting using Data Analysis☆53Updated 6 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆16Updated 7 years ago
- Data repository of JSON files that are filed by US Senators on efdsearch.senate.gov where they must report their stock trades. This is th…☆66Updated 4 years ago
- Python script to extract as much structured information as possible from annual/quarterly reports.☆103Updated last year
- Securities and Exchange Commission utility package for dealing with Edgar database. Includes methods to download index files and SEC file…☆37Updated 5 years ago
- Investigate how mutual funds leverage credit derivatives by studying their routine filings to the SEC using NLP techniques 📈🤑☆53Updated 10 months ago
- A US equities trading & settlement calendar command-line tool☆12Updated 3 years ago
- Example code to be used with pyxll☆102Updated last year
- The CorpWatch API uses automated parsers to extract the subsidiary relationship information from Exhibit 21 of companies' 10-K filings wi…☆49Updated 10 months ago
- ☆34Updated 5 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆63Updated this week
- Financial modeling with Python and Pandas☆62Updated 4 years ago
- Python application used to download, parse, and extract structured/unstructured data from filings in the SEC Edgar Database (including 10…☆114Updated 3 months ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- Run streamlit web application, test and deploy to a cloud service (GCP, AWS, Heroku)☆14Updated 3 years ago
- List of companies in the S&P 500 (Standard and Poor's 500).☆72Updated 6 months ago
- SECDatabase.com produced this dataset with the text and detailed numeric information of all financial statements. The Dataset is extracte…☆84Updated 4 years ago
- Export 15 years P&L and BS data from moneycontrol. Correlation analysis of various heads. Mean & Std. Graphs of YoY changes and projectin…☆29Updated 5 years ago
- Scraping Assisted by Learning☆36Updated 2 months ago
- Cap Table and Exit Waterfall Tool, https://foresight.is/cap-table☆39Updated 9 months ago
- Python implementation of Benford's Law tests.☆152Updated 3 years ago
- scrapes names and tickers from magicformulainvesting.com every quarter, adds info to a google sheet which includes stock prices and a lin…☆33Updated 2 years ago
- This project is a wrapper for Leilex, a legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆25Updated last year
- ICR - Automated and Intelligent Company Report Built in Python (by @firmai)☆181Updated 3 years ago
- Populate fillable pdf forms from csv data file☆63Updated 3 years ago
- Extracting sentiment from financial statements using neural networks☆21Updated 7 years ago
- Tools for stock market analysis.☆64Updated 4 years ago
- Scripts to consume and analyze the GDELT project's data☆28Updated 8 years ago
- Library for scraping websites or apis at any scale☆54Updated last year