The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean structured JSON files. Presented at WWW 2025 @ Sydney, Australia (https://dl.acm.org/doi/10.1145/3701716.3715289)
☆504Jul 18, 2025Updated 8 months ago
Alternatives and similar repositories for edgar-crawler
Users that are interested in edgar-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- edgarParser helps you parse and analyze SEC filings from the EDGAR database☆100Feb 27, 2023Updated 3 years ago
- FiNER: Financial Numeric Entity Recognition for XBRL Tagging☆71May 24, 2022Updated 3 years ago
- Python SDK for SEC & EDGAR data — API access and bulk dataset downloads for 20M+ filings, insider trades, 13F holdings, financial stateme…☆291Apr 9, 2026Updated last week
- A Pretrained BERT Model for Financial Communications. https://arxiv.org/abs/2006.08097☆650Jul 23, 2023Updated 2 years ago
- Node.js SDK for SEC & EDGAR data — API access and bulk dataset downloads for 20M+ filings, insider trades, 13F holdings, financial statem…☆295Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Text information from US companies' SEC EDGAR electronic filings☆112Feb 1, 2026Updated 2 months ago
- Extract financial data from the SEC's EDGAR database☆168May 22, 2023Updated 2 years ago
- Functions for extracting commonly used linguistic features from text.☆13Nov 2, 2025Updated 5 months ago
- Python application used to download, parse, and extract structured/unstructured data from filings in the SEC Edgar Database (including 10…☆123Mar 25, 2026Updated 3 weeks ago
- SECDatabase.com produced this dataset with the text and detailed numeric information of all financial statements. The Dataset is extracte…☆85Nov 14, 2021Updated 4 years ago
- Code to manage data related to SEC filings on EDGAR.☆20May 16, 2023Updated 2 years ago
- OpenEDGAR (openedgar.io)☆326Dec 26, 2022Updated 3 years ago
- ☆16Sep 10, 2024Updated last year
- A package to parse SEC XBRL at scale.☆19Nov 25, 2025Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code for Textual Factor Framework in Cong, Liang and Zhang 2019☆20Jul 17, 2024Updated last year
- Build a local database of company filings pulled from SEC's EDGAR.☆23Oct 24, 2020Updated 5 years ago
- Python script to calculate the Fog Index of a text document.☆16May 11, 2018Updated 7 years ago
- Helper tools to analyze the " Financial Statement Data Sets" from the U.S. securities and exchange commission (sec.gov)☆81Sep 20, 2025Updated 6 months ago
- provide cik to cusip links using 13G and 13D filings☆179Apr 2, 2025Updated last year
- This repository has code to scrape FINRA Trade data☆10Oct 15, 2019Updated 6 years ago
- A python package to parse Securities and Exchange Commission (SEC) Standardized Generalized Markup Language (SGML). Powers the datamule p…☆56Feb 2, 2026Updated 2 months ago
- Financial Sentiment Analysis with BERT☆2,074Sep 9, 2022Updated 3 years ago
- When FLUE Meets FLANG: Benchmarks and Large Pretrained Language Model for Financial Domain☆57Feb 11, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code examples to accompany the paper: "ChatGPT for Textual Analysis? How to use Generative LLMs in Accounting Research"☆65Jan 26, 2025Updated last year
- Preprocessing pipeline notebooks and API supporting text extraction from SEC documents☆151Jan 1, 2024Updated 2 years ago
- A database on VC-backed startups from Ewens and Malenko (2025)☆13Feb 15, 2025Updated last year
- Scraper/Parser of Fundamental Financial Data for US companies☆23Nov 12, 2019Updated 6 years ago
- This projects helps scraping and analysing the 10K and 10Q documents filed by publicly traded companies to the SEC.☆22Nov 3, 2020Updated 5 years ago
- EDGAR10-Q Dataset and implementation of the paper Context NER☆17Sep 29, 2023Updated 2 years ago
- Repository for CIKM 2020 resource track paper: MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction☆104Feb 15, 2024Updated 2 years ago
- Introductory Guide to Using Stata in Empirical Financial Accounting Research☆76Sep 15, 2023Updated 2 years ago
- 10-K's Textual Analysis: A Python package parsing SEC‘s 10-K fillings in all formats(html, txt)☆16Apr 2, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python APIs for Open PermID☆15Jan 24, 2024Updated 2 years ago
- edgar 10k forms sentiment analysis☆14Jul 9, 2024Updated last year
- ☆25Feb 26, 2025Updated last year
- Code to incorporate non-compete law changes using Stata, R and Python (Ewens and Marx (2017))☆12Jun 27, 2023Updated 2 years ago
- Data Science Research Project: Map poverty using satellite images.☆11Aug 14, 2020Updated 5 years ago
- This is a standalone version of my former ACCTG 579B phd class on Python programming for business research.☆20Aug 22, 2023Updated 2 years ago
- MeatPy☆30Updated this week