The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean structured JSON files. Presented at WWW 2025 @ Sydney, Australia (https://dl.acm.org/doi/10.1145/3701716.3715289)
β514Jul 18, 2025Updated 9 months ago
Alternatives and similar repositories for edgar-crawler
Users that are interested in edgar-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π Download filings from the SEC EDGAR database using Pythonβ679Mar 30, 2026Updated last month
- Download all companies periodic reports, filings and forms from EDGAR database.β1,378Dec 9, 2025Updated 4 months ago
- Python library to access and analyze SEC Edgar filings, XBRL financial statements, 10-K, 10-Q, and 8-K reportsβ2,095Apr 29, 2026Updated last week
- FiNER: Financial Numeric Entity Recognition for XBRL Taggingβ71May 24, 2022Updated 3 years ago
- Python SDK for SEC & EDGAR data β API access and bulk dataset downloads for 20M+ filings, insider trades, 13F holdings, financial statemeβ¦β296Apr 13, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Pretrained BERT Model for Financial Communications. https://arxiv.org/abs/2006.08097β652Jul 23, 2023Updated 2 years ago
- Node.js SDK for SEC & EDGAR data β API access and bulk dataset downloads for 20M+ filings, insider trades, 13F holdings, financial statemβ¦β300Apr 29, 2026Updated last week
- Text information from US companies' SEC EDGAR electronic filingsβ111Feb 1, 2026Updated 3 months ago
- Tool for the U.S. SEC EDGAR Retrieval and Parsing of Corporate Filingsβ31Apr 3, 2024Updated 2 years ago
- A small library to access files from SEC's edgarβ243Oct 13, 2024Updated last year
- Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual (semantic) structure of the document.β282Mar 25, 2025Updated last year
- Command-line interface (CLI) program for downloading 10-K, 10-K/A, 10-Q, 10-Q/A filings from the SEC EDGAR database.β24Dec 24, 2023Updated 2 years ago
- Functions for extracting commonly used linguistic features from text.β13Nov 2, 2025Updated 6 months ago
- Python application used to download, parse, and extract structured/unstructured data from filings in the SEC Edgar Database (including 10β¦β126Apr 23, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SECDatabase.com produced this dataset with the text and detailed numeric information of all financial statements. The Dataset is extracteβ¦β86Nov 14, 2021Updated 4 years ago
- OpenEDGAR (openedgar.io)β327Dec 26, 2022Updated 3 years ago
- β37Sep 14, 2024Updated last year
- Repository for "Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks"β25Jul 31, 2023Updated 2 years ago
- β16Sep 10, 2024Updated last year
- A package to work with SEC data. Incorporates datamule endpoints.β531Apr 20, 2026Updated 2 weeks ago
- A package to parse SEC XBRL at scale.β19Nov 25, 2025Updated 5 months ago
- Code for Textual Factor Framework in Cong, Liang and Zhang 2019β22Jul 17, 2024Updated last year
- Python script to calculate the Fog Index of a text document.β16May 11, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Helper tools to analyze the " Financial Statement Data Sets" from the U.S. securities and exchange commission (sec.gov)β82Sep 20, 2025Updated 7 months ago
- provide cik to cusip links using 13G and 13D filingsβ178Apr 2, 2025Updated last year
- A python package to parse Securities and Exchange Commission (SEC) Standardized Generalized Markup Language (SGML). Powers the datamule pβ¦β57Apr 21, 2026Updated 2 weeks ago
- Financial Sentiment Analysis with BERTβ2,105Sep 9, 2022Updated 3 years ago
- When FLUE Meets FLANG: Benchmarks and Large Pretrained Language Model for Financial Domainβ57Feb 11, 2025Updated last year
- Code examples to accompany the paper: "ChatGPT for Textual Analysis? How to use Generative LLMs in Accounting Research"β67Jan 26, 2025Updated last year
- Preprocessing pipeline notebooks and API supporting text extraction from SEC documentsβ151Jan 1, 2024Updated 2 years ago
- Python-based parser for parsing XBRL and iXBRL filesβ153Mar 8, 2026Updated last month
- A database on VC-backed startups from Ewens and Malenko (2025)β14Feb 15, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Scraper/Parser of Fundamental Financial Data for US companiesβ23Nov 12, 2019Updated 6 years ago
- Analyzing SEC data at scaleβ47Updated this week
- EDGAR10-Q Dataset and implementation of the paper Context NERβ17Sep 29, 2023Updated 2 years ago
- Repository for CIKM 2020 resource track paper: MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Predictionβ106Feb 15, 2024Updated 2 years ago
- Introductory Guide to Using Stata in Empirical Financial Accounting Researchβ76Sep 15, 2023Updated 2 years ago
- Python APIs for Open PermIDβ15Jan 24, 2024Updated 2 years ago
- edgar 10k forms sentiment analysisβ14Jul 9, 2024Updated last year