ripl-org / sockitLinks
Sockit is a natural-language processing toolkit for modeling structured occupation information and Standard Occupational Classification (SOC) codes in unstructured text from job titles, job postings, and resumes.
☆22Updated last month
Alternatives and similar repositories for sockit
Users that are interested in sockit are comparing it to the libraries listed below
Sorting:
- ☆34Updated 3 years ago
- Domain Specific BERT Model for Text Mining in Sustainable Investing☆141Updated 4 months ago
- Measuring Sustainability Reporting using Web Scraping and Natural Language Processing☆36Updated 8 years ago
- Earnings-Call-Dataset / MAEC-A-Multimodal-Aligned-Earnings-Conference-Call-Dataset-for-Financial-Risk-PredictionRepository for CIKM 2020 resource track paper: MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction☆94Updated last year
- This repository includes our work on extracting the digital transformation strategy of Fortune 500 companies from earnings calls transcri…☆30Updated 4 years ago
- Accounting Fraud Detection Using Machine Learning☆148Updated 2 years ago
- EDGAR10-Q Dataset and implementation of the paper Context NER☆17Updated 2 years ago
- Agent based-model of the banking system (NetLogo)☆11Updated 7 years ago
- A series of Jupyter Notebooks that demonstrate how to scrape data from the S&P Capital IQ Website, provided that you already have access …☆18Updated 6 years ago
- The Financial Audit Data Analytics Paper Collection is an academic paper collection that encompasses data analytics, machine learning, an…☆61Updated 4 years ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆97Updated last year
- A list of GDELT themes that taken together broadly represent "issues" and media source lists, a way to split GDELT sources into more conc…☆21Updated 6 years ago
- The earnings conference call dataset of S&P 500 companies☆150Updated 3 years ago
- Repository for "Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks"☆24Updated 2 years ago
- Analyze central bank announcements☆72Updated 2 years ago
- Token and sentence level embeddings from FinBERT model (Finance Domain)☆39Updated 2 years ago
- Notebooks for fine-tuning a BERT model and training a LSTM model for financial QA☆36Updated 5 years ago
- Codebase for FOMC-NLP, accepted at ACL 2023 (main)☆62Updated 11 months ago
- This projects helps scraping and analysing the 10K and 10Q documents filed by publicly traded companies to the SEC.☆22Updated 5 years ago
- edgarParser helps you parse and analyze SEC filings from the EDGAR database☆94Updated 2 years ago
- Code for the paper "CAREER: Transfer Learning for Economic Prediction of Labor Sequence Data"☆50Updated last year
- The Harvard USPTO Patent Dataset☆78Updated last year
- Forecasting CPI Inflation with Hierarchical Recurrent Neural Networks☆27Updated last year
- ☆16Updated last year
- Read WRDS datasets remotely (from wrds-cloud) into a Pandas dataframe.☆143Updated this week
- Applying NLP framework to 10-K filings in equity markets☆14Updated 4 years ago
- An analysis of abilities, skills and tech skills data from the O*NET database as well as classification of around 500 random LinkedIn job…☆18Updated 4 years ago
- Fast, flexible name matching for large datasets☆71Updated 2 months ago
- Python library for interacting with EDGAR.☆41Updated last month
- Evaluation and benchmarking of PatentsView disambiguation algorithms☆13Updated last year