ripl-org / sockitLinks
Sockit is a natural-language processing toolkit for modeling structured occupation information and Standard Occupational Classification (SOC) codes in unstructured text from job titles, job postings, and resumes.
☆22Updated 3 months ago
Alternatives and similar repositories for sockit
Users that are interested in sockit are comparing it to the libraries listed below
Sorting:
- The earnings conference call dataset of S&P 500 companies☆150Updated 3 years ago
- Earnings-Call-Dataset / MAEC-A-Multimodal-Aligned-Earnings-Conference-Call-Dataset-for-Financial-Risk-PredictionRepository for CIKM 2020 resource track paper: MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction☆99Updated last year
- ☆34Updated 4 years ago
- Python data Manipulation, visualizations and Natural Language Processing analysis for Wall Street Journal web scraping project #2 for NYC…☆49Updated 4 months ago
- Code for the paper "CAREER: Transfer Learning for Economic Prediction of Labor Sequence Data"☆50Updated last year
- Domain Specific BERT Model for Text Mining in Sustainable Investing☆142Updated 7 months ago
- Agent based-model of the banking system (NetLogo)☆11Updated 7 years ago
- This repo aims to share the core algorithms used in the paper, "Global labor flow network reveals the hierarchical organization and dynam…☆30Updated 6 years ago
- A list of GDELT themes that taken together broadly represent "issues" and media source lists, a way to split GDELT sources into more conc…☆22Updated 6 years ago
- This projects helps scraping and analysing the 10K and 10Q documents filed by publicly traded companies to the SEC.☆22Updated 5 years ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆134Updated 3 months ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆107Updated last year
- A series of Jupyter Notebooks that demonstrate how to scrape data from the S&P Capital IQ Website, provided that you already have access …☆18Updated 6 years ago
- Library for creating causal chains using language models.☆81Updated 3 years ago
- FiNER: Financial Numeric Entity Recognition for XBRL Tagging☆70Updated 3 years ago
- Handy Jupyter Notebooks that I use in for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allo…☆42Updated 6 years ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆89Updated 2 months ago
- NLP: An Application for Public Policy, PyCon Ireland 2018☆27Updated 3 years ago
- Analyze central bank announcements☆73Updated 2 years ago
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…☆161Updated 2 months ago
- Fast, flexible name matching for large datasets☆71Updated 5 months ago
- Code for blog posts.☆20Updated 2 years ago
- This repository includes our work on extracting the digital transformation strategy of Fortune 500 companies from earnings calls transcri…☆31Updated 5 years ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆74Updated last year
- Pytorch implementation of "Adapting Text Embeddings for Causal Inference"☆93Updated 4 years ago
- Parse and cluster USPTO patent data. Includes applications, grants, assignments, and maintenance.☆140Updated 2 years ago
- BirdSpotter is a python package which provides an influence and bot detection toolkit for twitter.☆19Updated 4 years ago
- Tensorflow 2 implementation of Causal-BERT☆74Updated 2 years ago
- Measuring Sustainability Reporting using Web Scraping and Natural Language Processing☆37Updated 8 years ago
- Accounting Fraud Detection Using Machine Learning☆159Updated 2 years ago