Sockit is a natural-language processing toolkit for modeling structured occupation information and Standard Occupational Classification (SOC) codes in unstructured text from job titles, job postings, and resumes.
☆22Oct 13, 2025Updated 4 months ago
Alternatives and similar repositories for sockit
Users that are interested in sockit are comparing it to the libraries listed below
Sorting:
- An analysis of abilities, skills and tech skills data from the O*NET database as well as classification of around 500 random LinkedIn job…☆19Nov 27, 2020Updated 5 years ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆74Jun 27, 2024Updated last year
- ☆11Dec 1, 2023Updated 2 years ago
- ☆58Oct 4, 2025Updated 5 months ago
- Tutorial on time-series forcasting with scikit-learn☆33Mar 14, 2023Updated 2 years ago
- Financial Modelling and Computation☆11Nov 19, 2017Updated 8 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- The code of APP, Scalable Graph Embedding for Asymmetric Proximity. Zhou, Chang and Liu, Yuqiong and Liu, Xiaofei and Liu, Zhongyi and Ga…☆10Nov 11, 2018Updated 7 years ago
- https://duyet.github.io/related-skills-visualization/index.html☆11Jul 11, 2020Updated 5 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 2 years ago
- A comprehensive e-commerce database for testing and developing AI agents☆11Apr 18, 2025Updated 10 months ago
- A graphical representation of relations between programming languages, technologies and skills in demand, based on tens of thousands of j…☆13Nov 25, 2023Updated 2 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- A context-aware embedding similarity score☆11Aug 23, 2023Updated 2 years ago
- ☆10Apr 7, 2023Updated 2 years ago
- An analysis of arXiv data, in terms of AI and Deep Learning research☆11Feb 19, 2021Updated 5 years ago
- Group project for the WorldQuant University module, risk management.☆13Feb 3, 2019Updated 7 years ago
- Awesome cheatsheets for Data Science☆12Sep 16, 2019Updated 6 years ago
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆15Feb 24, 2024Updated 2 years ago
- This repository contains the implementation code for paper: Mixup Your Own Pairs☆12Oct 1, 2023Updated 2 years ago
- Heuristics for cardinality constrained portfolio optimisation☆12Nov 3, 2018Updated 7 years ago
- Pytorch implementation of standard metrics for clustering☆10Mar 21, 2023Updated 2 years ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- A repository to get acquainted with basic training tasks in natural language processing and machine learning☆11Dec 27, 2023Updated 2 years ago
- Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"☆14Sep 9, 2025Updated 5 months ago
- People Analytics Datasets☆12Jun 7, 2021Updated 4 years ago
- Gathers machine learning and deep learning models for Stock forecasting including trading bots and simulations☆10Apr 18, 2022Updated 3 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Feb 2, 2026Updated last month
- Monthly air passengers and landings at San Francisco International Airport (SFO)☆12Mar 16, 2023Updated 2 years ago
- Network analysis with Input-Output Matrix. Project developed as a conclusion of my graduation in economics where I explored the use of ne…☆14Oct 13, 2019Updated 6 years ago
- Repository to create CCKGs from the paper "Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-…☆11May 23, 2025Updated 9 months ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated last year
- A minimal working example of using undetected-chromedriver on AWS Lambda with Selenium and Docker☆19Aug 12, 2025Updated 6 months ago
- Package for making crosswalks among different Occupational codes☆14Oct 3, 2023Updated 2 years ago
- ☆12Jan 2, 2024Updated 2 years ago