ripl-org/sockit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ripl-org/sockit)

ripl-org / sockit

Sockit is a natural-language processing toolkit for modeling structured occupation information and Standard Occupational Classification (SOC) codes in unstructured text from job titles, job postings, and resumes.

☆25

Alternatives and similar repositories for sockit

Users that are interested in sockit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RenaissancePhilanthropy / careernet-data
View on GitHub
Career navigation QA dataset
☆20Mar 26, 2026Updated 4 months ago
dan-grant-hunter / ONET_Analysis_Classification
View on GitHub
An analysis of abilities, skills and tech skills data from the O*NET database as well as classification of around 500 random LinkedIn job…
☆20Nov 27, 2020Updated 5 years ago
aeturrell / occupationcoder
View on GitHub
Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.
☆76Jun 27, 2024Updated 2 years ago
AbbasRafatpanah / ecommerce-database
View on GitHub
A comprehensive e-commerce database for testing and developing AI agents
☆10Apr 18, 2025Updated last year
bank-of-england / occupationcoder
View on GitHub
Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.
☆37Jun 17, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
mruiyangyou / RAG-enhanced-LLM-Agent-for-text-to-SQL-generation
View on GitHub
☆14Apr 7, 2024Updated 2 years ago
Jackal08 / sa_risk_management
View on GitHub
Group project for the WorldQuant University module, risk management.
☆13Feb 3, 2019Updated 7 years ago
duyet / related-skills-visualization
View on GitHub
https://duyet.github.io/related-skills-visualization/index.html
☆11Jul 11, 2020Updated 6 years ago
latynt / ans
View on GitHub
Arabic News Stance Corpus
☆11Feb 5, 2021Updated 5 years ago
hieudx149 / X-RetroMAE
View on GitHub
Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder
☆10Mar 16, 2023Updated 3 years ago
google / t5patches
View on GitHub
T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.
☆12May 31, 2024Updated 2 years ago
glemaitre / datascience_starter_course
View on GitHub
☆11Dec 1, 2023Updated 2 years ago
junhua / EPIC
View on GitHub
EPIC: a large collection of over 30 million epidemic-related tweets
☆12Jul 28, 2020Updated 6 years ago
amazon-science / unique-batches
View on GitHub
☆11Aug 13, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
RamiKrispin / sfo
View on GitHub
Monthly air passengers and landings at San Francisco International Airport (SFO)
☆12Mar 16, 2023Updated 3 years ago
Tomiinek / Aargh
View on GitHub
☆12Jan 2, 2024Updated 2 years ago
CreaLabs / Enhanced-BGE-M3-with-CLP-and-MoE
View on GitHub
This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…
☆11Dec 27, 2024Updated last year
seanjtaylor / treasury-data
View on GitHub
Scraper/Parser for Daily Treasury Statements
☆13Apr 14, 2019Updated 7 years ago
thunlp / CSS-LM
View on GitHub
CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models
☆11Jul 1, 2023Updated 3 years ago
LUMIA-Group / LoT-insts
View on GitHub
The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…
☆12Feb 19, 2023Updated 3 years ago
ShaojieJiang / CT-Loss
View on GitHub
The contrastive token loss function for reducing generative repetition of autoregressive neural language models.
☆13May 11, 2022Updated 4 years ago
emunozlorenzo / MyCheatSheets
View on GitHub
Awesome cheatsheets for Data Science
☆12Sep 16, 2019Updated 6 years ago
ychen-stat-ml / kernel-adapters
View on GitHub
Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…
☆11Feb 6, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
UKPLab / incorporating-relevance
View on GitHub
Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…
☆14Mar 30, 2026Updated 3 months ago
KellerJordan / TriMap-PyTorch
View on GitHub
Implementation of TriMap dimensionality reduction in PyTorch
☆16May 26, 2018Updated 8 years ago
potamides / uniformers
View on GitHub
Token-free Language Modeling with ByGPT5 & Friends!
☆12Jul 18, 2025Updated last year
UKPLab / AdaSent
View on GitHub
This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…
☆16Jun 3, 2024Updated 2 years ago
mfrdixon / GP-CVA
View on GitHub
☆15Oct 7, 2019Updated 6 years ago
Jwata / job-word-embeddings
View on GitHub
Word embeddings for job postings
☆13Dec 8, 2022Updated 3 years ago
salesforce / hydra-sum
View on GitHub
☆10May 1, 2025Updated last year
amazon-science / irgr
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
Mihir3009 / In-BoXBART
View on GitHub
In-BoXBART: Get Instructions into Biomedical Multi-task Learning
☆15Aug 23, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
appeler / ethnicolr2
View on GitHub
Ethnicolr implementation with new models in pytorch
☆22Updated this week
MeetElise / surprise-similarity
View on GitHub
A context-aware embedding similarity score
☆11Aug 23, 2023Updated 2 years ago
ctanujit / FEWNet
View on GitHub
☆17Mar 12, 2025Updated last year
izhx / uni-rep
View on GitHub
Code for embedding and retrieval research.
☆16Oct 24, 2023Updated 2 years ago
dianaow / d3-network-time
View on GitHub
d3 plugin to create a temporal network visualization
☆18Jan 6, 2023Updated 3 years ago
JohnTailor / BertSenClu
View on GitHub
Topic Model based on Pretrained Sentence Embeddings (with BERT)
☆13Feb 8, 2023Updated 3 years ago
jofmi / agentpy_workshop
View on GitHub
Interactive notebooks with tutorials for the agentpy package.
☆15Feb 1, 2022Updated 4 years ago