A classification model
☆21Apr 24, 2022Updated 3 years ago
Alternatives and similar repositories for self_pretraining
Users that are interested in self_pretraining are comparing it to the libraries listed below
Sorting:
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Mar 29, 2021Updated 4 years ago
- ☆14Aug 5, 2019Updated 6 years ago
- ☆24Jun 12, 2023Updated 2 years ago
- Implementation of 'Commit message generation for source code change'.☆25Oct 23, 2019Updated 6 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆27Sep 12, 2021Updated 4 years ago
- Computing calibrated prediction intervals for neural network regressors☆10May 28, 2019Updated 6 years ago
- CHOLAN: A Modular Approach for Neural Entity Linking on Wikipedia and Wikidata☆32Jan 21, 2022Updated 4 years ago
- Mahjong solitaire as a browser game☆14Jan 26, 2024Updated 2 years ago
- [WebConf 2020] Searching for polarization in signed graphs: a local spectral approach☆10Feb 3, 2024Updated 2 years ago
- Visual-based analysis of file system metadata. The tool enables digital forensics of large volumes of data.☆10May 10, 2024Updated last year
- ☆12Feb 22, 2021Updated 5 years ago
- A collection of powershell scripts that are designed to be ran from a Microsoft Defender for Endpoint Live Response terminal, utilizing o…☆12Apr 26, 2023Updated 2 years ago
- ☆12Dec 14, 2022Updated 3 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- Public malware techniques used in the wild: Virtual Machine, Emulation, Debuggers, Sandbox detection.☆18Mar 22, 2020Updated 5 years ago
- normalizer of numerical / temporal expression☆11Sep 2, 2018Updated 7 years ago
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models☆15Jan 31, 2023Updated 3 years ago
- universal-datalakehouse-postgres-ingestion-deltastreamer☆11Apr 7, 2024Updated last year
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding☆10Jul 15, 2023Updated 2 years ago
- Convert tag files (ctags, gccxml, etc) to databases (sqlite, mysql, etc)☆13Mar 30, 2015Updated 10 years ago
- Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."☆10Jan 26, 2020Updated 6 years ago
- Mapping Echo Chambers In Large Networks☆11Nov 8, 2024Updated last year
- The Swiss Court Ruling Corpus (SCRC) contains code for extracting information from Swiss court rulings☆11Jan 22, 2025Updated last year
- ☆11May 9, 2022Updated 3 years ago
- ☆10Sep 14, 2022Updated 3 years ago
- Malware - Machine Learning☆11Mar 24, 2018Updated 7 years ago
- This repository contains the dataset and implementation details of the paper "An In-depth Analysis of Implicit and Subtle Hate Speech Mes…☆10May 9, 2024Updated last year
- Semantic Scaffolds for Pseudocode-to-Code Generation (accepted by ACL 2020)☆14Jun 7, 2021Updated 4 years ago
- ☆10May 11, 2024Updated last year
- Word Familiarity Rate for 'Word List by Semantic Principles (WLSP)'☆12Jan 2, 2025Updated last year
- Literary Language Toolkit: code, models, corpora, and web tools☆11Mar 28, 2024Updated last year
- ☆12Nov 9, 2018Updated 7 years ago
- Code and data for the CIKM2021 paper "Learning Ideological Embeddings From Information Cascades"☆10Sep 8, 2021Updated 4 years ago
- Machine learning for malware detection☆11Aug 2, 2016Updated 9 years ago
- Word embeddings trained on medical subreddits.☆10Jan 4, 2021Updated 5 years ago
- ☆10Nov 15, 2021Updated 4 years ago
- ☆10May 26, 2022Updated 3 years ago
- Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts"☆11Jul 18, 2021Updated 4 years ago
- Implementation of the spotlight: a method for discovering systematic errors in deep learning models☆11Oct 5, 2021Updated 4 years ago