CharFormer(Tay et al., 2022; Gradient-based Subword Tokenizer + T5) model implementation for Huggingface Transformers
☆19Oct 14, 2024Updated last year
Alternatives and similar repositories for gbswt5
Users that are interested in gbswt5 are comparing it to the libraries listed below
Sorting:
- ☆28Feb 21, 2025Updated last year
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)☆25Apr 11, 2022Updated 3 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- The PyTorch implementation of paper "KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation"☆15Jul 4, 2025Updated 8 months ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- "SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation", WSDM 2025☆15Nov 25, 2025Updated 3 months ago
- ☆10May 1, 2025Updated 10 months ago
- A extension of Transformers library to include T5ForSequenceClassification class.☆40Apr 17, 2023Updated 2 years ago
- A repository to get acquainted with basic training tasks in natural language processing and machine learning☆11Dec 27, 2023Updated 2 years ago
- ☆10Nov 30, 2024Updated last year
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆15Feb 24, 2024Updated 2 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- ☆10Dec 8, 2022Updated 3 years ago
- ☆10Aug 6, 2022Updated 3 years ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- A context-aware embedding similarity score☆11Aug 23, 2023Updated 2 years ago
- MuLe: Multi-Grained Graph Learning for Multi-Behavior Recommendation (CIKM 2024)☆14Dec 21, 2024Updated last year
- This repository contains the implementation code for paper: Mixup Your Own Pairs☆12Oct 1, 2023Updated 2 years ago
- Official code for the LoG2022 paper -- MSGNN: A Spectral Graph Neural Network Based on a Novel Magnetic Signed Laplacian.☆13Feb 8, 2025Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- Pytorch implementation of standard metrics for clustering☆10Mar 21, 2023Updated 2 years ago
- 技术面试需要掌握的基础知识整理,欢迎编辑~☆10Apr 10, 2018Updated 7 years ago
- ☆41Jul 8, 2016Updated 9 years ago
- Codebase for "Linking Surface Facts to Large-Scale Knowledge Graphs" (EMNLP 2023)☆13May 8, 2024Updated last year
- Partial code for "Skill Extraction from Job Postings using Weak Supervision" at RecSysHR 2022.☆13May 19, 2023Updated 2 years ago
- Asus Prime Z490-A-OpenCore-Hackintosh☆12Aug 19, 2022Updated 3 years ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated last year
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆15Apr 23, 2025Updated 10 months ago
- Movie recommendation system built with factorization machines and deep learning☆10Jan 28, 2019Updated 7 years ago
- Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"☆14Sep 9, 2025Updated 6 months ago
- ☆12Jan 2, 2024Updated 2 years ago
- Repository to create CCKGs from the paper "Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-…☆11May 23, 2025Updated 9 months ago
- LDPC codes for Illumina sequencing-based DNA storage☆11Dec 2, 2020Updated 5 years ago
- A modular and extensible Python framework, designed to aid in the creation of high-quality, unbiased datasets to build robust models for …☆20Nov 4, 2025Updated 4 months ago
- A minimal working example of using undetected-chromedriver on AWS Lambda with Selenium and Docker☆19Aug 12, 2025Updated 6 months ago
- Code for "RADCoT: Retrieval-Augmented Distillation to Specialization Models for Generating Chain-of-Thoughts in Query Expansion", LREC-CO…☆11May 25, 2024Updated last year