lucasmllr / xsbertLinks
explainable Siamese sentence transformers
☆12Updated last year
Alternatives and similar repositories for xsbert
Users that are interested in xsbert are comparing it to the libraries listed below
Sorting:
- ITALIC: An ITALian Intent Classification Dataset☆14Updated last year
- A repository containing the code for translating popular LLM benchmarks to German.☆25Updated last year
- This repository contains an extension of fairseq for pixel / visual representations for machine translation.☆35Updated last year
- A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.☆102Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆103Updated last year
- Evaluation pipeline for the BabyLM Challenge 2023.☆76Updated last year
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆27Updated 9 months ago
- A curated list of research papers and resources on Cultural LLM.☆44Updated 9 months ago
- Repository of the COLING 2022 paper : Ordinal Log-Loss - A simple log-based loss function for ordinal text classification.☆30Updated 2 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆82Updated 9 months ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated last year
- Measuring the Mixing of Contextual Information in the Transformer☆30Updated 2 years ago
- ☆11Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆59Updated 10 months ago
- Materials for EACL2024 tutorial: Transformer-specific Interpretability☆55Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 6 months ago
- A library for minimum Bayes risk (MBR) decoding☆42Updated 3 weeks ago
- Official implementation of "GPT or BERT: why not both?"☆53Updated 2 weeks ago
- Interpretability for sequence generation models 🐛 🔍☆425Updated 2 months ago
- Code for Zero-Shot Tokenizer Transfer☆133Updated 5 months ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated 2 months ago
- An opinionated NLP research template☆11Updated 9 months ago
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.☆292Updated 11 months ago
- NTREX -- News Test References for MT Evaluation☆83Updated last year
- ☆150Updated 10 months ago
- ☆213Updated this week
- This is the data associated with the PERSUADE Corpus 2.0 version☆43Updated 7 months ago
- German Alpaca Dataset (Cleaned + Translated)☆25Updated 2 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆58Updated last year
- Efficient Transformers with Dynamic Token Pooling☆61Updated 2 years ago