Genaios / TextMachina
A modular and extensible Python framework, designed to aid in the creation of high-quality, unbiased datasets to build robust models for MGT-related tasks such as detection, attribution, and boundary detection.
☆15Updated 6 months ago
Alternatives and similar repositories for TextMachina:
Users that are interested in TextMachina are comparing it to the libraries listed below
- ☆11Updated last year
- Official code repository for article Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts☆26Updated last year
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆12Updated 6 months ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations.☆13Updated 4 months ago
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆41Updated 2 years ago
- Code and datasets for EMNLP 2022 paper: Beyond prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Repr…☆19Updated last year
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆36Updated 3 months ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆47Updated 2 years ago
- Text generation using language models with multiple exit heads☆15Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆39Updated 9 months ago
- Data and code for the paper "Inducing Positive Perspectives with Text Reframing"☆54Updated last year
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations☆12Updated 3 months ago
- Models for automatically transforming toxic text to neutral☆33Updated last year
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆14Updated 10 months ago
- ☆30Updated 3 months ago
- Embedding Recycling for Language models☆38Updated last year
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Updated 9 months ago
- ☆11Updated 2 years ago
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆21Updated 9 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆12Updated 2 years ago
- Interpretable unified language safety checking with large language models☆30Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆64Updated 7 months ago
- Code/data for MARG (multi-agent review generation)☆38Updated 2 months ago
- The official repo for SocKET: Social Knowledge Evaluation Tests☆22Updated last year
- ☆20Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 2 years ago
- Repository for the ACL 2024 conference website☆18Updated 3 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆40Updated 6 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year