A modular and extensible Python framework, designed to aid in the creation of high-quality, unbiased datasets to build robust models for MGT-related tasks such as detection, attribution, and boundary detection.
☆19Nov 4, 2025Updated 3 months ago
Alternatives and similar repositories for TextMachina
Users that are interested in TextMachina are comparing it to the libraries listed below
Sorting:
- Official code repository for article Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts☆33Mar 15, 2025Updated 11 months ago
- ☆11Jan 8, 2025Updated last year
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- ☆10May 1, 2025Updated 10 months ago
- OneStop: A 360-Participant Eye Tracking Dataset with Different Reading Regimes☆16Dec 5, 2025Updated 2 months ago
- Data from the Sequoia treebank.☆11Feb 19, 2026Updated last week
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- Ensemble Integration: a customizable pipeline for generating multi-modal, heterogeneous ensembles☆21Oct 30, 2024Updated last year
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- A context-aware embedding similarity score☆11Aug 23, 2023Updated 2 years ago
- Streamlit Multi AI Platform Chat App☆10Nov 5, 2024Updated last year
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- Neural ngram language model in PyTorch.☆10Sep 27, 2018Updated 7 years ago
- Estimation of the confidence measure for anomaly detectors, as explained in the paper "Quantifying the Confidence of Anomaly Detectors in…☆12Nov 17, 2021Updated 4 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 2 years ago
- ☆10Dec 21, 2024Updated last year
- Docker image for NS-3 Network Simulator v.3.30☆13Apr 11, 2021Updated 4 years ago
- Contains Colab Notebooks show cool use-cases of different GCP ML APIs.☆10Nov 5, 2020Updated 5 years ago
- ☆10Oct 31, 2023Updated 2 years ago
- Autoencoder based iterative modeling and multivariate time-series subsequence clustering algorithm (ABIMCA)☆12Dec 20, 2022Updated 3 years ago
- Pytorch implementation of standard metrics for clustering☆10Mar 21, 2023Updated 2 years ago
- These are tools I cheated with the help of ChatGPT to help me with Penetration Testing and Red Teaming☆15Feb 24, 2024Updated 2 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- This repository contains the implementation code for paper: Mixup Your Own Pairs☆12Oct 1, 2023Updated 2 years ago
- A repository to get acquainted with basic training tasks in natural language processing and machine learning☆11Dec 27, 2023Updated 2 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- Framework for training dependency parsing models.☆12Jun 12, 2024Updated last year
- ACL style for Typst☆21Jan 27, 2026Updated last month
- Python library of Dynamic Treatment Regimes☆10Oct 26, 2020Updated 5 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆15Apr 23, 2025Updated 10 months ago
- Topic Model based on Pretrained Sentence Embeddings (with BERT)☆13Feb 8, 2023Updated 3 years ago
- A minimal working example of using undetected-chromedriver on AWS Lambda with Selenium and Docker☆19Aug 12, 2025Updated 6 months ago
- ☆12Jan 2, 2024Updated 2 years ago