kr-ramesh / synthtextevalLinks
SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data For High-Stakes Domains (EMNLP 2025 System Demonstration)
☆23Updated 2 weeks ago
Alternatives and similar repositories for synthtexteval
Users that are interested in synthtexteval are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆73Updated 4 months ago
- Code for Zero-Shot Tokenizer Transfer☆141Updated 10 months ago
- https://footprints.baulab.info☆17Updated last year
- Code for the ACL 2023 paper: "Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Sc…☆34Updated 2 years ago
- Resources for cultural NLP research☆107Updated last month
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆25Updated 3 months ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆40Updated 11 months ago
- Repository for research in the field of Responsible NLP at Meta.☆202Updated 6 months ago
- ☆34Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆141Updated 4 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆70Updated last year
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 7 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆92Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆71Updated last year
- Synthetic Data Generation for Evaluation☆13Updated 9 months ago
- A simple evaluation of generative language models and safety classifiers.☆76Updated 3 weeks ago
- Repository for the ACL 2024 conference website☆18Updated 9 months ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆49Updated 4 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆135Updated last year
- ☆65Updated 2 years ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆141Updated last month
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆80Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆21Updated 3 months ago
- ☆58Updated last year
- State-of-the-art paired encoder and decoder models (17M-1B params)☆53Updated 3 months ago
- A package dedicated for running benchmark agreement testing☆18Updated 2 months ago
- ☆82Updated this week
- code for training & evaluating Contextual Document Embedding models☆200Updated 6 months ago