microsoft / SafeNLPLinks
Safety Score for Pre-Trained Language Models
☆96Updated 2 years ago
Alternatives and similar repositories for SafeNLP
Users that are interested in SafeNLP are comparing it to the libraries listed below
Sorting:
- This project studies the performance and robustness of language models and task-adaptation methods.☆155Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆85Updated 2 years ago
- Pretraining Efficiently on S2ORC!☆178Updated last year
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆61Updated 11 months ago
- A framework for few-shot evaluation of autoregressive language models.☆105Updated 2 years ago
- An instruction-based benchmark for text improvements.☆143Updated 3 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆210Updated last year
- A unified benchmark for math reasoning☆89Updated 2 years ago
- ☆145Updated 11 months ago
- ☆184Updated 2 years ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆78Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆96Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 6 months ago
- This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.1…☆136Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆179Updated 2 years ago
- ☆180Updated 2 years ago
- ☆44Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆57Updated last year
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated 3 years ago
- 🚢 Data Toolkit for Sailor Language Models☆95Updated 10 months ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆121Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆183Updated 3 years ago
- A Multilingual Replicable Instruction-Following Model☆95Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆137Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆95Updated 2 years ago
- ☆162Updated last year
- Tools for managing datasets for governance and training.☆87Updated last month