Creating the tools and data sets necessary to evaluate vulnerabilities in LLMs.
☆27Mar 14, 2025Updated last year
Alternatives and similar repositories for evaluating-LLMs
Users that are interested in evaluating-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆13Nov 14, 2022Updated 3 years ago
- ☆22Jan 5, 2024Updated 2 years ago
- My solution for the ''LLM - Detect AI Generated Text'' kaggle competition☆16Feb 2, 2024Updated 2 years ago
- Children's Programming and Artificial Intelligence Education☆11Dec 30, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A library for Partially Homomorphic Encryption in Python☆12May 30, 2017Updated 8 years ago
- VisBERT: Demo web app for "How Does BERT Answer Questions?"☆11Jul 22, 2023Updated 2 years ago
- 3rd Place solution for Feedback Prize - Predicting Effective Arguments Kaggle competition☆16Sep 6, 2022Updated 3 years ago
- ☆26Nov 21, 2022Updated 3 years ago
- Stacking Machine Learning Models. Tunning; feature engineering, scaling, models combinations and parameters.☆11Oct 4, 2020Updated 5 years ago
- Deep Just-In-Time Inconsistency Detection Between Comments and Source Code: Artifact☆23Jul 21, 2025Updated 8 months ago
- ☆18Mar 25, 2024Updated 2 years ago
- Netflix for XBMC☆61Nov 13, 2012Updated 13 years ago
- 6th Position Solution Code for Kaggle - LLM Science Exam Competition☆24Jul 8, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Materials for the "Modern NLP: Pre-training, Fine-tuning, Prompt Engineering, and Human Feedback" workshop at ODSC East 2023☆23Oct 5, 2023Updated 2 years ago
- DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA☆13Jul 22, 2019Updated 6 years ago
- ☆68Sep 29, 2020Updated 5 years ago
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims☆29May 30, 2023Updated 2 years ago
- a plugin for stackstorm☆14Feb 13, 2019Updated 7 years ago
- An awsome epub3 library.☆15Dec 2, 2023Updated 2 years ago
- Toolkit for building prompt templates for language models☆12Sep 30, 2022Updated 3 years ago
- Benchmarking various Deep Learning models such as BERT, ALBERT, BiLSTMs on the task of sentence entailment using two datasets - MultiNLI …☆28Dec 31, 2020Updated 5 years ago
- ☆27Nov 6, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A curated list of resources, services, and apps for teaching and learning☆12Mar 29, 2021Updated 5 years ago
- Rasa X Jokebot Demo☆16Apr 8, 2024Updated 2 years ago
- A fullstack web app uses material ui and echarts to represent statistics and uses cheerio to scrape coronavirus data in the world.☆13Feb 13, 2026Updated last month
- Accompanies Finastra's Hack to the Future 4 Learning Session "Sustainability reports & NLP"☆10Mar 17, 2022Updated 4 years ago
- Code for 'Alzheimer’s Disease Classification Using Cluster-based Labelling for Graph Neural Network on Tau PET Imaging and Heterogeneous …☆12Sep 13, 2022Updated 3 years ago
- A plugin for the GATE language technology framework for training and using machine learning models. Currently supports Mallet (MaxEnt, N…☆28Apr 17, 2023Updated 2 years ago
- A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick☆294Nov 25, 2023Updated 2 years ago
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing".☆14Sep 30, 2023Updated 2 years ago
- Modified colab notebook to train StyleGAN3 on Google Colab☆12Apr 3, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- By fine tuning GPT2 on News Aggregator data☆15Jan 24, 2021Updated 5 years ago
- Data labeling using few shot learning GPT-3.☆25Mar 26, 2023Updated 3 years ago
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance☆17Sep 23, 2022Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆91Mar 30, 2026Updated last week
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset☆14Feb 2, 2025Updated last year
- ☆14Sep 16, 2021Updated 4 years ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19May 29, 2023Updated 2 years ago