Creating the tools and data sets necessary to evaluate vulnerabilities in LLMs.
☆27Mar 14, 2025Updated 11 months ago
Alternatives and similar repositories for evaluating-LLMs
Users that are interested in evaluating-LLMs are comparing it to the libraries listed below
Sorting:
- A simple & community made twitter bot. It generates X for Y to help you come up with a start up idea.☆14Dec 24, 2021Updated 4 years ago
- Children's Programming and Artificial Intelligence Education☆10Dec 30, 2019Updated 6 years ago
- A library for Partially Homomorphic Encryption in Python☆12May 30, 2017Updated 8 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- Dataset for training EEG IC classifiers.☆13Aug 29, 2021Updated 4 years ago
- A file-backed dictionary for Python☆12Aug 15, 2022Updated 3 years ago
- A tiny load balancer, implemented by XDP.☆12Nov 25, 2024Updated last year
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- ☆11Jan 3, 2024Updated 2 years ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆12Dec 5, 2023Updated 2 years ago
- Reproduction of Curiosity-driven Exploration by Self-supervised Prediction in PyTorch☆13Jun 10, 2019Updated 6 years ago
- ☆26Jun 28, 2025Updated 8 months ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- 9th solution☆11Oct 11, 2022Updated 3 years ago
- Code for 'Alzheimer’s Disease Classification Using Cluster-based Labelling for Graph Neural Network on Tau PET Imaging and Heterogeneous …☆12Sep 13, 2022Updated 3 years ago
- ☆14Jun 8, 2018Updated 7 years ago
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 2 years ago
- autoredteam: code for training models that automatically red team other language models☆15Aug 9, 2023Updated 2 years ago
- Toolkit for building prompt templates for language models☆12Sep 30, 2022Updated 3 years ago
- ☆12Dec 23, 2020Updated 5 years ago
- VisBERT: Demo web app for "How Does BERT Answer Questions?"☆11Jul 22, 2023Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆68Feb 8, 2023Updated 3 years ago
- Node feature discovery, detects the available hardware features and configuration in a cluster.☆17Updated this week
- Add function calling to text-generation-inference☆13Oct 10, 2023Updated 2 years ago
- jQuery VS JS comparison table, Learn JS through jupyter notebook.☆11Sep 27, 2019Updated 6 years ago
- Project to use OpenAPI generators to build code from 5GC_API☆13Feb 15, 2023Updated 3 years ago
- ☆16Jun 20, 2023Updated 2 years ago
- The official implemention for the paper "Joint Spatial-Temporal and Appearance Modeling with Transformer for Multiple Object Tracking".☆13Oct 20, 2022Updated 3 years ago
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing".☆14Sep 30, 2023Updated 2 years ago
- ☆16Feb 10, 2026Updated 2 weeks ago
- Python standalone tokenizer☆15Nov 12, 2015Updated 10 years ago
- High-Speed Stateful Packet Processor for Programmable Switches☆14Dec 18, 2022Updated 3 years ago
- Let us try implementing SAN in pytorch from scratch☆16Jun 7, 2018Updated 7 years ago
- A set of procedures to estimate the readability of a text☆15Apr 30, 2018Updated 7 years ago
- Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021)☆14Jan 5, 2022Updated 4 years ago
- ☆16Apr 19, 2021Updated 4 years ago
- TREC QA dataset for question answering cleaned for usage in Question Answering☆14Aug 26, 2019Updated 6 years ago
- ☆15Mar 26, 2024Updated last year
- Stock selection and portfolio performance based on ESG Scores☆14Mar 16, 2021Updated 4 years ago