LLM-Ethics / EthicsSuite

A test suite (i.e., a dataset) of ~20k moral situations for understanding LLM behavior.

☆12

Related projects

Alternatives and complementary repositories for EthicsSuite

ChiYeungLaw / Awsome-Code-Intelligence
In this repository, you'll find a curated selection of recent research papers, articles, and implementations from leading experts in the …
☆16 Updated last year
VichyTong / CodeJudge
[EMNLP 2024] CodeJudge: Evaluating Code Generation with Large Language Models
☆21 Updated last week
JohnnyPeng18 / APIBench
APIBench is a benchmark for evaluating the performance of API recommendation approaches released in the paper "Revisiting, Benchmarking a…
☆53 Updated last year
RobustNLP / TestTranslation
A toolkit for testing machine translation [ICSE'20, '21, ESEC/FSE'20]
☆33 Updated 3 years ago
jjhenkel / averloc
Repository for the Adversarial ML on Code things
☆16 Updated 4 years ago
soarsmu / Compressor
Replication Package for "Compressing Pre-trained Models of Code into 3 MB", ASE 2022
☆26 Updated last month
NL2Code / NL2Code.github.io
Large Language Models Meet NL2Code: A Survey
☆34 Updated this week
yueyueL / ReliableLM4Code
Collections of research, benchmarks and tools towards more robust and reliable language models for code; LM4Code; LM4SE; reliable LLM; L…
☆22 Updated 11 months ago
imagination-research / sot
[ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
☆145 Updated 8 months ago
NextWordDev / psychoevals
Repository for PsychoEvals - a framework for LLM security, psychoanalysis, and moderation.
☆15 Updated last year
RUCAIBox / ChatCoT
The official repository of "ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models"
☆43 Updated last year
SalesforceAIResearch / CodeChain
Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"
☆35 Updated 11 months ago
Ablustrund / APPS_Plus
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
☆54 Updated 2 months ago
DeepSoftwareAnalytics / SoTaNa
☆125 Updated 2 months ago
zorazrw / awesome-tool-llm
☆190 Updated 3 months ago
young-geng / koala_data_pipeline
The data processing pipeline for the Koala chatbot language model
☆117 Updated last year
DeepSoftwareAnalytics / Telly
Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
☆20 Updated last year
Gentopia-AI / Gentopia
Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.
☆297 Updated 11 months ago
niansong1996 / lever
Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23)
☆79 Updated last year
jamesmurdza / humaneval-results
Evaluation results of code generation LLMs
☆29 Updated last year
wangdeze18 / Multilingual-Adapter-for-SE
☆17 Updated last year
yuewang-cuhk / awesome-programming-language-pretraining-papers
Recent Advances in Programming Language Pre-Trained Models (PL-PTMs)
☆57 Updated 2 years ago
shailja-thakur / CodeGen-Fine-Tuning
☆34 Updated last month
awslabs / diagnostic-robustness-text-to-sql
Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"
☆33 Updated last year
microsoft / ReACC
Source code for the paper "ReACC: A Retrieval-Augmented Code Completion Framework"
☆59 Updated 2 years ago
magnumresearchgroup / Magnum-NLC2CMD
Magnum-NLC2CMD is the winning solution for the NeurIPS 2020 NLC2CMD challenge.
☆31 Updated last year
amazon-science / mxeval
☆101 Updated 4 months ago
reddy-lab-code-research / PPOCoder
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆97 Updated 10 months ago
dessertlab / EVIL
EVIL (Exploiting software VIa natural Language) is an approach to automatically generate software exploits in assembly/Python language fr…
☆27 Updated 2 years ago
thepurpleowl / codequeries-benchmark
Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024
☆12 Updated 7 months ago