LLM-Ethics / EthicsSuite
A test suite (i.e., a dataset) of ~20k moral situations for understanding LLM behavior.
☆12 Updated last year
Related projects
Alternatives and complementary repositories for EthicsSuite
- In this repository, you'll find a curated selection of recent research papers, articles, and implementations from leading experts in the … ☆16 Updated last year
- [EMNLP 2024] CodeJudge: Evaluating Code Generation with Large Language Models ☆21 Updated last week
- APIBench is a benchmark for evaluating the performance of API recommendation approaches released in the paper "Revisiting, Benchmarking a… ☆53 Updated last year
- A toolkit for testing machine translation [ICSE'20, '21, ESEC/FSE'20] ☆33 Updated 3 years ago
- Repository for the Adversarial ML on Code things ☆16 Updated 4 years ago
- Replication Package for "Compressing Pre-trained Models of Code into 3 MB", ASE 2022 ☆26 Updated last month
- Large Language Models Meet NL2Code: A Survey ☆34 Updated this week
- Collections of research, benchmarks and tools towards more robust and reliable language models for code; LM4Code; LM4SE; reliable LLM; L… ☆22 Updated 11 months ago
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation ☆145 Updated 8 months ago
- Repository for PsychoEvals - a framework for LLM security, psychoanalysis, and moderation. ☆15 Updated last year
- The official repository of "ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models" ☆43 Updated last year
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules" ☆35 Updated 11 months ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback ☆54 Updated 2 months ago
- The data processing pipeline for the Koala chatbot language model ☆117 Updated last year
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond ☆20 Updated last year
- Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents. ☆297 Updated 11 months ago
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23) ☆79 Updated last year
- Evaluation results of code generation LLMs ☆29 Updated last year
- Recent Advances in Programming Language Pre-Trained Models (PL-PTMs) ☆57 Updated 2 years ago
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness" ☆33 Updated last year
- Source code for the paper "ReACC: A Retrieval-Augmented Code Completion Framework" ☆59 Updated 2 years ago
- Magnum-NLC2CMD is the winning solution for the NeurIPS 2020 NLC2CMD challenge. ☆31 Updated last year
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning" ☆97 Updated 10 months ago
- EVIL (Exploiting software VIa natural Language) is an approach to automatically generate software exploits in assembly/Python language fr… ☆27 Updated 2 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024 ☆12 Updated 7 months ago