SOM-Research / HFCommunityLinks
HFCommunity offers an offline up-to-date relational database built from the data available at the Hugging Face Hub, providing queriable data about the repositories hosted in the Hub
☆15Updated 11 months ago
Alternatives and similar repositories for HFCommunity
Users that are interested in HFCommunity are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆56Updated last week
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆66Updated last year
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so…☆56Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆49Updated last year
- code for "Natural Language to Code Translation with Execution"☆41Updated 2 years ago
- Web queries dataset for code search☆32Updated 2 years ago
- ☆28Updated 3 weeks ago
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code☆79Updated last year
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆14Updated last year
- [ACL 2024-Main] ArchCode: Incorporating Software Requirements in Code Generation with Large Language Models☆16Updated 8 months ago
- ☆14Updated last week
- ☆78Updated 6 months ago
- A collection of recent papers, benchmarks and datasets of AI4Code domain.☆58Updated last year
- Semantic Code Search☆36Updated 2 years ago
- For our ACL25 Paper: Can Language Models Replace Programmers? RepoCod Says ‘Not Yet’ - by Shanchao Liang and Yiran Hu and Nan Jiang and L…☆22Updated 3 weeks ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆249Updated last year
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆26Updated 2 years ago
- Set of PyTorch modules for developing and evaluating different algorithms for embedding trees.☆22Updated 3 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated last year
- ☆39Updated 3 months ago
- ☆14Updated last year
- [ICML 2021] Break-It-Fix-It: Unsupervised Learning for Program Repair☆119Updated 2 years ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆62Updated 11 months ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆41Updated 2 years ago
- Jigsaw Dataset: Natural language to Python Pandas code☆53Updated 2 years ago
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆15Updated 6 months ago
- We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆23Updated 3 years ago
- ☆15Updated 3 years ago
- Evaluation results of code generation LLMs☆31Updated 2 years ago
- ☆36Updated 3 years ago