Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288
☆20Oct 30, 2024Updated last year
Alternatives and similar repositories for stack-eval
Users that are interested in stack-eval are comparing it to the libraries listed below
Sorting:
- The first, open access evaluation dataset for methods to identify bias by word choice and labeling☆26Oct 30, 2025Updated 4 months ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions"☆40May 24, 2023Updated 2 years ago
- Online Binary Image Index☆13Nov 14, 2016Updated 9 years ago
- g2-MLP: State-of-the-Art Model for Node Classification on Graphs (PPI Dataset)☆10Nov 12, 2022Updated 3 years ago
- Gretchen - An Open-Source Humanoid Robot Development Platform☆11Jul 8, 2019Updated 6 years ago
- Modeller tool for Microsoft employees to figure out their compension☆10Sep 25, 2021Updated 4 years ago
- The repository for the paper "Predicting in-hospital mortality by combining clinical notes with time-series data"☆12May 23, 2021Updated 4 years ago
- ☆11Jan 27, 2026Updated last month
- GitXiv Competition: Replicate the findings of the Deep Q&A research paper, preferably in collaboration with others. Use library of choice…☆40Aug 2, 2015Updated 10 years ago
- ☆11Feb 21, 2019Updated 7 years ago
- This Project focuses on processing legal court decisions and is part of on-going research at New York University. This code is in develop…☆10Jul 17, 2019Updated 6 years ago
- This repository contains the code used in a publication 'Active Learning for Decision-Making from Imbalanced Observational Data', Iiris S…☆11May 14, 2019Updated 6 years ago
- OSX menu bar controlled Tor relay server☆17Oct 28, 2014Updated 11 years ago
- SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals☆11Jul 30, 2021Updated 4 years ago
- Prototype implementation of an architecture suggested in Robot Dream paper (http://arxiv.org/abs/1603.03007)☆12Jul 3, 2019Updated 6 years ago
- GNN模型在引文网络数据集上的代码,包括Cora、Citeseer、Pubmed、ogbn-arxiv☆10Mar 2, 2021Updated 4 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Mar 15, 2017Updated 8 years ago
- Network Together: Node Classification via Cross-Network Deep Network Embedding☆11May 5, 2021Updated 4 years ago
- Replaces occurrences of the word 'literally' with 'figuratively'. That's literally all it does.☆45Nov 7, 2014Updated 11 years ago
- ☆10Oct 17, 2022Updated 3 years ago
- ☆14Jan 10, 2025Updated last year
- simple ansible playbook to take clean ubuntu 18.04 to CUDA 10, PyTorch 1.0, fastai, miniconda heaven☆12Dec 16, 2018Updated 7 years ago
- ☆43Jun 12, 2023Updated 2 years ago
- Official code for AAAI'20 paper "Merging Weak and Active Supervision for Semantic Parsing"☆11Dec 8, 2022Updated 3 years ago
- Layers, datasets and utilities for PyTorch☆10Nov 22, 2023Updated 2 years ago
- JFC! What a hot mess. *Scream into void*☆13Sep 20, 2021Updated 4 years ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated last year
- ☆12Dec 30, 2020Updated 5 years ago
- ☆10Nov 10, 2021Updated 4 years ago
- ☆12Apr 30, 2019Updated 6 years ago
- An agent-based model for scientific inquiry based on abstract argumentation☆13Jan 17, 2022Updated 4 years ago
- Music segmentation by ordinal linear discriminant analysis☆18Nov 10, 2017Updated 8 years ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 3 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- Customized Claude Code system prompts for use with tweakcc — ~48k bytes smaller, 30% faster, same accuracy☆33Nov 23, 2025Updated 3 months ago
- data set for node classification task☆14Jan 31, 2020Updated 6 years ago