Official implementation for the paper, StackEval: Benchmarking LLMs in Coding Assistance, https://arxiv.org/abs/2412.05288
☆20Oct 30, 2024Updated last year
Alternatives and similar repositories for stack-eval
Users that are interested in stack-eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆32Dec 11, 2024Updated last year
- SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents☆84May 13, 2026Updated 3 weeks ago
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- the source code of IJCAI 2023 paper "Multi-Scale subgraph contrastive learning"☆11Apr 25, 2023Updated 3 years ago
- Language Models for Code Completion: a Practical Evaluation☆13Jan 19, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- g2-MLP: State-of-the-Art Model for Node Classification on Graphs (PPI Dataset)☆10Nov 12, 2022Updated 3 years ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated last year
- Network Together: Node Classification via Cross-Network Deep Network Embedding☆11May 5, 2021Updated 5 years ago
- ☆19Aug 18, 2019Updated 6 years ago
- Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks? (FSE 2020)☆10Sep 23, 2021Updated 4 years ago
- Code for ICML2020 "Sequence Generation with Mixed Representations"☆12Jun 27, 2020Updated 5 years ago
- ☆15Oct 4, 2024Updated last year
- Convert pretrained RoBerta models to various long-document transformer models☆11Apr 5, 2022Updated 4 years ago
- Official implement of RAHG: A Role-Aware Hypergraph Neural Network for Node Classification in Graphs.☆11Jul 5, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Source code and dataset for KDD 2021 paper: Zero-shot Node Classification with Decomposed Graph Prototype Network.☆12Jun 18, 2021Updated 4 years ago
- data set for node classification task☆14Jan 31, 2020Updated 6 years ago
- codes of LEGNN for Semi-supervised Node Classification☆12Jun 1, 2022Updated 4 years ago
- A collection of publications that works on code models but beyond focusing on the accuracies.☆12Jun 30, 2023Updated 2 years ago
- Fixes the rotation of the images based on EXIF data☆15Apr 6, 2026Updated 2 months ago
- 图神经网络在推荐系统的应用☆13Aug 26, 2021Updated 4 years ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 4 years ago
- The Infibench variant of bigcode-evaluation-harness --- a framework for the evaluation of autoregressive code generation language models.☆14Oct 19, 2024Updated last year
- ☆12Mar 15, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Mass Android app vulnerability analysis toolkit☆13Dec 6, 2016Updated 9 years ago
- Pytorch Implementation of LoG 22 [Oral] -- Transductive Linear Probing: A Novel Framework for Few-Shot Node Classification☆17May 31, 2023Updated 3 years ago
- The replication package of <Sentiment Analysis for Software Engineering: How Far Can Pre-trained Transformer Models Go?>. Accepted by IC…☆11Nov 29, 2023Updated 2 years ago
- 基于CodeBert预训练模型,微调后/直接对目标数据集进行测试☆14Oct 19, 2021Updated 4 years ago
- Implementation of the Paper "Goal-Driven Explainable Clustering via Language Descriptions"☆40May 24, 2023Updated 3 years ago
- Code for our paper "Learning to Generate Unit Tests for Automated Debugging"☆18Mar 7, 2025Updated last year
- This is the github to open source benchmark AdvancedIF, see LAMA L1387358RCRO☆34Nov 26, 2025Updated 6 months ago
- ☆17May 25, 2020Updated 6 years ago
- This is the code for the paper "Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation".☆38Apr 15, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 一种基于属性和图神经网络的推荐算法——本科生毕设☆14Mar 20, 2021Updated 5 years ago
- Data Augmentation on Graphs: A Technical Survey☆15Feb 12, 2023Updated 3 years ago
- [PLDI 19'] An Inductive Synthesis Framework for Verifiable Reinforcement Learning☆14Jan 14, 2020Updated 6 years ago
- ☆11Oct 17, 2019Updated 6 years ago
- This is a ROS catkin workspace for a robot in frc☆14Dec 16, 2020Updated 5 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11May 26, 2026Updated last week
- Relative and Absolute Location Embedding for Few-Shot Node Classification on Graph☆13Nov 13, 2022Updated 3 years ago