SOM-Research / HFCommunity
HFCommunity offers an offline, up-to-date relational database built from the data available on the Hugging Face Hub, providing queryable data about the repositories hosted on the Hub.
☆14 Updated 3 months ago
Alternatives and similar repositories for HFCommunity:
Users who are interested in HFCommunity are comparing it to the libraries listed below:
- Graph-based method for end-to-end code completion with context awareness on repository ☆54 Updated 4 months ago
- Two automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so… ☆55 Updated 9 months ago
- Analyzing and scoring reasoning traces of LLMs ☆41 Updated 4 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024 ☆54 Updated 4 months ago
- codellm-devkit provides unified language to get off-the-shelf static analysis for multiple programming languages and support for applyin… ☆54 Updated this week
- BLANCA - Benchmarks for LANguage models on Coding Artifacts ☆8 Updated 2 years ago
- Incremental Python parser for constrained generation of code by LLMs. ☆15 Updated 4 months ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models. ☆12 Updated 3 months ago
- A package dedicated to running benchmark agreement testing ☆15 Updated last month
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆51 Updated this week
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆45 Updated last year
- Finding semantically meaningful and accurate prompts. ☆46 Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆45 Updated last year
- Design and implement chatbots in Python ☆41 Updated this week
- Open Implementations of LLM Analyses ☆96 Updated 3 months ago
- A Bias Tester framework for LLMs ☆14 Updated 2 months ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules" ☆39 Updated this week
- The LM Contamination Index is a manually created database of contamination evidence for LMs. ☆75 Updated 9 months ago
- AI Evaluation Platform ☆45 Updated this week
- 🦄 Unitxt: a Python library for getting data fired up and set for training and evaluation ☆169 Updated this week
- Source code and data for Like a Good Nearest Neighbor ☆28 Updated last week
- Experiments to assess SPADE on different LLM pipelines. ☆16 Updated 9 months ago
- [NeurIPS XAIA & Springer] Code and notebooks for the paper "A Fresh Look at Sanity Checks for Saliency Maps" ☆25 Updated 6 months ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation ☆28 Updated 3 weeks ago
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models ☆61 Updated last year
- Example Bots built with the Xatkit framework ☆11 Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 Updated last year
- Reasoning by Communicating with Agents ☆23 Updated 3 months ago
- Learning High-Quality and General-Purpose Phrase Representations. Findings of EACL 2024 ☆11 Updated 10 months ago