UKPLab / acl2024-ircoder
Data creation, training and eval scripts for the IRCoder paper
☆20 · Updated last year
Alternatives and similar repositories for acl2024-ircoder
Users interested in acl2024-ircoder are comparing it to the libraries listed below.
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models". ☆85 · Updated last year
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning" ☆118 · Updated 2 years ago
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ☆267 · Updated last year
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ☆132 · Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) ☆170 · Updated 5 months ago
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories ☆67 · Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆119 · Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆129 · Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆81 · Updated last year
- Source code for the paper "ReACC: A Retrieval-Augmented Code Completion Framework" ☆65 · Updated 3 years ago
- Awesome LLM Self-Consistency: a curated list of self-consistency in Large Language Models ☆119 · Updated 6 months ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following ☆136 · Updated last year
- Baselines for all tasks from Long Code Arena benchmarks 🏟️ ☆39 · Updated 10 months ago
- APIBench is a benchmark for evaluating the performance of API recommendation approaches released in the paper "Revisiting, Benchmarking a… ☆65 · Updated 2 years ago
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization ☆41 · Updated 11 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation ☆109 · Updated last week
- Data and Code for Program of Thoughts [TMLR 2023] ☆303 · Updated last year
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs) ☆59 · Updated last year
- ☆33 · Updated 2 years ago
- Implementation of the paper "Making Retrieval-Augmented Language Models Robust to Irrelevant Context" ☆75 · Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆151 · Updated last year
- Implementation of the ICML 2023 paper "Specializing Smaller Language Models towards Multi-Step Reasoning" ☆132 · Updated 2 years ago
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task ☆154 · Updated 5 months ago
- ☆187 · Updated 7 months ago
- Code for the paper "SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning" ☆48 · Updated 2 years ago
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji… ☆240 · Updated 2 years ago
- ☆22 · Updated 2 years ago
- ☆294 · Updated 2 years ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆56 · Updated last year
- ☆33 · Updated 4 months ago