microsoft / CodeT
☆671 · Updated last year
Alternatives and similar repositories for CodeT
Users who are interested in CodeT are comparing it to the libraries listed below.
- OctoPack: Instruction Tuning Code Large Language Models ☆474 · Updated 9 months ago
- Run evaluation on LLMs using human-eval benchmark ☆424 · Updated 2 years ago
- PaL: Program-Aided Language Models (ICML 2023) ☆517 · Updated 2 years ago
- A framework for the evaluation of autoregressive code generation language models. ☆1,002 · Updated 4 months ago
- ☆481 · Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆555 · Updated 2 years ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆758 · Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ☆259 · Updated last year
- ☆276 · Updated 2 years ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ☆181 · Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023 ☆251 · Updated last year
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks. ☆552 · Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆970 · Updated last year
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ☆230 · Updated last year
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models" ☆804 · Updated last year
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model ☆556 · Updated 10 months ago
- ☆379 · Updated 2 years ago
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them ☆530 · Updated last year
- Fine-tune SantaCoder for Code/Text Generation. ☆194 · Updated 2 years ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur… ☆556 · Updated 10 months ago
- [NeurIPS 2022] WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆436 · Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆472 · Updated last year
- LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24) ☆566 · Updated last year
- CodeGen2 models for program synthesis ☆271 · Updated 2 years ago
- A multi-programming language benchmark for LLMs ☆283 · Updated 2 weeks ago
- A hard gym for programming ☆162 · Updated last year
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit… ☆1,106 · Updated last year
- FacTool: Factuality Detection in Generative AI ☆895 · Updated last year
- ☆768 · Updated last year
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate ☆491 · Updated 7 months ago