NL2Code / CodeMLinks

☆44

Alternatives and similar repositories for CodeM

Users that are interested in CodeM are comparing it to the libraries listed below

Sorting:

nyu-mll / ILF-for-code-generation
☆78Updated 4 months ago
xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆65Updated 2 years ago
LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆55Updated last year
choosewhatulike / case2code
☆15Updated 3 months ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆44Updated last year
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆59Updated last year
zorazrw / odex
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆48Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
gersteinlab / Struc-Bench
[NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naa…
☆54Updated this week
facebookresearch / lss_eval
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Updated last year
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30Updated 2 weeks ago
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27Updated 2 years ago
PootieT / explain-then-translate
Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…
☆29Updated last year
Zyphra / Zyda_processing
☆37Updated last year
zhangir-azerbayev / proof-pile
Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.
☆21Updated 2 years ago
luohongyin / EntST
Entailment self-training
☆25Updated 2 years ago
Eureka6174 / LearnNLPlan
Learning to Program with Natural Language
☆6Updated last year
laramohan / wikillm
LLMs as Collaboratively Edited Knowledge Bases
☆45Updated last year
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32Updated 2 years ago
oriyor / assistantbench
Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"
☆59Updated 7 months ago
vaguenebula / AlpacaDataReflect
An experiment to see if chatgpt can improve the output of the stanford alpaca dataset
☆12Updated 2 years ago
DAMO-NLP-SG / CaRing
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs
☆38Updated last year
TheDuckAI / arb
Advanced Reasoning Benchmark Dataset for LLMs
☆47Updated last year
VITA-Group / ChainCoder
[ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …
☆40Updated last year
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Updated 8 months ago
Edward-Sun / RECITE
Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI
☆94Updated 2 years ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
allenai / super-benchmark
☆45Updated 4 months ago
sambanova / generative_data_prep
☆64Updated 3 months ago
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49Updated last year