bigcode-project / starcoder2-self-align

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

☆221

Related projects: ⓘ

evalplus / repoqa
RepoQA: Evaluating Long-Context Code Understanding
☆96Updated this week
dwzhu-pku / PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆195Updated 3 months ago
yuchenlin / ZeroEval
A simple unified framework for evaluating LLMs
☆121Updated this week
uukuguy / multi_loras
Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…
☆139Updated 7 months ago
zhentingqi / rStar
☆262Updated this week
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆264Updated 8 months ago
LiveCodeBench / LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
☆173Updated 3 weeks ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers model
☆154Updated 4 months ago
imoneoi / multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
☆170Updated last month
arcee-ai / PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
☆177Updated 4 months ago
bigcode-project / octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
☆421Updated last month
cognitivecomputations / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆229Updated 3 months ago
Re-Align / URIAL
☆284Updated 3 months ago
emrgnt-cmplxty / zero-shot-replication
☆71Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆112Updated last month
arcee-ai / DistillKit
An Open Source Toolkit For LLM Distillation
☆284Updated last month
abacaj / code-eval
Run evaluation on LLMs using human-eval benchmark
☆374Updated last year
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆107Updated last year
FasterDecoding / BitDelta
☆174Updated 4 months ago
wuhy68 / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆123Updated 6 months ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆117Updated 8 months ago
keeeeenw / MicroLlama
Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget
☆115Updated 5 months ago
Psycoy / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆200Updated this week
IBM / ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆218Updated 5 months ago
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆158Updated 2 months ago
FlagOpen / TACO
☆131Updated last month
allenai / lumos
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
☆439Updated 6 months ago
my-other-github-account / llm-humaneval-benchmarks
☆86Updated last year
astramind-ai / BitMat
An efficent implementation of the method proposed in "The Era of 1-bit LLMs"
☆155Updated 2 months ago
dwzhu-pku / LongEmbed
Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"
☆108Updated 4 months ago