for-ai / AI-Alignment-Cohort

☆21

Alternatives and similar repositories for AI-Alignment-Cohort:

Users that are interested in AI-Alignment-Cohort are comparing it to the libraries listed below

tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆102Updated 4 months ago
ayulockin / neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
☆124Updated last year
yash-srivastava19 / arrakis
Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.
☆25Updated this week
google-deepmind / mishax
☆121Updated this week
anyscale / e2e-llm-workflows
End-to-End LLM Guide
☆101Updated 7 months ago
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆81Updated last year
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆255Updated last year
ThinamXx / Meta-llama
Complete implementation of Llama2 with/without KV cache & inference 🚀
☆47Updated 8 months ago
neubig / minllama-assignment
☆80Updated 4 months ago
melisa-writer / short-transformers
Prune transformer layers
☆67Updated 8 months ago
apartresearch / interpretability-starter
🧠 Starter templates for doing interpretability research
☆65Updated last year
METR / ai-rd-tasks
☆61Updated 2 weeks ago
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆145Updated 3 months ago
adithya-s-k / indic_eval
A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks
☆32Updated 8 months ago
callummcdougall / ARENA_2.0
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
☆209Updated last year
nyunAI / Faster-LLM-Survey
☆40Updated 9 months ago
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆121Updated 9 months ago
HackerCupAI / starter-kits
☆64Updated 4 months ago
srush / GPTWorld
A puzzle to learn about prompting
☆124Updated last year
stanford-cs324 / winter2022
Website
☆51Updated 2 years ago
TransformerLensOrg / CircuitsVis
Mechanistic Interpretability Visualizations using React
☆227Updated last month
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated last year
EleutherAI / delphi
☆149Updated this week
gau-nernst / learn-cuda
Learn CUDA with PyTorch
☆16Updated 2 weeks ago
normster / llm_rules
RuLES: a benchmark for evaluating rule-following in language models
☆217Updated this week
pacman100 / openhathi_instruct
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…
☆23Updated last year
hesamsheikh / llm-mechanics
Coding an LLM and its building blocks from scratch.
☆16Updated 2 weeks ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆75Updated 2 months ago