Cohere-Labs-Community / AI-Alignment-CohortLinks

☆28

Alternatives and similar repositories for AI-Alignment-Cohort

Users that are interested in AI-Alignment-Cohort are comparing it to the libraries listed below

Sorting:

MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆188Updated 2 months ago
yash-srivastava19 / arrakis
Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.
☆31Updated 3 months ago
tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆110Updated 10 months ago
hkproj / multi-latent-attention
☆43Updated 2 months ago
srush / GPTWorld
A puzzle to learn about prompting
☆132Updated 2 years ago
ThinamXx / Meta-llama
Complete implementation of Llama2 with/without KV cache & inference 🚀
☆48Updated last year
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆164Updated 4 months ago
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256Updated last year
adithya-s-k / indic_eval
A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks
☆37Updated last year
ayulockin / neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
☆125Updated last year
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆318Updated 2 weeks ago
HackerCupAI / starter-kits
☆64Updated 9 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆66Updated 2 weeks ago
callummcdougall / ARENA_2.0
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
☆220Updated last year
1y33 / 100Days
GPU Kernels
☆191Updated 3 months ago
melisa-writer / short-transformers
Prune transformer layers
☆69Updated last year
ThinamXx / build-GPT
Building GPT ...
☆18Updated 8 months ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆196Updated 2 months ago
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆164Updated last month
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆157Updated 3 months ago
kmohan321 / Research_Papers
☆46Updated 4 months ago
apartresearch / interpretability-starter
🧠 Starter templates for doing interpretability research
☆73Updated 2 years ago
sangmichaelxie / cs324_p2
Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)
☆105Updated 2 years ago
rasbt / RAGs
RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems
☆122Updated 6 months ago
eugeneyan / llm-paper-notes
Notes from the Latent Space paper club. Follow along or start your own!
☆235Updated last year
pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆388Updated last year
marin-community / marin
☆347Updated this week
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆86Updated last year
naklecha / llm-inference-optimizations-explained
in this repository, i'm going to implement increasingly complex llm inference optimizations
☆64Updated 2 months ago
jcolano / DPO
Direct Preference Optimization Implementation
☆16Updated last year