Cohere-Labs-Community / AI-Alignment-CohortLinks
☆28Updated 11 months ago
Alternatives and similar repositories for AI-Alignment-Cohort
Users that are interested in AI-Alignment-Cohort are comparing it to the libraries listed below
Sorting:
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 4 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆193Updated 3 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 11 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- GPUGrants - a list of GPU grants that I can think of☆36Updated last week
- A puzzle to learn about prompting☆135Updated 2 years ago
- ☆44Updated 3 months ago
- Fast bare-bones BPE for modern tokenizer training☆164Updated 2 months ago
- Prune transformer layers☆69Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆362Updated last week
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated last year
- Training-Ready RL Environments + Evals☆90Updated last week
- code for training & evaluating Contextual Document Embedding models☆197Updated 4 months ago
- Notes from the Latent Space paper club. Follow along or start your own!☆239Updated last year
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆38Updated last year
- ☆46Updated 5 months ago
- rl from zero pretrain, can it be done? yes.☆268Updated last month
- GPU Kernels☆193Updated 4 months ago
- An extension of the nanoGPT repository for training small MOE models.☆187Updated 6 months ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆45Updated 6 months ago
- ☆89Updated 5 months ago
- ☆67Updated 11 months ago
- Website☆56Updated 2 years ago
- Building GPT ...☆18Updated 9 months ago
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆226Updated last month
- Open source interpretability artefacts for R1.☆158Updated 5 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆105Updated 2 years ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Updated last year
- A comprehensive deep dive into the world of tokens☆226Updated last year
- ☆142Updated last week