Cohere-Labs-Community / AI-Alignment-Cohort
☆29 Updated last year
Alternatives and similar repositories for AI-Alignment-Cohort
Users who are interested in AI-Alignment-Cohort are comparing it to the repositories listed below.
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University ☆268 Updated this week
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆31 Updated 8 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆259 Updated 2 years ago
- ☆68 Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆195 Updated 6 months ago
- A set of scripts and notebooks on LLM finetuning and dataset creation ☆112 Updated last year
- ☆45 Updated 7 months ago
- Prune transformer layers ☆74 Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆129 Updated 2 years ago
- A puzzle to learn about prompting ☆135 Updated 2 years ago
- List of online discord servers for ML collaborations. ☆36 Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆49 Updated last year
- An extension of the nanoGPT repository for training small MOE models. ☆219 Updated 9 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks ☆38 Updated last year
- ☆99 Updated last year
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL. ☆236 Updated 4 months ago
- GPU Kernels ☆212 Updated 8 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022) ☆105 Updated 2 years ago
- An introduction to LLM Sampling ☆79 Updated last year
- Website ☆57 Updated 2 years ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin… ☆23 Updated 2 years ago
- 🧠 Starter templates for doing interpretability research ☆76 Updated 2 years ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆87 Updated 2 years ago
- ☆225 Updated last month
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems ☆141 Updated 11 months ago
- Fast bare-bones BPE for modern tokenizer training ☆174 Updated 6 months ago
- ☆147 Updated 3 months ago
- Slides, notes, and materials for the workshop ☆337 Updated last year
- ☆116 Updated 2 weeks ago
- A comprehensive deep dive into the world of tokens ☆227 Updated last year