ajtejankar / mixtral-vis-moeLinks

Visualize expert firing frequencies across sentences in the Mixtral MoE model

☆18

Alternatives and similar repositories for mixtral-vis-moe

Users that are interested in mixtral-vis-moe are comparing it to the libraries listed below

Sorting:

tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆81Updated last month
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆48Updated 5 months ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
mkuchnik / relm
ReLM is a Regular Expression engine for Language Models
☆106Updated 2 years ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
arcee-ai / DAM
☆52Updated 8 months ago
CERC-AAI / Robin
☆63Updated 9 months ago
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆65Updated 3 months ago
xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆65Updated 2 years ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated 11 months ago
kevinwu23 / StanfordFineTuneBench
☆30Updated 8 months ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
LLM360 / crystalcoder-train
Pre-training code for CrystalCoder 7B LLM
☆54Updated last year
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 10 months ago
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91Updated last year
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated 11 months ago
awslabs / extending-the-context-length-of-open-source-llms
☆56Updated 3 weeks ago
teknium1 / LLM-Logbook
Public reports detailing responses to sets of prompts by Large Language Models.
☆30Updated 6 months ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆106Updated 7 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆103Updated 2 months ago
Pleias / Pleias-RAG-Library
Python library to use Pleias-RAG models
☆58Updated 2 months ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88Updated 9 months ago
jxmorris12 / bm25_pt
minimal pytorch implementation of bm25 (with sparse tensors)
☆102Updated last year
HazyResearch / cartridges
Storing long contexts in tiny caches with self-study
☆89Updated this week
mungg / FABLES
☆57Updated 9 months ago
Zyphra / Zyda_processing
☆36Updated last year
mobiusml / aana_sdk
Aana SDK is a powerful framework for building AI enabled multimodal applications.
☆49Updated this week