attentionmech / dex
Pokedex for LLMs
☆13Updated 7 months ago
Alternatives and similar repositories for dex
Users who are interested in dex are comparing it to the repositories listed below
- Fork of Flame repo for training of some new stuff in development☆19Updated last week
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆21Updated 5 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆66Updated last week
- Code to download the curated HuggingFace papers into a single Markdown-formatted file☆15Updated last year
- Fast, High-Fidelity LLM Decoding with Regex Constraints☆21Updated last year
- Simple repository for training small reasoning models☆46Updated 9 months ago
- ☆20Updated 8 months ago
- aesthetic tensor visualiser☆27Updated 7 months ago
- ☆55Updated last year
- GoldFinch and other hybrid transformer components☆45Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Updated last year
- UQ: Assessing Language Models on Unsolved Questions☆28Updated 3 months ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆18Updated last year
- We study toy models of skill learning.☆31Updated 10 months ago
- LLM training in simple, raw C/CUDA☆15Updated 11 months ago
- microjax: a micro, JAX-like function transformation engine☆33Updated last year
- Code for the paper "Function-Space Learning Rates"☆23Updated 5 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated this week
- Collection of autoregressive model implementations☆86Updated 7 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 6 months ago
- Minimum Description Length probing for neural network representations☆20Updated 10 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆29Updated 5 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models☆42Updated 4 months ago
- PyTorch implementation for MRL☆20Updated last year
- Alternative way of calculating self-attention☆18Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆59Updated last month
- ☆49Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated last month
- Utilities for Training Very Large Models☆58Updated last year