google-deepmind / transformer_ngrams
☆14Updated 4 months ago
Alternatives and similar repositories for transformer_ngrams:
Users that are interested in transformer_ngrams are comparing it to the libraries listed below
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆21Updated 8 months ago
- ☆22Updated 3 weeks ago
- ☆15Updated 5 months ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆24Updated 5 months ago
- Jax like function transformation engine but micro, microjax☆30Updated 5 months ago
- LLM training in simple, raw C/CUDA☆14Updated 3 months ago
- Code for☆24Updated 3 months ago
- ☆46Updated 4 months ago
- ☆27Updated 4 months ago
- An introduction to LLM Sampling☆77Updated 3 months ago
- ☆31Updated 10 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆21Updated last month
- Collection of resources for RL and Reasoning☆25Updated last month
- Generate graph/data embeddings multiple ways☆46Updated this week
- Tiny evaluation of leading LLMs on competitive programming problems☆14Updated 3 months ago
- PyTorch implementation for MRL☆18Updated last year
- A package for defining deep learning models using categorical algebraic expressions.☆60Updated 7 months ago
- Learn online intrinsic rewards from LLM feedback☆35Updated 3 months ago
- ☆19Updated 7 months ago
- ☆41Updated 2 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆26Updated 2 weeks ago
- LLM reads a paper and produce a working prototype☆51Updated last week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated last month
- Evaluation of neuro-symbolic engines☆35Updated 7 months ago
- ☆38Updated 7 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆21Updated 3 weeks ago
- ☆27Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- Repository to create traveling waves integrate special information through time☆49Updated 2 weeks ago
- Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model☆14Updated last year