clankur / einygptLinks

a transformer implemented primarily using einops and trained on the tinystories dataset

☆12

Alternatives and similar repositories for einygpt

Users that are interested in einygpt are comparing it to the libraries listed below

Sorting:

jackbandy / bookcorpus-datasheet
Documentation effort for the BookCorpus dataset
☆34Updated 4 years ago
AlexWan0 / infini-gram
An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)
☆33Updated last year
mukhal / intrinsic-source-citation
[COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models
☆18Updated 2 months ago
emrgnt-cmplxty / zero-shot-replication
☆73Updated last year
mgerstgrasser / tacheles
a lightweight, open-source blueprint for building powerful and scalable LLM chat applications
☆28Updated last year
IsaacRe / Syntactically-Constrained-Sampling
LLM sampling method for enforcing syntax adherence in generated output
☆25Updated 2 years ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58Updated 2 years ago
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated last year
loretoparisi / hf-experiments
Experiments with Hugging Face 🔬 🤗
☆44Updated 10 months ago
interview-eval / interview-eval
Interview-based evaluation of LLMs
☆20Updated 5 months ago
TristanThrush / i-am-a-strange-dataset
Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"
☆44Updated last year
basusourya / mirostat
Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).
☆59Updated 3 years ago
dpaleka / llm-chess-proofgame
LLMs playing chess are sensitive to how the position came to be
☆23Updated last year
illuin-tech / contextual-embeddings
Model implementation for the contextual embeddings project
☆33Updated 3 weeks ago
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45Updated last year
Phylliida / OpenClio
Open source version of Anthropic's Clio: A system for privacy-preserving insights into real-world AI use
☆19Updated last week
iNeil77 / vllm-code-harness
Run code inference-only benchmarks quickly using vLLM
☆10Updated 3 months ago
vgel / biblically-accurate-sampler
llm sampler that only allows words that are in the bible
☆27Updated 6 months ago
morgangiraud / text-to-sql-proto
A text-to-SQL prototype on the northwind sqlite dataset
☆12Updated 9 months ago
thisisanshgupta / Senna
Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…
☆19Updated 9 months ago
EleutherAI / tokengrams
Efficiently computing & storing token n-grams from large corpora
☆24Updated 8 months ago
minimaxir / pokemon-embeddings
Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.
☆19Updated last year
lightblue-tech / lb-reranker
☆23Updated 4 months ago
ricsonc / transformers-play-chess
a writeup on some experiments on a sequence model for chess games
☆30Updated 3 years ago
EleutherAI / best-download
URL downloader supporting checkpointing and continuous checksumming.
☆19Updated last year
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14Updated 2 years ago
shirley-wu / cot_decoding
☆45Updated last year
vered1986 / panic
PANiC - PAraphrasing Noun-Compounds
☆15Updated 7 years ago
deep-diver / LLM-Pref-Mark-UI
☆37Updated 2 years ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago