clankur / einygpt
a transformer implemented primarily using einops and trained on the tinystories dataset
☆12Updated last year
Alternatives and similar repositories for einygpt
Users that are interested in einygpt are comparing it to the libraries listed below
- Documentation effort for the BookCorpus dataset☆34Updated 4 years ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆33Updated last year
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆18Updated 2 months ago
- ☆73Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated last year
- LLM sampling method for enforcing syntax adherence in generated output☆25Updated 2 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Updated last year
- Experiments with Hugging Face 🔬 🤗☆44Updated 10 months ago
- Interview-based evaluation of LLMs☆20Updated 5 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆44Updated last year
- Code for the paper "Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).☆59Updated 3 years ago
- LLMs playing chess are sensitive to how the position came to be☆23Updated last year
- Model implementation for the contextual embeddings project☆33Updated 3 weeks ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Open source version of Anthropic's Clio: A system for privacy-preserving insights into real-world AI use☆19Updated last week
- Run code inference-only benchmarks quickly using vLLM☆10Updated 3 months ago
- llm sampler that only allows words that are in the bible☆27Updated 6 months ago
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 9 months ago
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Updated 9 months ago
- Efficiently computing & storing token n-grams from large corpora☆24Updated 8 months ago
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆19Updated last year
- ☆23Updated 4 months ago
- a writeup on some experiments on a sequence model for chess games☆30Updated 3 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- ☆45Updated last year
- PANiC - PAraphrasing Noun-Compounds☆15Updated 7 years ago
- ☆37Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated 2 years ago