clankur / einygpt
a transformer implemented primarily using einops and trained on the tinystories dataset
☆12Updated 9 months ago
Alternatives and similar repositories for einygpt:
Users that are interested in einygpt are comparing it to the libraries listed below
- Efficiently computing & storing token n-grams from large corpora☆22Updated 6 months ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆17Updated 2 weeks ago
- Documentation effort for the BookCorpus dataset☆34Updated 3 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆81Updated last year
- PANiC - PAraphrasing Noun-Compounds☆15Updated 7 years ago
- Interview-based evaluation of LLMs☆19Updated 3 months ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- negate_sentence(A Python module that doesn't negate sentences.)☆30Updated 6 months ago
- LLM sampling method for enforcing syntax adherence in generated output☆24Updated last year
- A cost estimator for OpenAI API calls in tqdm loops.☆18Updated 4 months ago
- ☆12Updated 3 weeks ago
- ☆28Updated last week
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Updated 7 months ago
- Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).☆58Updated 3 years ago
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.☆21Updated 3 months ago
- Dynamic Adversarial Benchmarking platform☆26Updated 2 years ago
- ☆32Updated last year
- This repository contains code to replicate the no-longer publicly available Toronto BookCorpus dataset☆49Updated 3 years ago
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆22Updated 6 months ago
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆19Updated 9 months ago
- Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT.☆15Updated 5 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 8 months ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆65Updated 2 years ago
- llm sampler that only allows words that are in the bible☆26Updated 4 months ago
- ☆13Updated 2 years ago
- Demos for the MiniWoB++ benchmark☆19Updated 7 years ago
- The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".☆31Updated last year
- A library to create and manage configuration files, especially for machine learning projects.☆77Updated 3 years ago