clankur / einygpt
a transformer implemented primarily using einops and trained on the tinystories dataset
☆13Updated 6 months ago
Alternatives and similar repositories for einygpt:
Users that are interested in einygpt are comparing it to the libraries listed below
- Efficiently computing & storing token n-grams from large corpora☆17Updated 3 months ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆17Updated last year
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Updated 4 months ago
- Fast Neural Machine Translation in C++ - development repository☆19Updated 8 months ago
- A framework for collecting a large human-sourced chain-of-thoughts dataset☆18Updated 6 months ago
- Identify and automatically fix issues in shell scripts☆14Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Updated 5 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆28Updated 2 weeks ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.☆9Updated 3 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆29Updated 3 months ago
- A simple library for working with Hugging Face models.☆14Updated 2 weeks ago
- Pragmatic framework to build LLM Copilots☆16Updated last week
- Test prompts for GPT-J-6B and the resulting AI-generated texts☆53Updated 3 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated 2 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Discord bot that generates messages using GPT-2☆20Updated 5 years ago
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search☆10Updated last year
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆18Updated 9 months ago
- arXiv plain text extraction☆41Updated 2 years ago
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆44Updated last year
- A visual tool to interpret and understand PyTorch machine learning models☆15Updated 11 months ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆32Updated last year
- NLP with Rust for Python 🦀🐍☆60Updated 7 months ago
- Benchmark structured generation libraries☆24Updated 2 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year