jaymody / simpleGPT
Simple implementation of a GPT (training and inference) in PyTorch.
☆9Updated 9 months ago
Related projects: ⓘ
- code for paper "Accessing higher dimensions for unsupervised word translation"☆19Updated last year
- Github repo for Peifeng's internship project☆12Updated 10 months ago
- Efficiently computing & storing token n-grams from large corpora☆15Updated 2 weeks ago
- ☆65Updated 2 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- Hugging Face and Pyserini interoperability☆17Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 2 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- ChatBot App built using LangChain and Lightning AI☆17Updated last year
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆13Updated 8 months ago
- Create an LLM XML context document from an llms.txt file☆13Updated 3 weeks ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆17Updated last year
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated 9 months ago
- Answer questions against collections stored in LLM using Retrieval Augmented Generation☆22Updated 7 months ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.☆9Updated 2 years ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆15Updated 3 months ago
- A file utility for accessing both local and remote files through a unified interface.☆36Updated last month
- a graph definition and execution library for python☆16Updated last year
- ☆9Updated 5 months ago
- Scripts to parse arxiv documents for NLP tasks☆17Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆25Updated 11 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆26Updated last year
- Tools for encoding Magic: The Gathering cards into a form suitable for AI text generation☆17Updated 3 years ago
- ☆13Updated this week
- ☆14Updated last year
- Python script to quickly generate a Font Awesome icon imposed on a background for steering AI image generation.☆53Updated 2 years ago
- LLM plugin for embeddings using sentence-transformers☆41Updated 7 months ago
- Stuff related to scraping the Code Review StackExchange☆11Updated last year