jaymody / simpleGPT
Simple implementation of a GPT (training and inference) in PyTorch.
☆10Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for simpleGPT
- Efficiently computing & storing token n-grams from large corpora☆15Updated last month
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Updated 10 months ago
- Github repo for Peifeng's internship project☆12Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- Training hybrid models for dummies.☆15Updated 3 weeks ago
- Rust bindings for CTranslate2☆13Updated last year
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆17Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 5 months ago
- ☆12Updated 6 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆15Updated 3 weeks ago
- A sample pattern for running CI tests on Modal☆13Updated 2 months ago
- a graph definition and execution library for python☆16Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆12Updated 3 months ago
- ☆14Updated last year
- Modified Beam Search with periodical restart☆12Updated 2 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- Hugging Face and Pyserini interoperability☆19Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆26Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆11Updated 9 months ago
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- Low-Rank Adaptation of Large Language Models clean implementation☆9Updated last year
- Answer questions against collections stored in LLM using Retrieval Augmented Generation☆24Updated 9 months ago
- LocalAI website, powered by Hugo☆14Updated 11 months ago
- 📰 Computing the information content of trained neural networks☆21Updated 3 years ago
- ☆12Updated 7 months ago
- Visionner turn raw image data into numpy array, more suitable for deep learning task☆10Updated last year
- Visual search interface☆11Updated 2 years ago
- Ultra-minimal autoregressive diffusion model for image generation☆15Updated last month