kabachuha / nanoGPKANT
Testing KAN-based text generation GPT models
☆16Updated 11 months ago
Alternatives and similar repositories for nanoGPKANT:
Users that are interested in nanoGPKANT are comparing it to the libraries listed below
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- alternative way to calculating self attention☆18Updated 11 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆96Updated last month
- ☆26Updated 4 months ago
- Rust bindings for CTranslate2☆14Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 11 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 3 months ago
- A collection of optimizers for MLX☆35Updated this week
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated last year
- ☆61Updated last year
- Training hybrid models for dummies.☆20Updated 3 months ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 6 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year
- ☆27Updated 9 months ago
- Latent Large Language Models☆17Updated 8 months ago
- a WIP architecture designed to allow transformers to think in a manner without tokens☆19Updated last year
- aesthetic tensor visualiser☆15Updated this week
- Tool to take your ML model from local to production with one-line of code.☆25Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated 8 months ago
- a version of baby agi using dspy and typed predictors☆17Updated last year
- ☆51Updated last month
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 6 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- finetuning shakespeare on karpathy/nanoGPT☆19Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- A synthetic story narration dataset to study small audio LMs.☆32Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 9 months ago
- BH hackathon☆14Updated last year