kabachuha / nanoGPKANT
Testing KAN-based text generation GPT models
☆15Updated 8 months ago
Alternatives and similar repositories for nanoGPKANT:
Users that are interested in nanoGPKANT are comparing it to the libraries listed below
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated 2 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆18Updated 9 months ago
- Alpha-Zero Connect Four NN trained via self play☆13Updated 3 months ago
- Latent Large Language Models☆17Updated 4 months ago
- ☆27Updated 6 months ago
- ☆60Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆26Updated this week
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆17Updated this week
- alternative way to calculating self attention☆18Updated 7 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆31Updated 11 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Training hybrid models for dummies.☆16Updated this week
- Efficiently computing & storing token n-grams from large corpora☆17Updated 3 months ago
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆12Updated last month
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 3 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆38Updated 7 months ago
- Access fireworks.ai models via API☆11Updated 9 months ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 6 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 9 months ago
- ☆33Updated 11 months ago
- Tool to take your ML model from local to production with one-line of code.☆25Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆14Updated 2 weeks ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆29Updated 3 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated last year
- 🏥 Health monitor for a Petals swarm☆34Updated 5 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- Jax like function transformation engine but micro, microjax☆30Updated 2 months ago