kabachuha / nanoGPKANTLinks
Testing KAN-based text generation GPT models
☆17Updated last year
Alternatives and similar repositories for nanoGPKANT
Users that are interested in nanoGPKANT are comparing it to the libraries listed below
Sorting:
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora☆24Updated 9 months ago
- https://mlabonne.github.io/blog/☆47Updated 2 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- ☆61Updated last year
- Tools for formatting large language model prompts.☆13Updated last year
- This is the code that went into our practical dive using mamba as information extraction☆53Updated last year
- a version of baby agi using dspy and typed predictors☆17Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated 2 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 8 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 4 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated 11 months ago
- Torch-activation, a library of activation functions for PyTorch library☆25Updated 2 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆36Updated last year
- Implementation of the Mamba SSM with hf_integration.☆56Updated 10 months ago
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆168Updated last year
- A collection of optimizers for MLX☆37Updated this week
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- PyTorch implementation for MRL☆19Updated last year
- ☆47Updated last year
- ☆26Updated 7 months ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆151Updated 10 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 6 months ago
- ☆22Updated last year
- Tool to take your ML model from local to production with one-line of code.☆25Updated last year
- ☆48Updated 5 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41Updated last year