SocialTensor / SocialTensorSubnetLinks
☆20Updated 10 months ago
Alternatives and similar repositories for SocialTensorSubnet
Users that are interested in SocialTensorSubnet are comparing it to the libraries listed below
Sorting:
- A pipeline parallel training script for LLMs.☆166Updated 9 months ago
- ☆56Updated 6 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆180Updated last year
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆157Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆79Updated last year
- ☆17Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- Enhancing Translation with RAG-Powered Large Language Models☆89Updated last month
- High-throughput tensor loading for PyTorch☆221Updated 2 weeks ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆249Updated last year
- ☆27Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models.☆108Updated last year
- Train Llama Loras Easily☆31Updated 2 years ago
- Synthetic Role-Play Conversation Dataset Generation☆49Updated 2 years ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆97Updated 9 months ago
- ☆141Updated 5 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- ☆51Updated last year
- Genertaes control vectors for use with llama.cpp in GGUF format.☆36Updated 10 months ago
- Arxflix turns your boring Arxiv research paper into a captivating video.☆58Updated 4 months ago
- Full finetuning of large language models without large memory requirements☆94Updated 4 months ago
- Implementation of DoRA☆306Updated last year
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago
- ☆50Updated last year
- entropix style sampling + GUI☆27Updated last year
- ☆242Updated 4 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆110Updated 8 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆201Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Google TPU optimizations for transformers models☆135Updated 2 weeks ago