SocialTensor / SocialTensorSubnet
☆20 Updated 10 months ago
Alternatives and similar repositories for SocialTensorSubnet
Users who are interested in SocialTensorSubnet are comparing it to the libraries listed below.
- ☆56 Updated 6 months ago
- A pipeline parallel training script for LLMs. ☆166 Updated 9 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆79 Updated last year
- ☆17 Updated last year
- Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a… ☆82 Updated last year
- ☆27 Updated 2 years ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe… ☆157 Updated last year
- An unsupervised model merging algorithm for Transformers-based language models. ☆108 Updated last year
- ☆51 Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models ☆180 Updated last year
- QLoRA with Enhanced Multi GPU Support ☆37 Updated 2 years ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 Updated last year
- ☆23 Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆249 Updated last year
- entropix style sampling + GUI ☆27 Updated last year
- Train Llama Loras Easily ☆31 Updated 2 years ago
- Synthetic Role-Play Conversation Dataset Generation ☆49 Updated 2 years ago
- ☆50 Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated last year
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ ☆101 Updated 2 years ago
- Implementation of DoRA ☆306 Updated last year
- Testing LLM reasoning abilities with family relationship quizzes. ☆63 Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆34 Updated last year
- A benchmark for role-playing language models ☆115 Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- LoRA and DoRA from Scratch Implementations ☆215 Updated last year
- ☆52 Updated 2 years ago
- Enhancing Translation with RAG-Powered Large Language Models ☆89 Updated last month
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆110 Updated 8 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆232 Updated last year