donaldafeith / Pytorch_Merge
Merge LLMs that are split into parts
☆25 Updated last year
Alternatives and similar repositories for Pytorch_Merge
Users that are interested in Pytorch_Merge are comparing it to the libraries listed below
- ☆27 Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆33 Updated last year
- ☆49 Updated 6 months ago
- Experimental sampler to make LLMs more creative ☆31 Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆62 Updated last year
- ☆72 Updated last year
- ☆63 Updated 8 months ago
- Multi-Domain Expert Learning ☆66 Updated last year
- Implementation of the Mamba SSM with hf_integration. ☆56 Updated 9 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated last year
- QLoRA with Enhanced Multi GPU Support ☆37 Updated last year
- entropix style sampling + GUI ☆26 Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆41 Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated last year
- ☆34 Updated 11 months ago
- ☆15 Updated last year
- Finetune any model on HF in less than 30 seconds ☆57 Updated last month
- ☆32 Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆43 Updated last year
- Using multiple LLMs for ensemble Forecasting ☆16 Updated last year
- ☆31 Updated last year
- ☆23 Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated last year
- ☆37 Updated 2 years ago
- ☆53 Updated last year
- Modified Beam Search with periodical restart ☆12 Updated 8 months ago
- A library for squeakily cleaning and filtering language datasets. ☆46 Updated last year
- ☆26 Updated last year
- [WIP] Transformer to embed Danbooru labelsets ☆13 Updated last year