huu4ontocord / MDELView external linksLinks
Multi-Domain Expert Learning
☆67Jan 23, 2024Updated 2 years ago
Alternatives and similar repositories for MDEL
Users that are interested in MDEL are comparing it to the libraries listed below
Sorting:
- Adversarial Training and SFT for Bot Safety Models☆40Apr 18, 2023Updated 2 years ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆13Jun 21, 2023Updated 2 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Mar 22, 2023Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.☆50Jul 10, 2023Updated 2 years ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Dec 22, 2023Updated 2 years ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Feb 9, 2024Updated 2 years ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- Utilities for Training Very Large Models☆58Sep 25, 2024Updated last year
- ☆15Jul 13, 2025Updated 7 months ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- A curated list of Natural Language Generation papers, tutorials, and blogs.☆12Dec 13, 2018Updated 7 years ago
- Yet another LLM command line interface☆16Dec 9, 2024Updated last year
- MagmaGlass is a selfhosted alternative to Obsidian Publish, written in Laravel. It's a looking glass for your lava rocks.☆19Feb 9, 2024Updated 2 years ago
- ☆63Sep 23, 2024Updated last year
- ☆33Jul 31, 2024Updated last year
- Code repository for the c-BTM paper☆108Sep 26, 2023Updated 2 years ago
- ☆18Dec 18, 2022Updated 3 years ago
- Tool for executing python on AWS instances☆19Jan 25, 2017Updated 9 years ago
- Experiments on content generation & machine "intelligence"☆15Dec 28, 2016Updated 9 years ago
- ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…☆21Sep 7, 2023Updated 2 years ago
- ☆21Oct 6, 2023Updated 2 years ago
- ☆20Jul 12, 2023Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆144Sep 10, 2023Updated 2 years ago
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆161Sep 26, 2023Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project☆58Oct 9, 2022Updated 3 years ago
- Big-Interleaved-Dataset☆58Jan 21, 2023Updated 3 years ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Oct 17, 2023Updated 2 years ago
- Vector-Space Markov Random Fields☆21May 13, 2015Updated 10 years ago
- Anh - LAION's multilingual assistant datasets and models☆27Apr 5, 2023Updated 2 years ago
- Code base for internal reward models and PPO training☆24Oct 1, 2023Updated 2 years ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Oct 22, 2023Updated 2 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆73Updated this week
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- clean up your LLM datasets☆114May 30, 2023Updated 2 years ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆27Apr 21, 2023Updated 2 years ago
- Retro styled terminal shell☆26May 8, 2024Updated last year