Patch for MPT-7B which allows using and training a LoRA
☆58May 20, 2023Updated 2 years ago
Alternatives and similar repositories for mpt-lora-patch
Users that are interested in mpt-lora-patch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆13Jun 21, 2023Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104May 20, 2025Updated 10 months ago
- ☆34Apr 23, 2023Updated 2 years ago
- Makes llama.cpp easy to use.☆12May 14, 2025Updated 10 months ago
- ☆17Jun 20, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Tune MPTs☆84Jun 17, 2023Updated 2 years ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆45Jun 13, 2023Updated 2 years ago
- Image Diffusion block merging technique applied to transformers based Language Models.☆56May 8, 2023Updated 2 years ago
- Generate High Quality textual or multi-modal datasets with Agents☆18Jun 7, 2023Updated 2 years ago
- Chat Example Application Using SvelteJS and Chat☆17Dec 16, 2020Updated 5 years ago
- ☆15Oct 31, 2023Updated 2 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- This repo helps to transform text into a better form for lora training☆13Apr 9, 2023Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Semantic search engine indexing 110 million academic publications☆103Jan 19, 2026Updated 2 months ago
- Instruct-tune LLaMA on consumer hardware with shareGPT data☆125Apr 20, 2023Updated 2 years ago
- Self-verification for LLMs.☆67Jul 22, 2023Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆125Jun 16, 2023Updated 2 years ago
- Pre-train BERT from scratch, with HuggingFace. Accompanies the blog post: sidsite.com/posts/bert-from-scratch☆43May 20, 2025Updated 10 months ago
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- This repository contains the data and code created under the project NLP4Rare-cm-uc3m.☆10Sep 14, 2021Updated 4 years ago
- ☆12Oct 3, 2024Updated last year
- MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. …☆14Jan 16, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Integrate an LLM copilot within your Keras model development workflow☆28Sep 23, 2023Updated 2 years ago
- Samples for fine-tuning HuggingFace models with AzureML☆10Oct 14, 2021Updated 4 years ago
- RIT-assistant☆13Sep 8, 2017Updated 8 years ago
- ☆13Feb 26, 2023Updated 3 years ago
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- A simple life simulation game developed in Unity, where the player can watch a group of entities move, grow, search, do pathfinding, eat,…☆13Dec 2, 2020Updated 5 years ago
- Lossless normalization of uppercase characters☆11Jul 3, 2023Updated 2 years ago
- oobabooga extension - Experimental sampler to make LLMs more creative☆23Aug 2, 2023Updated 2 years ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat…☆26Mar 13, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Customizable implementation of the self-instruct paper.☆1,052Mar 7, 2024Updated 2 years ago
- Embeddings focused small version of Llama NLP model☆107Apr 27, 2023Updated 2 years ago
- Fill up the `model_list` field in your LiteLLM proxy configuration file☆10Sep 7, 2024Updated last year
- ☆17Oct 1, 2024Updated last year
- Yet another LLM☆10Apr 6, 2023Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Jan 12, 2025Updated last year
- The ChatGPT Chrome Extension is a general-purpose extension that utilizes the OpenAI GPT model to provide suggestions based on user input…☆12Apr 22, 2024Updated last year