tsmatz / finetune_llm_with_loraLinks
Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023)
☆27Updated last month
Alternatives and similar repositories for finetune_llm_with_lora
Users that are interested in finetune_llm_with_lora are comparing it to the libraries listed below
Sorting:
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆77Updated last year
- ☆84Updated last year
- LoRA and DoRA from Scratch Implementations☆210Updated last year
- Research on Tabular Foundation Models☆55Updated 8 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆174Updated 5 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆36Updated last year
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) i…☆64Updated last year
- Distributed training (multi-node) of a Transformer model☆80Updated last year
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…☆52Updated last year
- a curated list of the role of small models in the LLM era☆104Updated 11 months ago
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind☆177Updated 11 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- Course Project for CS224W at Stanford☆22Updated 3 years ago
- ☆319Updated last year
- Playground for Transformers☆52Updated last year
- Sandbox to analysis and benchmarking Graph Transformer Neural Networks☆35Updated 2 years ago
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆130Updated 9 months ago
- minimal GRPO implementation from scratch☆96Updated 5 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆31Updated last year
- Code for our paper "Graph Language Models"☆74Updated last year
- A curated paper list on LLM reasoning.☆89Updated last year
- ☆29Updated last year
- ☆37Updated 2 years ago
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition☆202Updated last year
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat…☆168Updated 8 months ago
- Multimodal Graph Learning: how to encode multiple multimodal neighbors with their relations into LLMs☆64Updated last year
- ☆32Updated 10 months ago
- Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data☆412Updated 8 months ago
- several types of attention modules written in PyTorch for learning purposes☆53Updated 11 months ago
- Multitask-learning of a BERT backbone. Allows to easily train a BERT model with state-of-the-art method such as PCGrad, Gradient Vaccine,…☆19Updated last year