mallik3006 / LLM_fine_tuning_llama3_8bLinks
Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed
☆18Updated last year
Alternatives and similar repositories for LLM_fine_tuning_llama3_8b
Users that are interested in LLM_fine_tuning_llama3_8b are comparing it to the libraries listed below
Sorting:
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Updated 10 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆47Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated 8 months ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆40Updated last week
- ☆76Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆58Updated last month
- ☆47Updated 4 months ago
- ☆92Updated 3 months ago
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 9 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆45Updated 9 months ago
- Scripts for text classification with llama and bert☆19Updated last week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆64Updated last year
- Collection of autoregressive model implementation☆85Updated 2 months ago
- Model implementation for the contextual embeddings project☆33Updated 3 weeks ago
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 8 months ago
- "Syntriever: How to Train Your Retriever with Synthetic Data from LLMs" the Nations of the Americas Chapter of the Association for Comput…☆25Updated 3 months ago
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- ☆47Updated 9 months ago
- ☆23Updated last year
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆65Updated 4 months ago
- 🚢 Data Toolkit for Sailor Language Models☆92Updated 4 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Code for KaLM-Embedding models☆78Updated 3 months ago