rania-hossam / LLAMA_FROM_SCRATCH_PYTORCHLinks
β16Updated last year
Alternatives and similar repositories for LLAMA_FROM_SCRATCH_PYTORCH
Users that are interested in LLAMA_FROM_SCRATCH_PYTORCH are comparing it to the libraries listed below
Sorting:
- A set of scripts and notebooks on LLM finetunning and dataset creationβ111Updated 8 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β77Updated 7 months ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback.β85Updated 2 years ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuningβ64Updated 10 months ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"β131Updated last year
- Pre-training code for Amber 7B LLMβ166Updated last year
- Distributed training (multi-node) of a Transformer modelβ68Updated last year
- Code for NeurIPS LLM Efficiency Challengeβ58Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β33Updated 3 weeks ago
- minimal GRPO implementation from scratchβ90Updated 2 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultinβ¦β23Updated last year
- β87Updated last year
- Codebase accompanying the Summary of a Haystack paper.β78Updated 8 months ago
- Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the β¦β58Updated last year
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)β¦β67Updated last year
- Small and Efficient Mathematical Reasoning LLMsβ71Updated last year
- β36Updated 2 weeks ago
- Repository containing awesome resources regarding Hugging Face tooling.β47Updated last year
- Data preparation code for Amber 7B LLMβ91Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"β108Updated last year
- Official implementation for 'Extending LLMsβ Context Window with 100 Samples'β78Updated last year
- β47Updated 9 months ago
- experiments with inference on llamaβ104Updated last year
- Benchmark suite for LLMs from Fireworks.aiβ75Updated 3 weeks ago
- β29Updated 6 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkIβ94Updated 2 years ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"β54Updated 8 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flβ¦β75Updated 9 months ago
- Training and Fine-tuning an llm in Python and PyTorch.β42Updated last year
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.β62Updated 4 months ago