rania-hossam / LLAMA_FROM_SCRATCH_PYTORCH
☆16 · Updated last year
Alternatives and similar repositories for LLAMA_FROM_SCRATCH_PYTORCH
Users who are interested in LLAMA_FROM_SCRATCH_PYTORCH are comparing it to the libraries listed below.
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆77 · Updated 8 months ago
- A set of scripts and notebooks on LLM finetuning and dataset creation ☆110 · Updated 9 months ago
- experiments with inference on llama ☆104 · Updated last year
- ☆92 · Updated last year
- minimal GRPO implementation from scratch ☆90 · Updated 3 months ago
- Prune transformer layers ☆69 · Updated last year
- ☆47 · Updated 10 months ago
- Distributed training (multi-node) of a Transformer model ☆72 · Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆33 · Updated last month
- Code for NeurIPS LLM Efficiency Challenge ☆59 · Updated last year
- Supercharge huggingface transformers with model parallelism. ☆77 · Updated 8 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism. ☆59 · Updated last month
- Exploring finetuning public checkpoints on filtered 8K sequences on Pile ☆115 · Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors) ☆101 · Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated last year
- Open Implementations of LLM Analyses ☆104 · Updated 8 months ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆83 · Updated 7 months ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback. ☆85 · Updated 2 years ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆64 · Updated 10 months ago
- Like picoGPT but for BERT. ☆50 · Updated 2 years ago
- ☆77 · Updated last year
- ☆47 · Updated 7 months ago
- ☆23 · Updated last year
- ☆68 · Updated 10 months ago
- Data preparation code for Amber 7B LLM ☆91 · Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 · Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆261 · Updated 11 months ago
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries. ☆69 · Updated last year
- Repository containing awesome resources regarding Hugging Face tooling. ☆47 · Updated last year
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval ☆52 · Updated last year