hamelsmu / llama-inferenceLinks
experiments with inference on llama
â104Updated 11 months ago
Alternatives and similar repositories for llama-inference
Users that are interested in llama-inference are comparing it to the libraries listed below
Sorting:
- Manage scalable open LLM inference endpoints in Slurm clustersâ257Updated 10 months ago
- Lightweight demos for finetuning LLMs. Powered by đ¤ transformers and open-source datasets.â77Updated 7 months ago
- đšī¸ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.â137Updated 10 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)â101Updated last year
- Multipack distributed sampler for fast padding-free training of LLMsâ188Updated 9 months ago
- â198Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oâĻâ134Updated last week
- Datasets collection and preprocessings framework for NLP extreme multitask learningâ182Updated 4 months ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pileâ114Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.â81Updated last year
- Code for NeurIPS LLM Efficiency Challengeâ58Updated last year
- Data preparation code for Amber 7B LLMâ90Updated last year
- batched lorasâ343Updated last year
- Experiments on speculative sampling with Llama modelsâ126Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.â37Updated last year
- A framework for evaluating function calls made by LLMsâ37Updated 10 months ago
- â121Updated last month
- Small and Efficient Mathematical Reasoning LLMsâ71Updated last year
- Experiments with generating opensource language model assistantsâ97Updated 2 years ago
- FastFit ⥠When LLMs are Unfit Use FastFit ⥠Fast and Effective Text Classification with Many Classesâ206Updated 3 weeks ago
- ReLM is a Regular Expression engine for Language Modelsâ105Updated 2 years ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.â187Updated last year
- đĸ Data Toolkit for Sailor Language Modelsâ91Updated 3 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationâ63Updated last year
- QLoRA with Enhanced Multi GPU Supportâ37Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creationâ111Updated 8 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for freeâ230Updated 7 months ago
- Pre-training code for Amber 7B LLMâ166Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.â34Updated 3 weeks ago
- â49Updated last year