Andrei-Aksionov / nanoGPTplus
β45Updated 11 months ago
Alternatives and similar repositories for nanoGPTplus:
Users that are interested in nanoGPTplus are comparing it to the libraries listed below
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β66Updated 3 months ago
- π Datasets and models for instruction-tuningβ232Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessβ¦β114Updated last year
- Multi-Domain Expert Learningβ67Updated last year
- 4 bits quantization of SantaCoder using GPTQβ53Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.β82Updated last year
- Unofficial python bindings for the rust llm library. πβ€οΈπ¦β74Updated last year
- Large Language Model (LLM) Inference API and Chatbotβ124Updated 9 months ago
- inference code for mixtral-8x7b-32kseqlenβ99Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hubβ156Updated last year
- Reimplementation of the task generation part from the Alpaca paperβ119Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β104Updated 8 months ago
- Collection of recipes aiding Gen AI model developmentβ92Updated last week
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytesβ¦β146Updated last year
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)β237Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196Updated 8 months ago
- Experiments with generating opensource language model assistantsβ97Updated last year
- β199Updated 11 months ago
- Small and Efficient Mathematical Reasoning LLMsβ71Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAGβ313Updated 2 months ago
- Completion After Prompt Probability. Make your LLM make a choiceβ73Updated 2 months ago
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β137Updated 6 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for freeβ225Updated 3 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeedβ34Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMsβ257Updated 3 months ago
- The Next Generation Multi-Modality Superintelligenceβ70Updated 4 months ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pileβ115Updated last year
- a tiny, exploitable chatbot that can use toolsβ30Updated last year
- β34Updated last year