jquesnelle / transformers-openai-api
An OpenAI Completions API compatible server for NLP transformers models
☆64Updated last year
Alternatives and similar repositories for transformers-openai-api:
Users that are interested in transformers-openai-api are comparing it to the libraries listed below
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆83Updated this week
- ☆38Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 5 months ago
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- A framework for evaluating function calls made by LLMs☆37Updated 7 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆77Updated 11 months ago
- ☆65Updated 9 months ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 6 months ago
- Self-host LLMs with vLLM and BentoML☆92Updated this week
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Let's create synthetic textbooks together :)☆73Updated last year
- ☆199Updated last year
- ☆152Updated 8 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- ☆54Updated 2 months ago
- entropix style sampling + GUI☆25Updated 4 months ago
- ☆75Updated last year
- ☆38Updated last year
- ☆111Updated 3 months ago
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd☆26Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- ☆33Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆102Updated 7 months ago
- Synthetic Data for LLM Fine-Tuning☆112Updated last year
- Track the progress of LLM context utilisation☆53Updated 8 months ago
- For inferring and serving local LLMs using the MLX framework☆96Updated 11 months ago
- ☆20Updated last year
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year