jquesnelle / transformers-openai-api
An OpenAI Completions API compatible server for NLP transformers models
☆63Updated last year
Alternatives and similar repositories for transformers-openai-api:
Users that are interested in transformers-openai-api are comparing it to the libraries listed below
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- ☆38Updated last year
- Routing on Random Forest (RoRF)☆112Updated 4 months ago
- ☆65Updated 8 months ago
- entropix style sampling + GUI☆25Updated 3 months ago
- A framework for evaluating function calls made by LLMs☆36Updated 6 months ago
- Track the progress of LLM context utilisation☆53Updated 6 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆77Updated 10 months ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆74Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆82Updated this week
- ☆52Updated 8 months ago
- Evaluation of bm42 sparse indexing algorithm☆64Updated 7 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- ☆33Updated last year
- Let's create synthetic textbooks together :)☆73Updated last year
- ☆20Updated last year
- Completion After Prompt Probability. Make your LLM make a choice☆73Updated 3 months ago
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated last year
- Scripts to create your own moe models using mlx☆86Updated 11 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆101Updated 6 months ago
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆44Updated 4 months ago
- ☆111Updated last month
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆89Updated 3 weeks ago
- ☆74Updated last year
- Evaluate your LLM apps, RAG pipeline, any generated text, and more!Updated 9 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- ☆37Updated last year