shawwn / openai-server
OpenAI API webserver
☆183 Updated 3 years ago
Alternatives and similar repositories for openai-server:
Users who are interested in openai-server are comparing it to the libraries listed below:
- Simple Annotated implementation of GPT-NeoX in PyTorch ☆110 Updated 2 years ago
- ☆128 Updated 2 years ago
- Embeddings focused small version of Llama NLP model ☆103 Updated last year
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 5 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 Updated last year
- 4 bits quantization of SantaCoder using GPTQ ☆51 Updated last year
- Inference code for LLaMA models ☆188 Updated last year
- Instruct-tuning LLaMA on consumer hardware ☆66 Updated last year
- SoTA Transformers with C-backend for fast inference on your CPU. ☆311 Updated last year
- Codebase topic modeling using GNNs (Node aggregation and clustering) ☆61 Updated last year
- An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients. ☆329 Updated 7 months ago
- Reimplementation of the task generation part from the Alpaca paper ☆119 Updated last year
- A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model load… ☆115 Updated 3 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct ☆76 Updated last year
- Directly Connecting Python to LLMs via Strongly-Typed Functions, Dataclasses, Interfaces & Generic Types ☆393 Updated last month
- Prompt programming with FMs. ☆440 Updated 6 months ago
- Drop in replacement for OpenAI, but with Open models. ☆153 Updated last year
- A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4) ☆445 Updated last year
- The code we currently use to fine-tune models. ☆113 Updated 9 months ago
- ☆199 Updated last year
- A GPT-J API to use with python3 to generate text, blogs, code, and more ☆206 Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective. ☆164 Updated 3 weeks ago
- Self-extracting GPT prompts for ~70% token savings ☆220 Updated last year
- ☆173 Updated 2 years ago
- A Personalised AI Assistant Inspired by 'Diamond Age', Powered by SMS ☆92 Updated last year
- hnsqlite integrates hnswlib and sqlite for simple text embedding search ☆157 Updated last year
- AI sends pull requests for features you request in natural language ☆113 Updated last year
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).… ☆121 Updated last year
- A Simple Discord Bot for the Alpaca LLM ☆101 Updated last year