AI-Northstar-Tech / openai-proxyLinks
Proxy server for quota, usage monitoring and tracking of OpenAI requests
☆16Updated 2 years ago
Alternatives and similar repositories for openai-proxy
Users that are interested in openai-proxy are comparing it to the libraries listed below
Sorting:
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Updated 11 months ago
- Data extraction with LLM on CPU☆68Updated 2 years ago
- Tools for formatting large language model prompts.☆13Updated 2 years ago
- Sentence Embedding as a Service☆15Updated 7 months ago
- Record and replay LLM interactions for langchain☆82Updated last year
- Demo example of consumer goods categorization☆30Updated 2 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Updated 11 months ago
- An open-source cloud-native of large multi-modal models (LMMs) serving framework.☆165Updated 2 years ago
- HuggingChat like UI in Gradio☆70Updated 2 years ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆51Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Updated 10 months ago
- POC Port of the openai-realtime-console to streamlit.☆53Updated last year
- ☆12Updated last week
- Develop, evaluate and monitor LLM applications at scale☆100Updated last year
- ☆107Updated 2 years ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- Deploy and Scale LLM-based applications☆26Updated 2 years ago
- 🌸 The open framework for question answering fine-tuning LLMs on private data☆69Updated 2 years ago
- Data Questionnaire Agent Chatbot☆71Updated this week
- Querypls🛠️: WebApp that Simplify SQL with Your Prompts. Transforming questions into SQL commands effortlessly.☆76Updated 6 months ago
- Creating Generative AI Apps which work☆17Updated 9 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- Chat Markup Language conversation library☆55Updated 2 years ago
- LLM finetuning☆42Updated 2 years ago
- ☆20Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- ☆21Updated last year
- A high performance batching router optimises max throughput for text inference workload☆16Updated 2 years ago
- Very minimal (and stateless) agent framework☆44Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year