ray-project / llms-in-prod-workshop-2023
Deploy and Scale LLM-based applications
☆26 Updated 2 years ago
Alternatives and similar repositories for llms-in-prod-workshop-2023
Users who are interested in llms-in-prod-workshop-2023 are comparing it to the libraries listed below
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines ☆31 Updated last year
- Tutorial to get started with SkyPilot! ☆58 Updated last year
- A personal knowledge base that I can dump information to and help me learn ☆24 Updated last month
- Leverage your LangChain trace data for fine tuning ☆41 Updated 10 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ☆30 Updated 5 months ago
- Streamlit app for recommending eval functions using prompt diffs ☆27 Updated last year
- Using LlamaIndex with Ray for productionizing LLM applications ☆71 Updated last year
- ☆32 Updated last year
- ☆20 Updated 8 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆37 Updated last year
- The backend behind the LLM-Perf Leaderboard ☆10 Updated last year
- Interface for interacting with Gradient AI in Python ☆14 Updated 11 months ago
- Run code-llama with 50k tokens using flash attention and better transformer ☆12 Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆35 Updated last year
- Github repo for storing LlamaDatasets ☆34 Updated last year
- A collection of examples demonstrating how to use dstack ☆26 Updated last year
- ☆77 Updated last year
- Fine-tune an LLM to perform batch inference and online serving. ☆112 Updated 3 weeks ago
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆17 Updated last year
- ☆78 Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆33 Updated last month
- Chat Markup Language conversation library ☆55 Updated last year
- Nearest Neighbors vs Approximate Nearest Neighbors ☆25 Updated 2 years ago
- ☆27 Updated last year
- Retrieval Augmented Generation applications ☆26 Updated last year
- ☆38 Updated last year
- Sentence Embedding as a Service ☆15 Updated last year
- ☆59 Updated last year
- Example of running LangChain on Cloud Run ☆61 Updated 2 years ago
- Verbosity control for AI agents ☆63 Updated last year