danielsobrado / llm_notebooksLinks
Concepts and examples on using and training LLMs
☆47Updated 3 months ago
Alternatives and similar repositories for llm_notebooks
Users that are interested in llm_notebooks are comparing it to the libraries listed below
Sorting:
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated 2 years ago
- ☆75Updated last year
- Awesome List of Vector DB resources☆174Updated 2 years ago
- A collection of all available inference solutions for the LLMs☆93Updated 9 months ago
- Self-host LLMs with vLLM and BentoML☆161Updated 3 weeks ago
- Examples on how to use LangChain and Ray☆232Updated 2 years ago
- DSPY on action with OpenSource LLMs.☆102Updated last year
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆108Updated last year
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆78Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆73Updated 11 months ago
- Benchmarking the serving capabilities of vLLM☆58Updated last year
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆58Updated 2 years ago
- LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex. It demonstrates how to impl. chunking, indexing, and source citation.☆45Updated 2 years ago
- ☆66Updated 8 months ago
- Developer samples for the KDB.AI vector database☆171Updated this week
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆116Updated last year
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆48Updated last year
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆58Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 8 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆131Updated 2 months ago
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆53Updated last year
- Classify data instantly using an LLM☆278Updated last year
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆76Updated last year
- Agent that routes to different tools - LLM classifier SDK☆45Updated last year
- ☆164Updated 10 months ago
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling☆134Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆131Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆139Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year