danielsobrado / llm_notebooksLinks
Concepts and examples on using and training LLMs
☆48Updated 5 months ago
Alternatives and similar repositories for llm_notebooks
Users that are interested in llm_notebooks are comparing it to the libraries listed below
Sorting:
- Examples on how to use LangChain and Ray☆233Updated 2 years ago
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated 2 years ago
- ☆163Updated 11 months ago
- Self-host LLMs with vLLM and BentoML☆167Updated last week
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆117Updated last year
- ☆75Updated last year
- A collection of all available inference solutions for the LLMs☆94Updated 11 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago
- Benchmarking the serving capabilities of vLLM☆58Updated last year
- Bottoms Up Development with LlamaIndex - Building a Documentation Chatbot☆188Updated 2 years ago
- ☆67Updated 10 months ago
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆161Updated 2 years ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 3 weeks ago
- ☆321Updated 2 years ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆114Updated 9 months ago
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆130Updated last year
- ☆185Updated 2 years ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆78Updated last year
- Excel spreadsheet crawler and table parser for data extraction and querying☆164Updated 11 months ago
- One click templates for inferencing Language Models☆227Updated 2 months ago
- LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex. It demonstrates how to impl. chunking, indexing, and source citation.☆45Updated 2 years ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆73Updated last year
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆53Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆131Updated 4 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆108Updated last year
- ☆44Updated last year
- ☆89Updated 2 years ago
- Classify data instantly using an LLM☆279Updated last year
- ☆46Updated 2 years ago
- Designed for offline use, this RAG application template offers a starting point for building your own local RAG pipeline, independent of …☆47Updated last year