MuhammadBinUsman03 / Real-Time-3-pipeline-LLM-Financial-Advisor
3-pipeline LLMOps financial advisor. The streaming pipeline, deployed on AWS, collects live data 24/7 and embeds it into QdrantDB. The training pipeline fine-tunes the model on a serverless GPU and logs the best model to the WandB Registry. The inference pipeline downloads the best model from the registry for inference and uses LangChain to maintain chat history and retrieve context.
☆24Updated 8 months ago
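A minimal sketch of how the three pipelines in the description above could fit together. It is not the repository's code: every class and function name here is hypothetical, and in-memory stand-ins replace the vector database, the model registry, and the LangChain history/retrieval layer so the example runs with the standard library only.

```python
import math
from collections import Counter, deque


def embed(text):
    """Toy sparse bag-of-words 'embedding' (assumption: stands in for a real embedding model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class InMemoryVectorStore:
    """Hypothetical stand-in for the vector database fed by the streaming pipeline."""

    def __init__(self):
        self.items = []  # list of (embedding, original text)

    def upsert(self, text):
        self.items.append((embed(text), text))

    def search(self, query, k=1):
        q = embed(query)
        ranked = sorted(self.items, key=lambda item: cosine(q, item[0]), reverse=True)
        return [text for _, text in ranked[:k]]


class ModelRegistry:
    """Hypothetical stand-in for a model registry that keeps only the best-scoring run."""

    def __init__(self):
        self.best = None  # (name, score)

    def log(self, name, score):
        if self.best is None or score > self.best[1]:
            self.best = (name, score)

    def download_best(self):
        if self.best is None:
            raise LookupError("no model has been logged yet")
        return self.best[0]


class Advisor:
    """Inference side: retrieves context and keeps a short question history."""

    def __init__(self, store, model_name, max_turns=3):
        self.store = store
        self.model_name = model_name
        self.history = deque(maxlen=max_turns)

    def build_prompt(self, question):
        context = self.store.search(question, k=1)
        lines = [f"[model: {self.model_name}]"]
        lines += [f"Context: {c}" for c in context]
        lines += [f"Previous question: {q}" for q in self.history]
        lines.append(f"Question: {question}")
        self.history.append(question)
        return "\n".join(lines)


# Streaming pipeline: ingest and embed incoming documents.
store = InMemoryVectorStore()
for headline in ["Apple shares rose after strong earnings",
                 "Bond yields fell on rate cut hopes"]:
    store.upsert(headline)

# Training pipeline: log candidate runs; the registry keeps the best one.
registry = ModelRegistry()
for name, score in [("run-a", 0.71), ("run-b", 0.84), ("run-c", 0.80)]:
    registry.log(name, score)

# Inference pipeline: load the best model name and build a context-aware prompt.
advisor = Advisor(store, registry.download_best())
print(advisor.build_prompt("How were Apple earnings"))
```

The three stages share only the store and the registry, which is what lets each pipeline be deployed and scaled separately in a design like the one described.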
Alternatives and similar repositories for Real-Time-3-pipeline-LLM-Financial-Advisor
Users that are interested in Real-Time-3-pipeline-LLM-Financial-Advisor are comparing it to the libraries listed below
- Fine-tune an LLM to perform batch inference and online serving.☆115Updated 6 months ago
- Multimodal AI workloads: batch inference, model training and online serving.☆104Updated 3 months ago
- GenAI Experimentation☆59Updated 3 months ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and…☆142Updated 3 weeks ago
- ☆33Updated last year
- A collection of hands-on notebooks for LLM practitioners☆51Updated 11 months ago
- Test LLMs automatically with Giskard and CI/CD☆31Updated last year
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv…☆72Updated 3 months ago
- 💻 Decoding ML articles hub: Hands-on articles with code on production-grade ML☆139Updated 9 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- ☆15Updated 2 years ago
- How to serve ML predictions 100x faster☆59Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated 5 months ago
- Let's discover films.☆27Updated 8 months ago
- ☆148Updated last year
- Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚☆195Updated this week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last month
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 8 months ago
- Optimized Large Language Models for Financial Applications – Efficient, Scalable, and Domain-Specific AI for Finance.☆50Updated 5 months ago
- ☆57Updated last year
- Scripts, notebooks, and articles about data science in general.☆53Updated 2 years ago
- Fine tuning ModernBERT-embed-base on synthetic domain specific data for improvement to unseen queries☆49Updated 7 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year
- Leetcode Intensive tutorial and study guide generated by llama-index, networkx, scikit-learn and pydantic☆114Updated last year
- Fine tune Gemma 3 on an object detection task☆92Updated 5 months ago
- ☆15Updated last year
- Unlock the potential of finetuning Large Language Models (LLMs). Learn from industry expert, and discover when to apply finetuning, data …☆70Updated 2 years ago
- Various installation guides for Large Language Models☆77Updated 7 months ago
- Material for the series of seminars on Large Language Models☆34Updated last year