MuhammadBinUsman03 / Real-Time-3-pipeline-LLM-Financial-Advisor
3-Pipeline LLMOps financial advisor. The streaming pipeline, deployed on AWS, collects live data 24/7 and embeds it into QdrantDB. The training pipeline fine-tunes the model on a serverless GPU and logs the best model to the WandB Registry. The inference pipeline downloads the best model from the registry for inference and uses LangChain to maintain history and perform context retrieval.
☆23Updated 4 months ago
Alternatives and similar repositories for Real-Time-3-pipeline-LLM-Financial-Advisor
Users that are interested in Real-Time-3-pipeline-LLM-Financial-Advisor are comparing it to the libraries listed below
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 months ago
- 💻 Decoding ML articles hub: Hands-on articles with code on production-grade ML☆138Updated 5 months ago
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Updated last year
- Multimodal AI workloads: batch inference, model training and online serving.☆52Updated last week
- ☆33Updated 9 months ago
- Optimized Large Language Models for Financial Applications – Efficient, Scalable, and Domain-Specific AI for Finance.☆51Updated last month
- A collection of hands-on notebooks for LLM practitioners☆50Updated 7 months ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and…☆124Updated 3 weeks ago
- GenAI Experimentation☆57Updated last week
- Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚☆192Updated 8 months ago
- Test LLMs automatically with Giskard and CI/CD☆30Updated last year
- Example code and notebooks related to mlflow, llmops, etc.☆43Updated last year
- Various installation guides for Large Language Models☆72Updated 4 months ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆29Updated 7 months ago
- A Hands-on Practical Guide to LlamaIndex☆33Updated 10 months ago
- How to serve ML predictions 100x faster☆58Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆113Updated 5 months ago
- ☆20Updated last year
- Leetcode Intensive tutorial and study guide generated by llama-index, networkx, scikit-learn and pydantic☆114Updated last year
- ☆35Updated 9 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆11Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- 📚 Tutorial on building a modern search app for Amazon e-commerce products leveraging tabular semantic search and natural language querie…☆83Updated 4 months ago
- Enterprise-grade memory framework for LLMs featuring GPU-optimized inference, vector storage, and automated scaling. Enables hyper-person…☆89Updated 3 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated last month
- ☆15Updated 2 years ago
- Material for the series of seminars on Large Language Models☆34Updated last year
- A template to kick-start your Python project ✨🚀☆52Updated last month
- A tutorial on how to use Model Context Protocol by Anthropic and Agent2Agent Protocol by Google☆88Updated 4 months ago
- ☆24Updated 8 months ago