di37 / LLM-Load-Unload-Ollama
A simple demonstration of how to keep an LLM loaded in memory for a prolonged time, or unload it immediately after inference, when using it via Ollama.
☆13Updated 9 months ago
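The repository's own code is not reproduced on this page, so the following is only a minimal sketch of the idea, assuming Ollama's REST API on its default local port and its `keep_alive` request parameter (a duration such as `"10m"`, a negative number to keep the model loaded indefinitely, or `0` to unload it right after the response). The helper names `build_payload` and `generate`, and the model name `llama3`, are hypothetical and chosen for illustration.

```python
import json
import urllib.request

# Default local Ollama endpoint; adjust if your server runs elsewhere (assumption).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(model, prompt, keep_alive):
    """Build a non-streaming /api/generate request body.

    keep_alive controls how long the model stays in memory after the call:
      "10m" -> keep loaded for ten minutes
      -1    -> keep loaded indefinitely
      0     -> unload immediately after the response
    """
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "keep_alive": keep_alive,
    }


def generate(model, prompt, keep_alive, url=OLLAMA_URL):
    """Send the request to a running Ollama server and return the generated text."""
    data = json.dumps(build_payload(model, prompt, keep_alive)).encode("utf-8")
    request = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

With a server running and the model pulled, `generate("llama3", "Why is the sky blue?", keep_alive=-1)` would leave the model resident for later calls, while the same call with `keep_alive=0` would free the memory as soon as the answer is returned.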
Alternatives and similar repositories for LLM-Load-Unload-Ollama:
Users who are interested in LLM-Load-Unload-Ollama are comparing it to the repositories listed below
- The Flan-T5-base model was fine-tuned on the Nvidia Question and Answer Pair dataset available on Kaggle. This is a beginner-level project who wa…☆22Updated 10 months ago
- Covers RAG application concepts from basic to advanced using the LangChain library.☆16Updated 10 months ago
- Question Answering System API based on all of the Harry Potter books that will allow answering questions about all the events that took place in the Har…☆13Updated last year
- Chatbot implementation using ChatGPT API and Gradio.☆13Updated last year
- ☆12Updated 8 months ago
- This project aims to show how the Whisper model can be fine-tuned on a language it was not trained on, given that it was trained on a similar language.☆11Updated 9 months ago
- This repository contains a project that focuses on evaluating the performance of different Language Models (LLMs) for multi-class news cl…☆16Updated 8 months ago
- This project demonstrates how to utilize Codellama, a local open-source Large Language Model (LLM), and customize its behavior according …☆33Updated 11 months ago
- AI-powered YouTube Notes Generator: Create detailed notes from YouTube videos. Streamlit UI for easy use.☆43Updated 6 months ago
- build_startup_using_AI_Agents☆42Updated 7 months ago
- A Simple Scenes Based Movie Generation App☆49Updated 3 months ago
- A repository of Payman + Langgraph integration examples that allow an AI agent to simply create tasks for humans on Payman that pay them money …☆79Updated 4 months ago
- Fine-tuning Llama-3-8B on the MathInstruct dataset☆30Updated 6 months ago
- A Newsletter Agent that Aggregates Articles and Generates a Newsletter - Langflow, NextJS☆46Updated 2 months ago
- LLM Siri with OpenAI, Perplexity, Ollama, Llama2, Mistral, Mixtral & Langchain☆58Updated last year
- Access your Ollama inference server running on your computer from anywhere. Set up with NextJS + Langchain JS LCEL + Ngrok☆26Updated last year
- An MCP server connecting to a managed index on LlamaCloud☆37Updated last month
- An awesome & curated list of best LLMOps tools for developers☆24Updated last year
- ⚡Ship RAG Solutions Quickly and effortlessly☆121Updated 5 months ago
- ☆41Updated 10 months ago
- ☆17Updated 10 months ago
- A couple scripts to grab stats from email☆41Updated 5 months ago
- An AI Clone For Any X Profile☆80Updated 2 months ago
- Create chatbot and AI agent workflows with unified access.☆43Updated this week
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆24Updated 5 months ago
- Chrome extension that interacts with content using Groq☆41Updated last month
- blablado is an extensible Assistant that listens to your voice and can execute custom Python functions you provided. It can speak as well…☆72Updated 6 months ago
- Agent that summarizes LinkedIn articles into post content☆34Updated 4 months ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆34Updated last year