RayFernando1337 / LLM-Calc
Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models for inference.
β102Updated 3 weeks ago
Related projects β
Alternatives and complementary repositories for LLM-Calc
- β70Updated this week
- π€ Headless IDE for AI agentsβ133Updated this week
- Official homepage for "Self-Harmonized Chain of Thought"β83Updated 2 months ago
- β60Updated 3 weeks ago
- Gradio based tool to run opensource LLM models directly from Huggingfaceβ87Updated 4 months ago
- Routing on Random Forest (RoRF)β84Updated last month
- 90% of what you need for LLM app development. Nothing you don't.β79Updated this week
- β104Updated 8 months ago
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcherβ150Updated 2 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasksβ162Updated last month
- Transcribe and summarize videos using whisper and llms on apple mlx frameworkβ70Updated 9 months ago
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fastβ58Updated 3 weeks ago
- AI agent workflow for generating profiles of clients and running research tasks for them. There is an agent for each part of the process:β¦β69Updated last month
- β94Updated 2 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDBβ116Updated 9 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.β110Updated 6 months ago
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generateβ¦β36Updated 2 months ago
- For LLMs to better code with Jina APIβ108Updated last week
- Dabbling with ReAct chatbotsβ164Updated 3 months ago
- A simple Python program to implement the search-extract-summarize flow.β197Updated this week
- A fork of OpenAI Swarm that supports Groq and Anthropicβ85Updated last month
- Chat with any website on your local machineβ71Updated 4 months ago
- β76Updated 8 months ago
- Solving data for LLMs - Create quality synthetic datasets!β137Updated last month
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automaβ¦β45Updated last month
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing dβ¦β137Updated 7 months ago
- RAG example using DSPy, Gradio, FastAPIβ66Updated 7 months ago
- β112Updated this week
- Building Blocks for Multi-Modal Gradio Powered by Groq Appsβ88Updated 2 weeks ago