neuralmagic / examples
Notebooks using the Neural Magic libraries
☆42 Updated 8 months ago
Alternatives and similar repositories for examples:
Users who are interested in examples are comparing it to the repositories listed below:
- ☆20 Updated last year
- Data extraction with LLM on CPU ☆68 Updated last year
- Experimenting text-embeddings-inference server on both CPU and GPU ☆18 Updated last year
- Unstructured Data Connectors for Haystack 2.0 ☆16 Updated last year
- Large Language Model (LLM) Inference API and Chatbot ☆125 Updated last year
- Data extraction with LLM on CPU ☆112 Updated last year
- Table detection with Florence. ☆13 Updated 8 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ☆58 Updated 2 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆80 Updated 10 months ago
- Medical Mixture of Experts LLM using Mergekit. ☆20 Updated last year
- Build Agentic workflows with function calling using open LLMs ☆26 Updated last week
- Some information for working with the Together inference API for Open Source AI models ☆58 Updated last year
- Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆37 Updated last year
- Data extraction with LLM on CPU ☆85 Updated last year
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll… ☆16 Updated 11 months ago
- Running load tests on a FastAPI application using Locust ☆13 Updated last week
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆23 Updated last year
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil… ☆29 Updated last year
- Embed anything. ☆29 Updated 10 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 Updated 5 months ago
- Streamlit app presented to the Streamlit LLMs Hackathon September 23 ☆16 Updated 10 months ago
- ☆19 Updated 5 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆46 Updated 6 months ago
- ☆31 Updated last year
- ☆14 Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 Updated last year
- On-device LLM Inference using Mediapipe LLM Inference API. ☆21 Updated last year
- This repository contains a toy implementation of a basic RAQA system. ☆20 Updated 10 months ago
- Tools for merging pretrained large language models. ☆19 Updated 9 months ago
- Tool to take your ML model from local to production with one-line of code. ☆25 Updated last year