avilum / yallaLinks

A tiny LLM Agent with minimal dependencies, focused on local inference.

☆53

Alternatives and similar repositories for yalla

Users that are interested in yalla are comparing it to the libraries listed below

Sorting:

AlmogBaku / openai-streaming
Work with OpenAI's streaming API at ease with Python generators
☆121Updated last year
AlmogBaku / pytest-evals
A pytest plugin for running and analyzing LLM evaluation tests.
☆127Updated 4 months ago
raptor-ml / raptor
Transform your pythonic research to an artifact that engineers can deploy easily.
☆154Updated last week
zipnn / zipnn
A Lossless Compression Library for AI pipelines
☆250Updated 2 months ago
ilanbenb / wa_llm
A WhatsApp bot that can participate in group conversations, powered by AI. The bot monitors group messages and responds when mentioned.
☆78Updated last week
ComposioHQ / Composio-Function-Calling-Benchmark
Function Calling Benchmark & Testing
☆86Updated 11 months ago
marcusschiesser / open-swarm
Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.
☆92Updated 8 months ago
inferx-net / inferx
InferX is a Inference Function as a Service Platform
☆111Updated last week
ajac-zero / midrasai
☆100Updated 7 months ago
langsec-ai / langsec
Security framework for LLM-generated SQL queries 🛡️
☆30Updated 7 months ago
anyscale / llm-router
Tutorial for building LLM router
☆211Updated 11 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆183Updated last year
normal-computing / extended-mind-transformers
☆121Updated 10 months ago
mangate / SelfGPT
A Whatsapp-bot that allows access to GPT3 while also serving as your own memory backup
☆135Updated 2 years ago
uripeled2 / llm-client-sdk
SDK for using LLM
☆80Updated last year
gilad-rubin / hypster
HyPSTER - HyperParameter optimization on STERoids
☆48Updated 7 months ago
BrunoGeorgevich / llama3.cp
Adapted version of llama3.np (NumPy) to a CuPy implementation for the Llama 3 model.
☆35Updated last year
aviveldan / datagov-mcp
MCP server for Israel Government Data
☆60Updated this week
run-ai / runai-model-streamer
☆222Updated this week
brizzai / auto-mcp
Transform any OpenAPI/Swagger definition into a fully-featured Model Context Protocol (MCP) server
☆152Updated 2 weeks ago
cognitivecomputations / kraken
☆66Updated last year
Arize-ai / openinference
OpenTelemetry Instrumentation for AI Observability
☆480Updated this week
abgulati / hf-waitress
Serving LLMs in the HF-Transformers format via a PyFlask API
☆71Updated 9 months ago
hunch-app / declarai
A Pythonic integration for LLMs.
☆88Updated last year
cohere-ai / quick-start-connectors
This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…
☆151Updated 8 months ago
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated 11 months ago
monk1337 / auto-ollama
run ollama & gguf easily with a single command
☆51Updated last year
willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆193Updated 11 months ago
cognitivecomputations / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
hide-org / hide
🤖 Headless IDE for AI agents
☆191Updated 2 months ago