cognitivecomputations / runpod-vllm
☆15Updated 8 months ago
Related projects: ⓘ
- Embedding models from Jina AI☆55Updated 8 months ago
- Using modal.com to process FineWeb-edu data☆18Updated last week
- a version of baby agi using dspy and typed predictors☆17Updated 6 months ago
- Dockerized AI with CUDA. Llama-cpp-python and stable diffusion.☆0Updated 7 months ago
- Production ready extractors for transformation, extracting embedding or structured data from unstructured data sources.☆28Updated last week
- Very basic framework for parameterized large language model (Q)LoRa fine-tuning using mlx, mlx_lm, and OgbujiPT. Architecture for system…☆32Updated last month
- Run embedding models using ONNX☆23Updated 7 months ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆22Updated 4 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆32Updated 4 months ago
- ☆45Updated this week
- ☆22Updated 2 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆40Updated 4 months ago
- Port of Facebook's LLaMA model in C/C++☆31Updated 6 months ago
- converts url content into JSON with a simple prefix☆60Updated 4 months ago
- ☆36Updated 6 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆16Updated 2 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆15Updated 3 weeks ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆53Updated 2 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated 2 months ago
- ☆31Updated 8 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 4 months ago
- ☆23Updated 8 months ago
- Easily create LLM automation/agent workflows☆54Updated 7 months ago
- ☆3Updated last month
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆21Updated 2 months ago
- Chat Markup Language conversation library☆53Updated 8 months ago
- Cog wrapper for collabora/WhisperSpeech☆23Updated 6 months ago
- A high performance batching router optimises max throughput for text inference workload☆15Updated last year
- Not financial advice.☆26Updated last year
- 🐤 Canary provides UI primitives for building modern search-bar for docs with self-hostable infrastructure.☆41Updated this week