llama-cpp-python-exploit
☆15Oct 14, 2023Updated 2 years ago
Alternatives and similar repositories for llama-cpp-python-exploit
Users that are interested in llama-cpp-python-exploit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- langchain-prompt-exploit☆14Oct 14, 2023Updated 2 years ago
- llm-agent-smith☆12Oct 15, 2023Updated 2 years ago
- pandasai-sandbox-exploit☆13Oct 14, 2023Updated 2 years ago
- Example of multi-process, multi-GPU training using Torch-parallel, nVidia-nccl, and nVidia-MPS☆17Sep 22, 2016Updated 9 years ago
- RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct☆31Feb 23, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scientific computing with Metal in C++: Matrix multiplication example☆49Sep 18, 2022Updated 3 years ago
- Code examples for The Java Trove series☆73Apr 1, 2020Updated 6 years ago
- Repository for the EM German Model☆112Nov 13, 2023Updated 2 years ago
- Fully automated end-to-end framework to extract data from bar plots and other figures in scientific research papers using modules such as…☆131Jul 15, 2021Updated 4 years ago
- The creative suite for character-driven AI experiences.☆193Sep 6, 2024Updated last year
- ☆229Updated this week
- Accompanying files for "Real world Devops project from start to finish" course☆266Dec 9, 2024Updated last year
- Exploring the scalable matrix extension of the Apple M4 processor☆231Nov 7, 2024Updated last year
- Efficient optimizers☆326May 13, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A small OpenCL benchmark program to measure peak GPU/CPU performance.☆296Updated this week
- Memory mapped numpy arrays of varying shapes☆309Feb 3, 2026Updated 3 months ago
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆381Apr 14, 2026Updated last month
- RetroWrite -- Retrofitting compiler passes through binary rewriting☆745Apr 26, 2025Updated last year
- ☆642Updated this week
- CLIP inference in plain C/C++ with no extra dependencies☆558Jun 19, 2025Updated 11 months ago
- FlashAttention (Metal Port)☆603Sep 22, 2024Updated last year
- Apple GPU microarchitecture☆608Sep 22, 2024Updated last year
- Run inference on MPT-30B using CPU☆575Jun 30, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A tool for bandwidth measurements on NVIDIA GPUs.☆700Apr 8, 2026Updated last month
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference☆670Updated this week
- Open-source tool to visualise your RAG 🔮☆1,218Jan 3, 2025Updated last year
- Interactive architecture diagrams for codebases☆1,671Updated this week
- Typing animations with React☆1,404Dec 7, 2022Updated 3 years ago
- A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.☆1,007May 7, 2024Updated 2 years ago
- What would you do with 1000 H100s...☆1,173Jan 10, 2024Updated 2 years ago
- llama.cpp fork with additional SOTA quants and improved performance☆2,554Updated this week
- The WeightWatcher tool for predicting the accuracy of Deep Neural Networks☆1,748May 11, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,188Aug 26, 2025Updated 9 months ago
- AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across div…☆3,190May 20, 2026Updated last week
- [ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model☆1,973Nov 15, 2024Updated last year
- Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.☆2,175Feb 18, 2025Updated last year
- Puzzles for learning Triton☆2,444Apr 1, 2026Updated last month
- Everything we actually know about the Apple Neural Engine (ANE)☆2,466Mar 12, 2026Updated 2 months ago
- Universal local privilege escalation Proof-of-Concept exploit for CVE-2024-1086, working on most Linux kernels between v5.14 and v6.6, in…☆2,446Apr 17, 2024Updated 2 years ago