llama.cpp to PyTorch Converter
☆38Apr 8, 2024Updated 2 years ago
Alternatives and similar repositories for llama-cpp-torch
Users that are interested in llama-cpp-torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- QuIP quantization☆66Mar 17, 2024Updated 2 years ago
- A sample Next.js website to showcase how you can make invite-only SPAs with Next.js, AirTable & Vercel☆19Aug 18, 2024Updated last year
- JAX implementations of RWKV☆18Sep 26, 2023Updated 2 years ago
- Simple tool for partial optimization of ONNX. Further optimize some models that cannot be optimized with onnx-optimizer and onnxsim by se…☆19May 7, 2024Updated 2 years ago
- 更纯粹、更高压缩率的Tokenizer in Rust☆13Dec 21, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…☆15Feb 25, 2026Updated 3 months ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆13May 19, 2025Updated last year
- Llama.cui is a small llama.cpp-based chat application for Node.js☆20Jul 10, 2025Updated 10 months ago
- An implementation of the hammer2 filesystem for Plan 9☆19Nov 25, 2018Updated 7 years ago
- A custom classloader to be used from maven generated artifacts to allow executable jars and custom exclusion of some libraries at runtime…☆15Sep 13, 2016Updated 9 years ago
- ☆20May 22, 2025Updated last year
- Mixed precision training from scratch with Tensors and CUDA☆30May 14, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Python gRPC client library for Vald☆18Apr 16, 2026Updated last month
- Simple Application Sandboxing☆23Aug 9, 2024Updated last year
- ☆21Mar 3, 2025Updated last year
- GGUF parser in Python☆28May 1, 2026Updated 3 weeks ago
- Visual Embeddings with OpenAI and Nomic☆13Aug 7, 2023Updated 2 years ago
- ☆17Jan 31, 2025Updated last year
- This is the public repo of the code from ReasonKGE☆16Sep 18, 2021Updated 4 years ago
- ☆35Feb 8, 2024Updated 2 years ago
- ☆125Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- tiny/turbo/throttling HTTP server☆36Sep 10, 2019Updated 6 years ago
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- A simple sparse bitmap implementation in java☆22Jan 28, 2016Updated 10 years ago
- Get the aligned BERT embedding for sequence labeling tasks☆18Jun 6, 2019Updated 6 years ago
- ☆12Oct 9, 2024Updated last year
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- ☆25Jun 26, 2024Updated last year
- (Mirror: moved to https://gitlab.esss.lu.se/ecdc/ess-dmsc/event-formation-unit) Implementation of neutron event formation pipeline for ES…☆12Mar 20, 2026Updated 2 months ago
- Code for my collection of predictors/classifiers/etc☆14Jul 18, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- tenstorrent kernel from twitch☆28Mar 16, 2024Updated 2 years ago
- A rough and ready Python utility which splits audio files based on silence and desired min/max chunk duration.☆16Jun 22, 2022Updated 3 years ago
- Improved ESIM event camera simulator☆17Oct 4, 2024Updated last year
- ☆43Aug 2, 2025Updated 9 months ago
- If it quacks like a tensor...☆59Nov 13, 2024Updated last year
- SoC for CQU Dual Issue Machine☆12Sep 20, 2022Updated 3 years ago
- A text classification and similairty computing project in Python.We have tried wordbag,word2vec,WordMoverDistance,N-gram,LSTM,C-LSTM, LST…☆11May 18, 2019Updated 7 years ago