Quantized inference code for LLaMA models
โ13Mar 12, 2023Updated 3 years ago
Alternatives and similar repositories for llama-int8
Users that are interested in llama-int8 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inference code for LLaMA modelsโ189Mar 6, 2023Updated 3 years ago
- ๐งฎ Polynomial Calculatorโ12Jan 3, 2023Updated 3 years ago
- A Lambda expression compiler targeting web assembly.โ20Aug 7, 2024Updated last year
- The Parsec Command Line Interfaceโ14May 31, 2024Updated last year
- a browser gui for nvidia smiโ21Mar 17, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean โข AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ์ํ ์ฆ๋ช ์ธ์ด Agda ์ ๋ฌธโ11May 2, 2023Updated 3 years ago
- Nvidia GPU Fan Controller for linuxโ15May 27, 2024Updated last year
- Experimental high precision n-body simulation for small nโ13Nov 29, 2015Updated 10 years ago
- Example project that shows how to use Antlr with Unityโ14Nov 8, 2015Updated 10 years ago
- โ18Oct 30, 2013Updated 12 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.โ13Apr 21, 2022Updated 4 years ago
- AgenticSearch operates within an agentic workflow, utilizing Gemini 2.0 and an extensive tool registry to handle complex questions. By inโฆโ30Jan 16, 2025Updated last year
- An ANSI C Vector library (Dynamic Array) that is fully configurable, fast, thread safe, reentrant, can store dynamic data structures as wโฆโ23Apr 30, 2024Updated 2 years ago
- A performance-oriented prototyping harness for state of the art Molecular Dynamics algorithmsโ17Apr 27, 2026Updated last week
- Deploy open-source AI quickly and easily - Special Bonus Offer โข AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- refinement types for Elmโ16Jul 12, 2023Updated 2 years ago
- This is the boilerplate to produce a book and an ebook with Pandoc.โ12Aug 7, 2015Updated 10 years ago
- Adds support for searching the current line (in normal vi mode) to zsh.โ14Apr 30, 2019Updated 7 years ago
- Converting text-LMs into Visual Language Modelsโ63Jan 31, 2026Updated 3 months ago
- MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models, sloppily ported to cog/replicateโ12Apr 25, 2023Updated 3 years ago
- Real-time Computational Fluid Dynamics (C/C++, wxWidget)โ17Aug 21, 2023Updated 2 years ago
- Handlebars helper which allows you to group lists by a property of each item.โ10Apr 23, 2019Updated 7 years ago
- Sublime Open Shading Languageโ16Feb 23, 2019Updated 7 years ago
- Central Authentication Service strategy for รberauthโ16Feb 1, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean โข AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- โ10Mar 14, 2025Updated last year
- Screen space global illumination for interactive mixed realityโ14Dec 13, 2017Updated 8 years ago
- A tool to create genomic reports based on 23andMe data.โ17Mar 8, 2024Updated 2 years ago
- This Python script converts your entire Obsidian vault with markdown documents into HTML documents in a new folder with identical structuโฆโ20May 20, 2025Updated 11 months ago
- A framework & a platform for building production-ready AI Agentsโ15Jun 22, 2025Updated 10 months ago
- This is my attempt to convert libnoise to the unreal engine for use in random maps and texturesโ11Jun 13, 2019Updated 6 years ago
- Attempt to implement personal mail server using python+aiosmtpdโ14Sep 14, 2017Updated 8 years ago
- Machine Learning course project to convert a source voice into a target voice.โ12May 26, 2018Updated 7 years ago
- Mesh generation from sparse matricesโ23Nov 5, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Generate C++ method implementations from declarationsโ19Aug 5, 2025Updated 9 months ago
- Set a ROS navigation goal using latitude and longitude.โ10Nov 22, 2020Updated 5 years ago
- CUDA-enabled ollama nix flakeโ14Mar 11, 2024Updated 2 years ago
- A Dell thermal management GUI to control fan speeds and monitor temperaturesโ23Aug 8, 2023Updated 2 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.โ10Apr 12, 2021Updated 5 years ago
- Arduino library and hardware files for the SX1509 IO Expander Breakout board.โ20Jan 27, 2022Updated 4 years ago
- Simple example of UE4(Unreal Engine 4 ) flowmap shaderโ11Sep 11, 2017Updated 8 years ago