Runs LLaMA with Extremely HIGH speed
☆95Nov 21, 2023Updated 2 years ago
Alternatives and similar repositories for fast-llama
Users that are interested in fast-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- Efficient 3bit/4bit quantization of LLaMA models☆18May 18, 2023Updated 3 years ago
- Lightweight frontend library for GHC with JavaScript Backend☆18Dec 17, 2024Updated last year
- Sentence Embedding as a Service☆15Jun 30, 2025Updated 11 months ago
- Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops☆30Mar 16, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆16May 13, 2024Updated 2 years ago
- 动手写全文搜索引擎☆10Aug 12, 2020Updated 5 years ago
- ☆17Mar 8, 2020Updated 6 years ago
- ☆14Feb 7, 2024Updated 2 years ago
- [CVPR 2024 Highlight] Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering☆10Jul 29, 2024Updated last year
- Learning about CUDA by writing PTX code.☆159Feb 27, 2024Updated 2 years ago
- Arbitrage screener with live market simulations and data logging to find perfect pairs.☆10Dec 2, 2022Updated 3 years ago
- RViz plugin to display normal vectors of points in a point cloud, if available.☆17May 4, 2023Updated 3 years ago
- A computationally efficient and robust LiDAR-inertial odometry (LIO) package☆13Aug 4, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Evaluate mapping quality using intrinsic and extrinsic metrics☆12Mar 17, 2022Updated 4 years ago
- This source code (in Python) is a preliminary implementation of my quadratic-time positive integer matrix multiplication.☆10Nov 23, 2022Updated 3 years ago
- PyTorch implementation of joint coordinate and sparse parametric encodings for offline RGB-D surface reconstruction☆19May 13, 2023Updated 3 years ago
- Revisiting Fast and Accurate RGB-D Odometry for Real-World Use by Embracing Simplicity☆31Aug 31, 2025Updated 9 months ago
- Safari Reader Mode Source Code☆20Mar 5, 2024Updated 2 years ago
- A header-only C++ library for fitting and optimizing uniform B-splines using the Ceres Solver.☆14Jun 2, 2026Updated last week
- ⚡ Running online SfM 🌐 while capturing images 📸☆32Sep 27, 2025Updated 8 months ago
- Context-aware LLM Translator (CALT)☆51Jan 8, 2025Updated last year
- ROEVO: Robust Organized Edge Feature-based Visual Odometry Using RGB-D Cameras☆30Aug 26, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆15Jan 5, 2025Updated last year
- A project demonstration on how to use the GigE camera to do the DeepStream Yolo3 object detection, how to set up the GigE camera, and dep…☆17Feb 11, 2022Updated 4 years ago
- Automatically annotates YOLO dataset using Moondream visual model☆21Aug 24, 2025Updated 9 months ago
- notes on langchain☆18Mar 20, 2026Updated 2 months ago
- A common protocol for AI agent tools☆10Oct 21, 2024Updated last year
- ☆17Sep 2, 2023Updated 2 years ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 9 months ago
- A novel media player that allows you to navigate by speaker☆103Mar 25, 2026Updated 2 months ago
- This code converts a point cloud obtained by a Velodyne VLP16 3D-Lidar sensor into a depth image mono16.☆13Aug 29, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Yet another `llama.cpp` Rust wrapper☆11May 31, 2026Updated last week
- Individual-tree isolation (treeiso) from terrestrial laser scanning☆18Sep 21, 2025Updated 8 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 11 months ago
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- Differentiable ICP implementation for learning tasks.☆19Feb 3, 2025Updated last year
- Timedomain-Ai-Singer omniverse插件☆21Feb 15, 2023Updated 3 years ago
- Code execution runtime for the STAC Overflow: Map Floodwater from Radar Imagery competition☆12Sep 29, 2021Updated 4 years ago