Bamboo-7B Large Language Model
☆94Mar 28, 2024Updated 2 years ago
Alternatives and similar repositories for Bamboo
Users that are interested in Bamboo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- High-speed and easy-use LLM serving framework for local deployment☆155Aug 7, 2025Updated 10 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- High-speed Large Language Model Serving for Local Deployment☆9,548May 11, 2026Updated last month
- ☆12Mar 27, 2026Updated 2 months ago
- A bare metal AArch64 hello-world program, that is run in a KVM AArch64 VM.☆29Jan 6, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory☆38Apr 19, 2023Updated 3 years ago
- The project now is moved to github.com/SJTU-IPADS/ServerlessBench. An open-sourced benchmark suite for serverless computing☆22May 20, 2022Updated 4 years ago
- Addendum to FAST15 Paper: Analysis of the ECMWF Storage Landscape☆14Apr 9, 2015Updated 11 years ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 9 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆100Apr 2, 2026Updated 2 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- ☆11Oct 11, 2023Updated 2 years ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆16Nov 1, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Improving the tail latency of in-memory transactional system.☆10Mar 27, 2022Updated 4 years ago
- GPU operators for sparse tensor operations☆37Mar 11, 2024Updated 2 years ago
- ☆29Mar 17, 2025Updated last year
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerf…☆25Mar 27, 2024Updated 2 years ago
- ☆12May 30, 2025Updated last year
- Controllable Language Model Interactions in TypeScript☆10May 17, 2024Updated 2 years ago
- ☆11Feb 20, 2025Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆37Aug 14, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A benchmark suite for serverless computing☆233Feb 24, 2025Updated last year
- Open-Channel SSD emulator using memory☆22Nov 1, 2017Updated 8 years ago
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Sep 5, 2024Updated last year
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- A wallpaper engine web wallpaper☆24Jan 6, 2023Updated 3 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.☆106Jul 9, 2025Updated 11 months ago
- Smart OpenAI‑compatible proxy for llama.cpp: manages slots, saves/restores KV cache to disk, routes requests by prefix similarity, and pr…☆46Nov 14, 2025Updated 7 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆61Apr 20, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- RWKV-7 mini☆12Mar 29, 2025Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- [OSDI 2024] Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory☆50Mar 3, 2024Updated 2 years ago
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- [ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks☆41Feb 4, 2025Updated last year
- Implementation of Spectral State Space Models☆16Feb 23, 2024Updated 2 years ago
- ☆17May 22, 2023Updated 3 years ago