sbnb-io / gemma3n-profiling
Profiling Google Gemma 3n Model Using PyTorch Profiler
☆16 Updated 6 months ago
Alternatives and similar repositories for gemma3n-profiling
Users that are interested in gemma3n-profiling are comparing it to the libraries listed below
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs ☆622 Updated this week
- InferX: Inference as a Service Platform ☆151 Updated this week
- Docs for GGUF quantization (unofficial) ☆360 Updated 6 months ago
- The Fastest Way to Fine-Tune LLMs Locally ☆333 Updated last month
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc ☆2,260 Updated last week
- Big & Small LLMs working together ☆1,249 Updated this week
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre… ☆799 Updated 3 weeks ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆165 Updated last year
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,182 Updated last year
- A little(lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture. ☆55 Updated 9 months ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions" ☆353 Updated last year
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications. ☆678 Updated last year
- Manifold is an experimental platform for enabling long horizon workflow automation using teams of AI assistants. ☆477 Updated this week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆843 Updated last week
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A… ☆964 Updated 7 months ago
- Enhancing LLMs with LoRA ☆206 Updated 3 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification. ☆681 Updated 10 months ago
- llama.cpp fork with additional SOTA quants and improved performance ☆1,553 Updated this week
- ☆135 Updated 8 months ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer ☆258 Updated 3 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information ☆457 Updated 6 months ago
- ☆27 Updated 7 months ago
- An open-source tool for LLM prompt optimization. ☆754 Updated 3 weeks ago
- Large-scale LLM inference engine ☆1,631 Updated last week
- tl/dw (Too Long, Didn't Watch): Your Personal Research Multi-Tool - a naive attempt at 'A Young Lady's Illustrated Primer' (Open Source N… ☆1,220 Updated this week
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. ☆628 Updated last year
- Optimizing inference proxy for LLMs ☆3,299 Updated this week
- Tool for generating high quality Synthetic datasets ☆1,476 Updated 3 months ago
- API Server for Transformer Lab ☆82 Updated 2 months ago
- Realtime demo, Streaming and Finetuning code for CSM ☆439 Updated 4 months ago