sbnb-io / gemma3n-profilingLinks
Profiling Google Gemma 3n Model Using PyTorch Profiler
☆16Updated 5 months ago
Alternatives and similar repositories for gemma3n-profiling
Users that are interested in gemma3n-profiling are comparing it to the libraries listed below
Sorting:
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆605Updated 2 weeks ago
- Docs for GGUF quantization (unofficial)☆340Updated 5 months ago
- InferX: Inference as a Service Platform☆143Updated this week
- The Fastest Way to Fine-Tune LLMs Locally☆330Updated last week
- ☆28Updated 6 months ago
- Manifold is an experimental platform for enabling long horizon workflow automation using teams of AI assistants.☆475Updated last week
- A little(lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture.☆53Updated 8 months ago
- Enhancing LLMs with LoRA☆197Updated 2 months ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆257Updated 2 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆933Updated 6 months ago
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆150Updated 5 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆1,399Updated this week
- Sparse Inferencing for transformer based LLMs☆215Updated 4 months ago
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆957Updated 6 months ago
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.☆674Updated 11 months ago
- world's stupidest moe llm in 103M parameters☆19Updated 5 months ago
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc☆2,086Updated this week
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆676Updated 9 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information☆452Updated 4 months ago
- ☆85Updated 3 weeks ago
- Big & Small LLMs working together☆1,230Updated this week
- An associative memory system that stores and retrieves experiences using the 5W1H framework (Who, What, When, Where, Why, How) and conten…☆175Updated 3 months ago
- ☆135Updated 7 months ago
- Curate High Quality Datasets, Train, Evaluate and Ship! 🚀☆753Updated this week
- ☆83Updated 9 months ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆351Updated last year
- ☆1,222Updated this week
- ☆201Updated 3 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- API Server for Transformer Lab☆81Updated last month