ibm-granite / granite-3.0-language-models
☆207Updated this week
Related projects ⓘ
Alternatives and complementary repositories for granite-3.0-language-models
- Banishing LLM Hallucinations Requires Rethinking Generalization☆261Updated 3 months ago
- awesome synthetic (text) datasets☆239Updated 2 weeks ago
- ☆131Updated 3 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆206Updated 2 weeks ago
- Let's build better datasets, together!☆202Updated 3 months ago
- An Open Source Toolkit For LLM Distillation☆354Updated last month
- ☆92Updated last month
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆173Updated last week
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆328Updated 4 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆236Updated 4 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆172Updated 3 months ago
- AWM: Agent Workflow Memory☆203Updated last month
- Automatically evaluate your LLMs in Google Colab☆557Updated 6 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆160Updated last week
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆173Updated last week
- ☆148Updated 3 months ago
- ☆103Updated 2 months ago
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.☆289Updated 2 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆194Updated 6 months ago
- ☆149Updated 2 weeks ago
- prime (previously called ZeroBand) is a framework for efficient, globally distributed training of AI models over the internet.☆207Updated this week
- Official repo for "Make Your LLM Fully Utilize the Context"☆241Updated 5 months ago
- Just a bunch of benchmark logs for different LLMs☆114Updated 3 months ago
- ☆311Updated last month
- The official evaluation suite and dynamic data release for MixEval.☆224Updated this week
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆292Updated 10 months ago
- GRadient-INformed MoE☆258Updated last month
- A compact LLM pretrained in 9 days by using high quality data☆260Updated last month
- Tutorial for building LLM router☆159Updated 3 months ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆216Updated 7 months ago