graphcore / Gradient-HuggingFace
Tasks and tutorials using Graphcore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace
☆16Updated last year
Alternatives and similar repositories for Gradient-HuggingFace
Users who are interested in Gradient-HuggingFace are comparing it to the libraries listed below.
- A framework for few-shot evaluation of autoregressive language models.☆11Updated 3 weeks ago
- ☆53Updated 9 months ago
- Data preparation code for Amber 7B LLM☆91Updated last year
- ☆34Updated last week
- QuIP quantization☆55Updated last year
- Source code for Activated LoRA☆14Updated this week
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆223Updated last year
- An introduction to LLM Sampling☆79Updated 7 months ago
- ☆23Updated 2 years ago
- Docker image for NVIDIA GH200 machines, optimized for vLLM serving and HF Trainer fine-tuning☆47Updated 5 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆49Updated 9 months ago
- Repo hosting code and materials related to speeding up LLM inference using token merging.☆36Updated 2 weeks ago
- A new repo to demonstrate tutorials for using HuggingFace on Graphcore IPUs.☆12Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆105Updated 7 months ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Collection of autoregressive model implementations☆86Updated 3 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated 11 months ago
- Open Implementations of LLM Analyses☆105Updated 10 months ago
- A repository for research on medium-sized language models.☆78Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 6 months ago
- Library to facilitate pruning of LLMs based on context☆32Updated last year
- ☆74Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆162Updated last year
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆79Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Updated this week
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆127Updated 8 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆81Updated last week