graphcore / Gradient-HuggingFaceLinks
Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace
☆16Updated last year
Alternatives and similar repositories for Gradient-HuggingFace
Users that are interested in Gradient-HuggingFace are comparing it to the libraries listed below
Sorting:
- A framework for few-shot evaluation of autoregressive language models.☆12Updated 2 months ago
- ☆54Updated 10 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆41Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated 2 years ago
- Open Implementations of LLM Analyses☆107Updated 11 months ago
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 9 months ago
- Data preparation code for Amber 7B LLM☆93Updated last year
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated last year
- ☆34Updated last month
- Mixing Language Models with Self-Verification and Meta-Verification☆110Updated 9 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 9 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last month
- An introduction to LLM Sampling☆79Updated 9 months ago
- train with kittens!☆62Updated 10 months ago
- ☆19Updated last month
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- ☆46Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- ☆46Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆192Updated last year
- ☆142Updated 2 weeks ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆52Updated last month
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆112Updated last month
- The Foundation Model Transparency Index☆82Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- KV Cache Steering for Inducing Reasoning in Small Language Models☆39Updated last month