graphcore / Gradient-HuggingFaceLinks
Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace
☆16Updated last year
Alternatives and similar repositories for Gradient-HuggingFace
Users that are interested in Gradient-HuggingFace are comparing it to the libraries listed below
Sorting:
- A framework for few-shot evaluation of autoregressive language models.☆12Updated 5 months ago
- ☆36Updated 5 months ago
- ☆20Updated 3 weeks ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆111Updated last year
- Evaluating LLMs with CommonGen-Lite☆93Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆52Updated 10 months ago
- ☆55Updated last year
- train with kittens!☆63Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Updated 11 months ago
- Multi-Domain Expert Learning☆67Updated last year
- Functional Benchmarks and the Reasoning Gap☆90Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- ☆42Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Updated last year
- Fluid Language Model Benchmarking☆25Updated 3 months ago
- Source code for Activated LoRA☆23Updated last month
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Updated 2 years ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated 2 years ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆19Updated 2 years ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆29Updated last year
- Code for ExploreTom☆89Updated 6 months ago
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Updated 7 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 8 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- ☆39Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- Library to facilitate pruning of LLMs based on context☆32Updated last year