graphcore / Gradient-HuggingFaceLinks
Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace
☆16Updated last year
Alternatives and similar repositories for Gradient-HuggingFace
Users that are interested in Gradient-HuggingFace are comparing it to the libraries listed below
Sorting:
- A framework for few-shot evaluation of autoregressive language models.☆12Updated 3 months ago
- ☆36Updated 2 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 10 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated 2 years ago
- An introduction to LLM Sampling☆79Updated 10 months ago
- ☆55Updated 11 months ago
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆46Updated last year
- ☆23Updated 2 years ago
- ☆102Updated 9 months ago
- Multi-Domain Expert Learning☆66Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- QuIP quantization☆59Updated last year
- ☆43Updated last year
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆55Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆46Updated 2 years ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆224Updated last month
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆42Updated last year
- A repository for research on medium sized language models.☆78Updated last year
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …