graphcore / Gradient-HuggingFace
Tasks and tutorials using Graphcore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace
☆15Updated last year
Alternatives and similar repositories for Gradient-HuggingFace:
Users that are interested in Gradient-HuggingFace are comparing it to the libraries listed below
- Training hybrid models for dummies.☆20Updated 3 months ago
- ☆48Updated 5 months ago
- The application is an end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Updated this week
- Repo hosting code and materials related to speeding up LLM inference using token merging.☆36Updated 11 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated last year
- ☆17Updated 2 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 5 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Lightweight tools for quick and easy LLM demos☆26Updated 7 months ago
- ☆46Updated 9 months ago
- Latent Large Language Models☆17Updated 8 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 7 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 6 months ago
- Aioli: A unified optimization framework for language model data mixing☆23Updated 3 months ago
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 7 months ago
- ☆33Updated 2 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- Pre-training code for CrystalCoder 7B LLM☆54Updated 11 months ago
- Demonstration that finetuning a RoPE model on longer sequences than it was pre-trained on extends the model's context limit☆63Updated last year
- ☆24Updated last year
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆82Updated last month
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated last week
- ☆13Updated 6 months ago
- Distill ChatGPT's coding ability into a small model (1B)☆28Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆23Updated this week
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago