graphcore / distributed-kge-poplar
The application is an end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise performance on the WikiKG90Mv2 dataset.
☆18 Updated 3 weeks ago
Alternatives and similar repositories for distributed-kge-poplar:
Users who are interested in distributed-kge-poplar are comparing it to the libraries listed below:
- Training hybrid models for dummies. ☆21 Updated 3 months ago
- Tasks and tutorials using Graphcore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace ☆16 Updated last year
- Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks" ☆19 Updated this week
- Minimum Description Length probing for neural network representations ☆19 Updated 3 months ago
- Code repo for MathAgent ☆15 Updated last year
- 🧮 Algebraic Positional Encodings. ☆12 Updated 4 months ago
- Evaluation of neuro-symbolic engines ☆35 Updated 9 months ago
- Implementation of Spectral State Space Models ☆16 Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models. ☆18 Updated this week
- Aioli: A unified optimization framework for language model data mixing ☆25 Updated 3 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" using PyTorch and Zeta ☆13 Updated 6 months ago
- MPI Code Generation through Domain-Specific Language Models ☆13 Updated 5 months ago
- ☆10 Updated 2 years ago
- ☆14 Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated last year
- ☆39 Updated last year
- Latent Large Language Models ☆18 Updated 8 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆14 Updated last year
- Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model ☆14 Updated last year
- The compressor-retriever architecture for language model OS ☆16 Updated 8 months ago
- ☆18 Updated last year
- MLX implementation of GCN, with benchmark on MPS, CUDA and CPU (M1 Pro, M2 Ultra, M3 Max). ☆24 Updated last year
- The official evaluation suite and dynamic data release for MixEval. ☆11 Updated 7 months ago
- Elevate your language models with insightful diversity metrics. ☆11 Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆32 Updated 7 months ago
- ☆11 Updated 2 months ago
- ☆25 Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated 11 months ago
- Code associated with papers on superposition (in ML interpretability) ☆27 Updated 2 years ago