seungrokj / ai_sprint_parisLinks
☆12Updated this week
Alternatives and similar repositories for ai_sprint_paris
Users that are interested in ai_sprint_paris are comparing it to the libraries listed below
Sorting:
- A sample pattern for running CI tests on Modal☆18Updated 2 months ago
- ☆13Updated 7 months ago
- A small python library to run iterators in a separate process☆10Updated last year
- Simple repository for training small reasoning models☆33Updated 5 months ago
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- PyTorch implementation for MRL☆18Updated last year
- ☆30Updated 7 months ago
- Because it's there.☆16Updated 9 months ago
- A read heavy distributed key-value storage system which enables clients to perform read and write operations efficiently.☆8Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- LLM attention pattern visualizer☆10Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 9 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆11Updated 3 weeks ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated last week
- efficient query encoding for dense retrieval☆11Updated 11 months ago
- ☆28Updated 3 weeks ago
- ☆18Updated last month
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆18Updated last week
- Python implementation of Age-Partitioned Bloom Filter with S3 periodic backup support.☆11Updated 5 months ago
- Training hybrid models for dummies.☆23Updated 5 months ago
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆45Updated last year
- ☆18Updated last year
- Experiments to assess SPADE on different LLM pipelines.☆17Updated last year
- ☆23Updated 7 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 3 months ago
- 📰 Computing the information content of trained neural networks☆21Updated 3 years ago
- ☆29Updated 2 weeks ago
- ☆38Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated 2 years ago