spather / transformer-experimentsLinks

Some experiments on transformer models

☆11

Alternatives and similar repositories for transformer-experiments

Users that are interested in transformer-experiments are comparing it to the libraries listed below

Sorting:

xjdr-alt / muzero_sketch
☆38Updated 11 months ago
ivanleomk / modal-grpo
☆20Updated 3 months ago
rasbt / nn_plus_gzip
Gzip and nearest neighbors for text classification
☆57Updated last year
intellectronica / battle-of-the-semantics
GraphRag vs Embeddings
☆14Updated 11 months ago
brendanhasz / dagio
A python package for running directed acyclic graphs of asynchronous I/O operations
☆16Updated 3 years ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 2 months ago
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆62Updated last month
kylegallatin / recsim
A framework for simulating e-commerce data and interactions that can be used to build recommendation systems
☆10Updated last year
BBischof / yapping
Verbosity control for AI agents
☆63Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆53Updated 4 months ago
AnswerDotAI / playwrightnb
Use sync mode Playwright interactively, inside a Jupyter notebook
☆14Updated 2 months ago
SumanthRH / python-mastery
My solutions for Advanced Python Mastery (course by @dabeaz)
☆11Updated last year
explodinggradients / Funtuner
Supervised instruction finetuning for LLM with HF trainer and Deepspeed
☆35Updated last year
ekshaks / ragpipe
Iterate fast on your RAG pipelines
☆23Updated last week
hamelsmu / replicate-examples
☆22Updated last year
jimmc414 / document_intelligence
Automated Document Intelligence Workflow
☆22Updated 6 months ago
v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19Updated last year
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆20Updated last month
morgangiraud / text-to-sql-proto
A text-to-SQL prototype on the northwind sqlite dataset
☆12Updated 9 months ago
Alignment-Lab-AI / datagen
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆30Updated 9 months ago
anpaure / cp_eval
Tiny evaluation of leading LLMs on competitive programming problems
☆14Updated 7 months ago
dm4ml / motion
Framework for building and maintaining self-updating prompts for LLMs
☆63Updated last year
Avmb / inverse_scaling_prize_code_identifier_swap
Submission to the inverse scaling prize
☆23Updated last year
AnswerDotAI / toolslm
Tools to make language models a bit easier to use
☆47Updated this week
swairshah / Intensify
coloring terminal text with intensities (used for plotting probability, entropy with tokens)
☆12Updated 8 months ago
replicate / cog-vllm
Run LLMs on Replicate with vLLM
☆20Updated 8 months ago
charlesfrye / cuda-substrings
Because it's there.
☆16Updated 9 months ago
dm4ml / gate
Drift detection module for machine learning pipelines.
☆25Updated 2 years ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆33Updated 4 months ago
rbitr / llama2.ipynb
Ipython notebook copy of Andrej Karpathy's llama2.c
☆23Updated last year