spather / transformer-experiments
Some experiments on transformer models
☆11 Updated last year
Alternatives and similar repositories for transformer-experiments:
Users who are interested in transformer-experiments are comparing it to the libraries listed below.
- ☆34 Updated last week
- NLP with Rust for Python 🦀🐍 ☆62 Updated 10 months ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 Updated last year
- ☆28 Updated 5 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆35 Updated last year
- Run LLMs on Replicate with vLLM ☆18 Updated 6 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 Updated last year
- Gzip and nearest neighbors for text classification ☆56 Updated last year
- ☆41 Updated 2 months ago
- Framework for building and maintaining self-updating prompts for LLMs ☆61 Updated 10 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆39 Updated 2 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread ☆18 Updated last year
- A text-to-SQL prototype on the northwind sqlite dataset ☆12 Updated 7 months ago
- Verbosity control for AI agents ☆62 Updated 11 months ago
- ☆77 Updated 10 months ago
- An introduction to LLM Sampling ☆77 Updated 4 months ago
- Using modal.com to process FineWeb-edu data ☆20 Updated 3 weeks ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆24 Updated last year
- Trade any tensors over the network ☆30 Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebook ☆14 Updated 3 weeks ago
- Python library to use Pleias-RAG models ☆27 Updated this week
- Pre-train Static Word Embeddings ☆56 Updated 2 weeks ago
- ☆22 Updated 11 months ago
- ☆19 Updated 6 months ago
- ☆38 Updated 9 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆37 Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆30 Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 9 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning. ☆50 Updated 6 months ago
- Latent Large Language Models ☆17 Updated 8 months ago