google-deepmind / transformer_ngrams
☆13 Updated 2 months ago
Alternatives and similar repositories for transformer_ngrams:
Users who are interested in transformer_ngrams compare it to the libraries listed below.
- ☆15 Updated last month
- LLM training in simple, raw C/CUDA ☆14 Updated last month
- ☆21 Updated 3 months ago
- Code for "Counterfactual Token Generation in Large Language Models", arXiv 2024. ☆23 Updated 2 months ago
- JAX-like function transformation engine, but micro: microjax ☆30 Updated 2 months ago
- Evaluation of neuro-symbolic engines ☆34 Updated 5 months ago
- Minimum Description Length probing for neural network representations ☆18 Updated last week
- Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents. ☆28 Updated this week
- ☆27 Updated 6 months ago
- ☆31 Updated 9 months ago
- Exploration into the Firefly algorithm in PyTorch ☆33 Updated 4 months ago
- ☆14 Updated 3 months ago
- Training hybrid models for dummies. ☆16 Updated this week
- Modular, scalable library to train ML models ☆52 Updated this week
- ☆44 Updated this week
- Latent Large Language Models ☆17 Updated 4 months ago
- ☆12 Updated 4 months ago
- Learn online intrinsic rewards from LLM feedback ☆33 Updated last month
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf… ☆24 Updated 3 months ago
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper ☆12 Updated last month
- ☆40 Updated 3 weeks ago
- Alpha-Zero Connect Four NN trained via self-play ☆13 Updated 3 months ago
- Understanding how features learned by neural networks evolve throughout training ☆32 Updated 2 months ago
- BH hackathon ☆14 Updated 9 months ago
- Collection of tests performed during the study of the new Kolmogorov-Arnold Neural Networks (KAN) ☆35 Updated 3 months ago
- ☆46 Updated last month
- ☆19 Updated 5 months ago
- ☆16 Updated 2 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, in pure C. ☆21 Updated 6 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta ☆13 Updated 2 months ago