stanford-futuredata / Megatron-LM
Ongoing research training transformer models at scale
☆34Updated last year
Alternatives and similar repositories for Megatron-LM:
Users who are interested in Megatron-LM are comparing it to the libraries listed below
- ☆48Updated last year
- ☆60Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆36Updated 8 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- ☆41Updated 9 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- auto fine tune of models with synthetic data☆74Updated 11 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Not financial advice.☆28Updated last year
- Routing on Random Forest (RoRF)☆100Updated 4 months ago
- Demo of ConversationEntityMemory in Streamlit.☆52Updated 2 years ago
- ☆46Updated 9 months ago
- Chat Markup Language conversation library☆55Updated last year
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆63Updated 2 months ago
- Score LLM pretraining data with classifiers☆54Updated last year
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated last year
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆17Updated 8 months ago
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- Using modal.com to process FineWeb-edu data☆19Updated last month
- A framework for orchestrating AI agents using a mermaid graph☆74Updated 8 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs☆117Updated 6 months ago
- look how they massacred my boy☆63Updated 3 months ago
- Verbosity control for AI agents☆59Updated 8 months ago
- ☆22Updated last year
- ☆109Updated last month
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆30Updated last month
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 4 months ago
- An automated tool for discovering insights from research paper corpora☆136Updated 7 months ago