stanford-futuredata / Megatron-LM
Ongoing research training transformer models at scale
☆33Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for Megatron-LM
- Simple examples using Argilla tools to build AI☆38Updated this week
- ☆48Updated last year
- look how they massacred my boy☆53Updated 3 weeks ago
- Not financial advice.☆27Updated last year
- Routing on Random Forest (RoRF)☆82Updated last month
- RAG example using DSPy, Gradio, FastAPI☆64Updated 6 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆32Updated 5 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- ☆55Updated 11 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 2 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆41Updated last month
- ☆22Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆56Updated 3 months ago
- Demo of ConversationEntityMemory in Streamlit.☆51Updated last year
- Simple Graph Memory for AI applications☆79Updated 3 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆28Updated 10 months ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆90Updated 8 months ago
- Verbosity control for AI agents☆56Updated 5 months ago
- ☆103Updated 7 months ago
- BH hackathon☆14Updated 7 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 5 months ago
- Score LLM pretraining data with classifiers☆55Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- Writing Blog Posts with Generative Feedback Loops!☆42Updated 7 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆57Updated 6 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆22Updated last month
- ☆36Updated 3 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆23Updated 11 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆62Updated last month