nomic-ai / nomic
Interact, analyze and structure massive text, image, embedding, audio and video datasets
☆1,557Updated this week
Alternatives and similar repositories for nomic:
Users that are interested in nomic are comparing it to the libraries listed below
- ☆1,453Updated last year
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain☆3,470Updated last year
- A tiny library for coding with large language models.☆1,224Updated 8 months ago
- Official supported Python bindings for llama.cpp + gpt4all☆1,020Updated last year
- A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.☆1,389Updated last month
- A language for constraint-guided and efficient LLM programming.☆3,851Updated 9 months ago
- LLM(😽)☆1,660Updated last month
- Agent techniques to augment your LLM and push it beyong its limits☆1,570Updated 9 months ago
- A school for camelids☆1,210Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,460Updated 6 months ago
- API to the GPT4All Datalake☆390Updated last year
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings☆1,920Updated last month
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆820Updated last year
- H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/☆4,210Updated this week
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,848Updated last year
- Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engin…☆3,429Updated last month
- 🧠 Motorhead is a memory and information retrieval server for LLMs.☆866Updated 3 months ago
- An open-source visual programming environment for battle-testing prompts to LLMs.☆2,529Updated this week
- ☆3,304Updated last year
- Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.☆2,216Updated this week
- Salesforce open-source LLMs with 8k sequence length.☆717Updated last month
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,671Updated 3 months ago
- LLM as a Chatbot Service☆3,307Updated last year
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆2,811Updated 6 months ago
- Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sour…☆2,640Updated 5 months ago
- Evaluation tool for LLM QA chains☆1,070Updated last year
- Alpaca dataset from Stanford, cleaned and curated☆1,540Updated last year
- 🦙 Integrating LLMs into structured NLP pipelines