BhabhaAI / dataformer
Solving data for LLMs - Create quality synthetic datasets!
★145 · Updated 2 months ago
Alternatives and similar repositories for dataformer:
Users who are interested in dataformer are comparing it to the libraries listed below:
- Routing on Random Forest (RoRF) ★136 · Updated 6 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB ★119 · Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ★264 · Updated 3 months ago
- ★85 · Updated 6 months ago
- A user interface for DSPy ★142 · Updated 5 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ★56 · Updated 2 weeks ago
- model activation visualiser ★90 · Updated this week
- ★99 · Updated 7 months ago
- ★208 · Updated 9 months ago
- Finetune Llama-3-8b on the MathInstruct dataset ★108 · Updated 5 months ago
- ★150 · Updated 4 months ago
- ★76 · Updated 9 months ago
- ★144 · Updated 3 weeks ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ★224 · Updated 11 months ago
- Fast parallel LLM inference for MLX ★177 · Updated 8 months ago
- Prompt design in Python ★55 · Updated 4 months ago
- ★111 · Updated 3 months ago
- ⚖️ Awesome LLM Judges ⚖️ ★87 · Updated last month
- ★84 · Updated 2 months ago
- Simple examples using Argilla tools to build AI ★53 · Updated 4 months ago
- Chat bot with LLM and fact reference, backed by RAG (Retrieval-Augmented Generation) and LangChain ★128 · Updated 10 months ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d… ★141 · Updated 11 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd… ★97 · Updated 2 months ago
- XTR/WARP is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ★121 · Updated 5 months ago
- Train your own SOTA deductive reasoning model ★81 · Updated 3 weeks ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ★59 · Updated 11 months ago
- A list of AI memory projects ★85 · Updated 2 months ago
- A Lightweight Library for AI Observability ★238 · Updated last month
- An automated tool for discovering insights from research paper corpora ★137 · Updated 9 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents ★107 · Updated this week