Solving data for LLMs - Create quality synthetic datasets!
☆151 · Jan 20, 2025 · Updated last year
Alternatives and similar repositories for dataformer
Users that are interested in dataformer are comparing it to the libraries listed below.
- Small, simple agent task environments for training and evaluation ☆19 · Nov 1, 2024 · Updated last year
- ☆31 · Jan 18, 2025 · Updated last year
- A reading list on LLM based Synthetic Data Generation 🔥 ☆1,537 · Jun 5, 2025 · Updated 11 months ago
- ☆32 · Jul 5, 2024 · Updated last year
- Opensource, personal & local chat interface for language models. ☆13 · Jun 24, 2024 · Updated last year
- PyTorch Implementation of Context-Aware Sequential Model for Multi-Behaviour Recommendation https://arxiv.org/abs/2312.09684 ☆10 · May 31, 2024 · Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆53 · Oct 4, 2024 · Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳 ☆356 · Jun 2, 2025 · Updated 11 months ago
- Using modal.com to process FineWeb-edu data ☆20 · Apr 11, 2026 · Updated last month
- awesome synthetic (text) datasets ☆332 · Jan 8, 2026 · Updated 4 months ago
- ☆10 · Oct 24, 2024 · Updated last year
- OO for LLMs ☆911 · Updated this week
- A toolkit for building computer use AI agents ☆194 · Jun 26, 2025 · Updated 10 months ago
- An attribution library for LLMs ☆46 · Sep 17, 2024 · Updated last year
- ☆18 · Mar 10, 2026 · Updated 2 months ago
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation ☆15 · Apr 23, 2025 · Updated last year
- ☆162 · Dec 2, 2024 · Updated last year
- the framework/ sdk that lets you build browser controlling agents in 3 lines of code. join chat @ https://discord.gg/umgnyQU2K8 ☆570 · Oct 10, 2024 · Updated last year
- 🤖 Headless IDE for AI agents ☆204 · Oct 9, 2025 · Updated 7 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆486 · Sep 27, 2024 · Updated last year
- Manipulating Python Programs ☆711 · Jan 14, 2026 · Updated 4 months ago
- Simple orchestration for EC2 spot containers ☆19 · Sep 27, 2024 · Updated last year
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama. ☆132 · Oct 16, 2024 · Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆16 · Jun 16, 2024 · Updated last year
- ☆68 · May 23, 2025 · Updated 11 months ago
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral ☆17 · Jan 12, 2025 · Updated last year
- ☆16 · Apr 30, 2024 · Updated 2 years ago
- Waffer-thin FlaskGPT on Vercel. ☆12 · Jun 1, 2023 · Updated 2 years ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆847 · Jan 28, 2025 · Updated last year
- XmodelLM ☆38 · Nov 19, 2024 · Updated last year
- An AI character interaction system with emotional modeling and advanced memory management ☆17 · Oct 26, 2024 · Updated last year
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤 ☆1,111 · Feb 2, 2025 · Updated last year
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip. ☆17 · Feb 5, 2024 · Updated 2 years ago
- ☆17 · Mar 22, 2024 · Updated 2 years ago
- Flexible and powerful multi-agent AI framework ☆406 · Mar 26, 2026 · Updated last month
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆451 · Feb 13, 2024 · Updated 2 years ago
- An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natur… ☆521 · Dec 20, 2024 · Updated last year
- Makes it easy to use altair from FastHTML ☆28 · Oct 9, 2024 · Updated last year