saurabhaloneai / image-cap
image captioninggg
☆11 · Updated 10 months ago
Alternatives and similar repositories for image-cap
Users that are interested in image-cap are comparing it to the libraries listed below
- a tiny vectorstore implementation built with numpy. ☆62 · Updated last year
- alternative way of calculating self-attention ☆18 · Updated last year
- In this repository I have code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :) ☆24 · Updated 8 months ago
- My solutions for Advanced Python Mastery (course by @dabeaz) ☆11 · Updated last year
- Build Agentic workflows with function calling using open LLMs ☆28 · Updated last week
- Verbosity control for AI agents ☆64 · Updated last year
- Andrej Karpathy's micrograd implemented in C ☆29 · Updated 11 months ago
- Hub for researchers exploring VLMs and Multimodal Learning :) ☆41 · Updated this week
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆81 · Updated last year
- I learn about and explain quantization ☆26 · Updated last year
- Tool to take your ML model from local to production with one line of code. ☆25 · Updated last year
- Apps that run on modal.com ☆12 · Updated last week
- Rust Implementation of micrograd ☆52 · Updated last year
- BH hackathon ☆14 · Updated last year
- ☆38 · Updated 4 months ago
- Testing paligemma2 finetuning on reasoning dataset ☆18 · Updated 6 months ago
- In-depth exploration of LLMs and VLMs (notes) ☆12 · Updated 10 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆81 · Updated 2 months ago
- Quick Notebook Tutorials ☆32 · Updated 5 months ago
- Cerule - A Tiny Mighty Vision Model ☆66 · Updated 10 months ago
- rl from zero pretrain, can it be done? we'll see. ☆65 · Updated 3 weeks ago
- ☆74 · Updated 9 months ago
- Experimentation on google's gemma model ☆16 · Updated last year
- ☆46 · Updated 3 months ago
- Simple orchestration for EC2 spot containers ☆19 · Updated 9 months ago
- Multimodal AI workloads: batch inference, model training and online serving. ☆22 · Updated 3 weeks ago
- Repository containing awesome resources regarding Hugging Face tooling. ☆47 · Updated last year
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames ☆24 · Updated last year
- zero-to-lightning ☆29 · Updated last year
- An introduction to LLM Sampling ☆79 · Updated 7 months ago