AI4Bharat / NeMo
☆19 Updated 2 months ago
Alternatives and similar repositories for NeMo
Users that are interested in NeMo are comparing it to the libraries listed below
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆102 Updated 7 months ago
- Fine tune Gemma 3 on an object detection task ☆74 Updated 3 weeks ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆82 Updated last year
- Build Agentic workflows with function calling using open LLMs ☆28 Updated last week
- Flask-based web application designed to compare text and image embeddings using the CLIP model. ☆22 Updated last year
- Roboflow's inference server to analyze video streams. This project extracts insights from video frames at defined intervals and generates… ☆14 Updated last year
- Python client SDK for Ultravox. ☆15 Updated 4 months ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu… ☆34 Updated last year
- ☆28 Updated 3 weeks ago
- Web Interface for Vision Language Models Including InternVLM2 ☆22 Updated last year
- Faster Whisper with additional features ☆46 Updated 5 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ☆45 Updated last year
- An integration of Segment Anything Model, Molmo, and Whisper to segment objects using voice and natural language. ☆28 Updated 5 months ago
- Multimodal AI App using Llava 7B and Gradio. ☆40 Updated last year
- ☆21 Updated 9 months ago
- faster-whisper as serverless endpoint ☆112 Updated 2 months ago
- Composition of Multimodal Language Models From Scratch ☆15 Updated 11 months ago
- The Swarm Ecosystem ☆22 Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆18 Updated 2 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆21 Updated 10 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆24 Updated last year
- A python command-line tool to download & manage MLX AI models from Hugging Face. ☆18 Updated 11 months ago
- API Server for Transformer Lab ☆69 Updated this week
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS) ☆16 Updated 7 months ago
- Retrieval-Augmented Generation (RAG) over a Large Language Model (LLM) For PDF data extraction ☆27 Updated last year
- Fork of "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆13 Updated 8 months ago
- The next evolution of Agents ☆47 Updated 3 weeks ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆13 Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers ☆158 Updated this week
- Streamlit app presented to the Streamlit LLMs Hackathon September 23 ☆16 Updated last year