AI4Bharat / NeMo
☆23 Updated 8 months ago
Alternatives and similar repositories for NeMo
Users who are interested in NeMo are comparing it to the libraries listed below.
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- Orpheus TTS Server with streaming support (TTFB ~160ms) ☆22 Updated 4 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆20 Updated 3 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆85 Updated last year
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work. ☆33 Updated last year
- ☆21 Updated last year
- Python client SDK for Ultravox. ☆16 Updated last month
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆21 Updated last year
- AI-augmented, conversational information retrieval and data exploration ☆37 Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆24 Updated 2 years ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆52 Updated last year
- Build Agentic workflows with function calling using open LLMs ☆28 Updated this week
- Notebooks using the Neural Magic libraries 📓 ☆39 Updated last year
- Roboflow's inference server to analyze video streams. This project extracts insights from video frames at defined intervals and generates… ☆13 Updated last year
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu… ☆38 Updated 2 years ago
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆279 Updated 6 months ago
- kokoro text to speech using javascript ☆63 Updated last year
- Code for paper https://arxiv.org/abs/2501.00522 ☆14 Updated 9 months ago
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- Multimodal AI App using Llava 7B and Gradio. ☆39 Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ☆46 Updated last year
- Own your AI, search the web with it 🌐😎 ☆94 Updated last year
- A project that enables identification and classification of an intent of a message with dynamic labels ☆50 Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated last year
- ☆101 Updated 3 weeks ago
- Embed anything. ☆27 Updated last year
- A truly open version of gpt-oss which shows the entire pre-training from scratch ☆85 Updated 5 months ago
- faster-whisper as serverless endpoint ☆126 Updated 2 months ago
- ☆100 Updated 8 months ago