mitcommlab / Coding-Documentation
☆20 Updated 6 years ago
Alternatives and similar repositories for Coding-Documentation
Users who are interested in Coding-Documentation are comparing it to the repositories listed below.
- Step by step development of a streaming pipeline in Python ☆13 Updated 2 years ago
- ☆52 Updated 3 weeks ago
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer ☆280 Updated 6 months ago
- ☆15 Updated 10 months ago
- Your opinionated Python SDMX library ☆19 Updated this week
- Demo of Streamlit application with Databricks SQL Endpoint ☆35 Updated 3 years ago
- Public facing work samples for technical hiring assessment ☆20 Updated 2 years ago
- A complete pipeline to pull data from Scryfall's "Magic: The Gathering"-API, via Prefect orchestration and dbt transformation. ☆43 Updated 2 years ago
- 📚 Process PDFs, Word documents and more with spaCy ☆847 Updated 11 months ago
- Binder to the cosmograph visual analytics for big graphs ☆136 Updated last week
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. ☆1,641 Updated 9 months ago
- ☆193 Updated 4 years ago
- Proxy solution to run elegant Web UIs or interact with LLMs natively inside databricks notebooks. ☆29 Updated last year
- Data Wrangler extension for Visual Studio Code ☆573 Updated 2 months ago
- Python client for Trino ☆411 Updated 4 months ago
- ☆35 Updated 6 years ago
- Apache Airflow Best Practices, published by Packt ☆50 Updated last year
- A curated list of awesome DataOps tools ☆225 Updated last month
- A Streamlit Graph Vis ☆475 Updated 2 weeks ago
- csv and flat-file sniffer built in Rust. ☆45 Updated 2 years ago
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits ☆140 Updated 2 years ago
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more! ☆628 Updated last week
- From Pandas Dataframe To SQL Table using Psycopg2 ☆61 Updated 3 years ago
- Create and manipulate Tableau Hyper files from Apache Spark DataFrames and Spark SQL ☆31 Updated last month
- Use AWS Lambda to Pull E-Scooter and E-Bike Location Data, store in S3 & Redshift using Data Vault Data Model, Server to Google Data Stud… ☆16 Updated 3 years ago
- A Python vector database you just need - no more, no less. ☆642 Updated last year
- MLOps Deploy Solutions with Rust ☆38 Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra ☆144 Updated 2 years ago
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,… ☆47 Updated last year
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a… ☆35 Updated last month