🐱
Paws deep in data!
  • Zalando
  • Berlin, Germany

shb8086/README.md

Data is only as good as what you do with it

Medium LinkedIn

Who Am I?

  • I transform data with dbt, SQL, Python, Scala and Spark
  • I live in the clouds: AWS, GCP, Databricks, Snowflake
  • I orchestrate workflows with Airflow and Prefect
  • I load data reliably from raw sources into the warehouse with dlt, Fivetran, and Stitch
  • I build reliable lakehouses on Delta Lake and Apache Iceberg
  • I design data models that scale: Kimball, Data Vault, Medallion, SCD Types

Engineering

  • I build source → staging → intermediate → mart layers with clear contracts between them
  • I define metrics once and trust them everywhere using the dbt Semantic Layer and MetricFlow
  • I write tests because data quality matters: schema tests, custom tests, Elementary, Anomalo
  • I track column-level data lineage so nobody asks "where does this number come from?"
  • I treat analytics code like software: CI/CD, versioned transformations, structured code review
  • I write PR descriptions that explain the why, the impact, and what to watch out for, not just the what
  • I document models, sources, and exposures so teams don't need me to answer their questions
  • I enforce data contracts between producers and consumers to catch breaking changes early
  • I catalog and govern data with DataHub, Collibra, and Unity Catalog so data is discoverable and trusted
  • I optimize until things run fast and cost less: query profiling, clustering, incremental models
  • I experiment with DuckDB and ClickHouse for fast analytics
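
To make the data-contract bullet above concrete, here is a minimal sketch of a schema check between a producer and a consumer. It uses only the Python standard library. `Column`, `breaking_changes`, and the example column names are hypothetical and chosen for illustration; this is not dbt's model contract feature or any other tool's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    """One column in a schema (hypothetical structure for this sketch)."""
    name: str
    dtype: str
    nullable: bool = True


def breaking_changes(contract, actual):
    """Return human-readable violations of `contract` by the `actual` schema.

    Additive changes (extra columns in `actual`) are treated as non-breaking.
    """
    actual_by_name = {col.name: col for col in actual}
    problems = []
    for expected in contract:
        found = actual_by_name.get(expected.name)
        if found is None:
            problems.append(f"missing column: {expected.name}")
        elif found.dtype != expected.dtype:
            problems.append(
                f"type changed: {expected.name} {expected.dtype} -> {found.dtype}"
            )
        elif found.nullable and not expected.nullable:
            problems.append(f"became nullable: {expected.name}")
    return problems


# What the consumer was promised (assumed example).
contract = [
    Column("order_id", "bigint", nullable=False),
    Column("amount", "decimal(12,2)"),
]

# What the producer actually shipped (assumed example).
produced = [
    Column("order_id", "string", nullable=False),
    Column("currency", "string"),
]

violations = breaking_changes(contract, produced)
for violation in violations:
    print(violation)
```

A check like this could run in CI before a producer change is merged, so a renamed or retyped column fails the build instead of a downstream dashboard.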

Output

  • I make dashboards that people actually use: Looker, Tableau, Superset
  • I monitor everything with Grafana, Elementary, Anomalo
  • I translate business needs into data solutions
  • I do graphic design on the side (Adobe suite enthusiast)

Popular repositories

  1. paper_localizatin_unet (Public)

    Master Thesis

    Python

  2. banking_system (Public)

    A simple console-based banking system implemented in C++.

    C++

  3. breast_cancer_localization (Public)

    Breast Cancer Segmentation and Classification with UNet

    Jupyter Notebook

  4. image_captioning (Public)

    Jupyter Notebook

  5. shb8086 (Public)

  6. dbt-core (Public)

    Forked from dbt-labs/dbt-core

    dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

    Python