Skip to content
View mgorsk1's full-sized avatar
  • Warsaw, Poland

Block or report mgorsk1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mgorsk1/README.md

Hi, I'm Mariusz✌🏻

~ whoami

I work as a Tech Lead in ING Advanced Analytics, where I am a part of a team contributing to Data Analytics Platform.

My current area of influence is within the data cataloging, discoverability and lineage.

In the past I was heavily involved in:

  • Writing and extending frameworks for data ingestion
  • Writing and extending frameworks for data quality / profiling

I have strong experience in distributed systems, leveraging modern technologies such as:

  • Kubernetes
  • Apache Airflow
  • Apache Spark
  • Apache Superset
  • Confluent Kafka
  • Elastic Stack (formely ELK)

Certifications

I hold following certificates:

  • Google Cloud Professional Cloud Architect (PCA)
  • Google Cloud Associate Engineer (AE)
  • Kubernetes Certified Application Developer (CKAD)

Conducting trainings πŸŽ“

I also am a trainer for a Polish training company Sages where I am responsible for conducting Elastic Stack and Spark related trainings. So far I've conducted 30+ trainings for over 250 people.

Open Source

My experience revolves mostly around Open Source technologies, towards which I have a strong fondness. I am proud to be a contributor/maintainer for:

  • πŸ” Amundsen (LF AI) /maintainer/ - a data discovery and metadata engine for improving the productivity of data analysts, data scientists and engineers when interacting with data
  • πŸ” OpenMetadata /contributor/ - an all-in-one platform for data discovery, data lineage, data quality, observability, governance, and team collaboration
  • 🌬️ Apache Airflow /contributor/ - a platform to programmatically author, schedule and monitor workflows
  • 🌍 Apache Atlas /contributor/ - a metadata governance framework

Activity

Medium Stories

Although rather seldom, I sometimes write medium stories:

My personal projects

My hobbies

  • β˜• Coffee
  • 🚴 Cycling
  • πŸ”΄ Snooker

Find me elsewhere

Pinned Loading

  1. garbage-detector-app garbage-detector-app Public

    Jupyter Notebook 3

  2. snooker snooker Public

    snooker is a Python package providing thin wrapper over publicly available API for retrieving Snooker statistics.

    Python 3 1

  3. brryle brryle Public

    A simple search engine demonstrating full-text search capabilities of Elasticsearch.

    JavaScript 1

  4. amundsen amundsen Public

    Forked from amundsen-io/amundsen

    Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data – πŸ—ƒπŸ•΅οΈβ€β™€οΈ

    Smarty

  5. pw-bigdata-project-python pw-bigdata-project-python Public

    Jupyter Notebook

  6. pw-bigdata-project-scala pw-bigdata-project-scala Public

    Streaming

    Python