Loading
Patryk Sitarek

Data Engineer

Machine Learning

Data Analysis

Data QA

  • Bio
  • Resume
  • Skills
  • About Me
  • Contact
Patryk Sitarek

Data Engineer

Machine Learning

Data Analysis

Data QA

Resume
Hello, I’m Patryk Data Engineer

I will shape your data into insights.

Located in Poland, work for clients from all over the world.

With over 3 years of experience in data engineering and working with data. Skilled in data extraction, transformation and analysis, as well as building data processing workflows. Additional expertise in Machine Learning and Data QA.

Successfully contributed to projects for both local and Global 500 companies. This diverse experience has honed the ability to adapt to different industries and deliver tailored and scalable solutions that drive data-driven decision-making at every level.

3+

Years of
Experience

9

Completed
Projects

5

Global
Clients

Resume

Experience
2024 - Present
Data Engineer
Warsaw, Accenture

Design and implement end-to-end ETL processes for BI reporting using Microsoft Fabric and Azure cloud. Enabling Fabric capabilities to users. Developing GenAI applications.

2022 - 2023
Junior QA Data Engineer
Warsaw, BitPeak

Planning and executing data validation processes. Building ETL processes for BI reporting using Azure cloud.

2016
IT Internships
Leipzig, Vitalis GmbH

Internship, during which I was responsible for administering operating systems, database maintenance and managing the internet network.

key Projects
2025
Fabric Costs & Utilization Reporting
Fast-Moving Consumer Goods
I worked on a BI reporting product on the use of Power BI/Fabric platform resources and the resulting costs generated by individual business units.
In addition, we provided formal recommendations and tools to support effective platform management, and introduced new capabilities for the Fabric platform.

The benefits of the project are:
  • Tools providing information on resources used and costs generated
  • New Fabric capabilities enablement
  • Platform management support
2024
Migration to Microsoft Fabric and further development
Fast-Moving Consumer Goods

I worked on migration of a data product for international leader in its industry. The product is a model for tracking user adoption and engagement with Power BI initiatives and products across the organization. On further stages the model has been enriched with data related to the use of Power BI resources.


The benefits of the project are:

  • Increased performance:
    • Power BI reports responsiveness with Direct Lake
    • Processing time reduced from 8 h to 1 h
    • Data availability improved from T–2 days to T–1 day
  • Improved error and fault tolerance
  • Significantly increased data reliability
  • Model enriched with detailed insights about resources used and costs

The solution is fully based on Microsoft Fabric platform, using data pipelines, Python notebooks, data warehouse and direct lake semantic model with Power BI reports.

2024
Recommendation algorithm for supply chain management
Fast-Moving Consumer Goods

The project involved implementing a recommendation algorithm for order splitting (knapsack problem) to optimally divide the order based on product parameters and available transport fleet. The algorithm is used on the wholesale platform of a leading FMCG company worldwide.


The benefits of the project are:

  • Automatic recommendations for freight forwarders that replaced their manual work
  • Optimized transportation resources, reducing shipping costs

The solution is based on knapsack problem algorithm implemented in Python and deployed using Azure Machine Learning.

2024
GenAI-based application for customer service support and automatic review

Preparing a template for an application based on GenAI services, which, based on a chat or telephone conversation, prepares a transcription, a summary, verifies and evaluates the content of the conversation based on guidelines and proposes further actions.

The solution is based on Python with microservices architecture and Azure Cognitive Services.

2023
Building a data warehouse and BI reporting solution for sales data
Payments

I started the project as a Data QA Specialist, but from the very beginning I was fascinated by the role of a Data Engineer. In addition to QA tasks, I built data pipelines and helped with data modeling, eventually becoming the lead Data Engineer.
The product was a data warehouse built in a medallion architecture, designed to enable efficient processing of sales data from sources with limited availability while maintaining high scalability.


The benefits of the project are:

  • Unified data warehouse with aggregated data related to the company's core business. Power BI reports that analyze sales of key products.
  • Instant analysis of sales data that allows to respond to key products and customers.
  • Enabling the building of custom reports based on the data model.

The solution was based on Azure Data Factory and Synapse Analytics services, and serverless computing.

2022
Building a data warehouse for SAF-T reporting
Healthcare

It was my first professional commercial project and first experience with cloud computing. Worked as Data QA, responsible for validating data in the model, ETL processes, and automated validation rules within daily processing.


The benefits of the project are:

  • Centralized data warehouse for Standard Audit File for Tax (SAF-T) reporting
  • Reduced hundreds of hours of manual work by people to a few hours of result verification
  • Minimized the number of errors that required subsequent correction

The technology stack was Azure Cloud: Data Factory, Synapse Analytics with Delta Lake, and SQL Server. Most of the code is Python notebooks and stored procedures.

Certifications
Dec 2024
Microsoft DP-600: Fabric Analytics Engineer Associate
Microsoft Fabric

The certification validates my skills to design, build, deploy, and maintain end-to-end enterprise analytics solutions using Microsoft Fabric and Power BI, including data preparation, semantic modelling, and securing analytics assets.

Sep 2023
Microsoft AI-900: Azure AI Fundamentals
Microsoft Azure

The certification demonstrates foundational knowledge of artificial intelligence and machine learning concepts, and how to apply them using Microsoft Azure services.

Jan 2023
ISTQB® Certified Tester Foundation Level
Testing

The certification validates that an individual has a solid understanding of core software testing principles, terminology, lifecycle models, test techniques & tools, and test management.

Education
Master of Micro- and Nanotechnology
2021 - 2023
Master of Micro- and Nanotechnology
University of Silesia

Master of Science in Micro- and Nanotechnology. Minors in machine learning and electronic.

Bechelor of Computer Science
2017 - 2021
Bechelor of Computer Science
University of Silesia

Bechelor of Engineering in Computer Science. Minors in algorithms and data structures, database management and machine learning.

2013 - 2017
Technician of Computer Science
ZSOT in Lubliniec

My education in the industry began at a technical school. Today, I don't use any of the tools I was taught there, but I appreciate the foundational knowledge and paradigms I learned there.

Skills

Python 3
Python 3
90%
T-SQL, SQL Server
T-SQL, SQL Server
80%
Data Transformation
Data Transformation
75%
Data Pipelines
Data Pipelines
90%
Data Analysis
Data Analysis
75%
Machine Learning
Machine Learning
75%
Cloud Computing
Cloud Computing
75%
Data QA
Data QA
75%
Tools & Platforms
  • Microsoft Azure Cloud
    80%
  • Microsoft Fabric
    90%
  • Azure Data Factory
    90%
  • Azure Synapse Analytics
    75%
  • Azure Data Lake Storage Gen2
    80%
  • Delta Lake
    90%
  • Spark
    80%
  • Azure Machine Learning
    25%
  • Power BI
    25%
  • Pandas
    75%
  • NumPy
    75%
  • Keras
    50%
  • SciKit-Learn
    50%
  • OpenAI
    75%
  • GitHub
    75%
  • Azure DevOps
    75%
  • Jira
    60%
  • Confluence
    60%
  • Visual Studio Code
    80%
  • SQL Server Management Studio
    50%
Languages
  • English
  • German
  • Polish
Soft skills
  • Scrum & Agile
  • CI/CD
  • Cross-Functional Collaboration
  • Critical thinking
  • Communication
  • Business analysis
  • Team work organization

About

This section does not contain specific information, but I encourage you to read it if you want to get to know me better. I try to explain how I got to where I am.

How did I start?

I am 28 years old and grew up in Dobrodzień, a small town in southern Poland. As a child, I used to service computers and boost their performance to be able to play my favorite games. That’s how I ended up in a technical high school with an IT specialization, where for the first time I encountered programming and web development. I think I was good at it, but it wasn’t my passion. During that time I did an internship abroad in Germany, and during summer breaks I went to Munich, which allowed me to learn German and earn some money.

Studies

I pursued engineering studies, where I rediscovered my passion for computer science. Looking back, I realize I was naturally gravitating toward specializations focused on data. I dedicated significant effort to learning Python, applying it to algorithms and data structures, and developed strong skills in databases and SQL. During my master’s studies, I specialized in machine learning. For my master’s thesis, I built custom smog measurement stations and trained models to programmatically replace the heater, enhancing the accuracy of smog measurements - a critical environmental issue in Poland.

Job

Even during my master’s studies, I started my first commercial job as a Data QA. Very quickly, I realized that this role was not for me. Nevertheless, it allowed me to explore various job roles in the data field, including data engineering. Already on my second project, I began transitioning into a data engineer role. Without explicit consent, I simply took on other tasks besides QA that interested me more. That is how I became a data engineer. After several months, I was the lead data engineer on the project, although I also continued to work on data quality assurance projects within the company until the end.

After work

I have a group of close friends with whom I spend time, but after hours spent in front of the computer, I find fulfilment in sport. Most often it is road cycling. Although I try to set new goals and improve my results, I mainly ride for recreation. You can find me on the trails in my area, but I also ride in other interesting places in Poland and Europe. Besides, you can find me in the mountains, both on foot and on skis.

CONTACT

Let's make your project brilliant!

LinkedIn
/in/sitarek/
patryk.sitarek@interia.pl
Address
Katowice, Poland

© 2025 RyanCV