![Paweł Mitruś, Developer in Warsaw, Poland](http://assets.toptal.io/images?url=http%3A%2F%2Fbs-uploads.toptal.io%2Fblackfish-uploads%2Ftalent%2F891013%2Fpicture%2Foptimized%2Fhuge_436df152e53196062e3adab82f4070bf-b6d0c8320099240ee6d3f233f6a63b73.jpg&width=524)
Paweł Mitruś
Verified Expert in Engineering
Data Architect and Developer
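Paweł is a data engineer and architect with many years of experience building data platforms using a variety of technologies, including Azure and Microsoft. Apart from traditional ETLs, data lakes, and data warehouses, he is also proficient in various business intelligence tools and services. For the past few years, Paweł has focused on cloud projects, sourcing from both on-premise and cloud locations. Recently, Paweł has been serving as the lead architect on a major data mesh implementation.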
Preferred Environment
Azure, Databricks, SQL, PySpark, Azure Data Factory, Microsoft Power BI, Azure SQL, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), SQL Server BI, Azure Analysis Services
The most amazing...
...role was lead architect of a data mesh project that involved more than 40 developers and 20 different domain teams being integrated into the platform.
Work Experience
Solution Architect
Lingaro
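- Led a team of 6-8 tech leads to design and develop a data mesh platform that consisted of several microservices; also helped plan automation in the context of CI/CD.
- Delivered about 20 different training sessions (at internal and external conferences) on best practices and anti-patterns for the Databricks platform, aimed at improving participants' skills.
- Designed and developed a custom ETL framework with a WYSIWYG editor that non-developers can use to onboard their own ETL pipelines in a self-service manner. The framework is similar to ADF Data Flows and is likewise executed on Databricks.
- Helped optimize the performance of Spark applications by applying best practices and reducing future issues.
- Performed multiple Azure Monitor analyses aimed at finding misused services, e.g., in big data batch processing, knowing what the ratio of several markers should look like and analyzing it, which resulted in $200,000 in savings.
- Consulted on multiple "traditional" data lake, data warehouse (DWH), and online analytical processing (OLAP) projects and helped plan the architecture for specific sets of requirements and set up and configure the environment (Azure).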
Freelance Lead Analytics Developer and Product Designer
Azum
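- Designed monitoring and analytics features for sports activities uploaded from user devices to the Azum platform.
- Described and helped developers understand how FIT, TCX, and GPX files containing activity details should be processed and interpreted.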
- Helped to organize the process of gathering requirements, specifying them, and handing them over to the development team in a Scrum manner.
Solution Architect
ITMAGINATION
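- Led several teams of 11-15 developers, as a solution architect, across different projects, successfully delivering more than 10 data analytics platforms with over 500 end users in total.
- Planned and executed a major migration of a BI platform from SQL Server 2008 R2 to 2016; the platform comprised 15 different areas.
- Optimized a data warehouse refresh from 12 to four hours, mainly by applying proper data structures and indexes, as well as partitioned tables.
- Implemented a data quality panel in an existing SSIS framework that collected information about read/inserted rows in order to track row counts through the different data layers (staging, data warehouse, and semantic).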
Data Developer
ITMAGINATION
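- Helped design data warehouse star schemas and fact and dimension tables (Ralph Kimball) by analyzing client requirements together with teams and individuals.
- Built and released data warehouse (DWH) and business intelligence (BI) projects that included integration with SSIS, a data warehouse hosted on SQL Server 2012-2016, an OLAP database as SSAS (multidimensional and tabular), and reports in SSRS.
- Developed an MDM system based on SQL Server 2012 MDS, including training data stewards (on the client side) on how to use the app and Excel forms.
- Delivered a couple of training sessions regarding PowerQuery, PowerPivot, PowerReport, and proficient use of Microsoft Excel pivot tables (self-service BI).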
Experience
Data Mesh
Technology Stack: Azure, Databricks (Python), Airflow, Azure SQL, Azure Data Lake Gen2, App Services
Azure Data Analytics Platform
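My role mainly involved architecture consulting and helping plan the implementation. I also helped resolve performance issues and tune cloud utilization to lower overall costs.
Technology Stack: Azure, Data Factory, Databricks, Azure SQL, Azure SQL Data Warehouse (Synapse), Azure Data Lake Gen2, Event Hub, Azure Analysis Services, Power BI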
Global Business Intelligence
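The development lasted more than two years and involved 5-7 developers. We implemented the ETL in batch mode once a day so that users could access the data warehouse (DWH), OLAP database, or predefined reports. Due to the immaturity of the Azure PaaS services, we decided to host the solution mostly on VMs (IaaS).
Technology Stack: Azure, MS SQL Server 2016 (SSIS, SSRS), Azure Analysis Services, Power BI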
Education
Engineer's Degree in Computer Science
Warsaw University of Technology - Poland, Warsaw
Certifications
Azure Solutions Architect
Microsoft
Agile PM
APMG International
Professional Scrum Master 1 (PSM1)
Scrum.org
Microsoft Certified Professional
Microsoft
Skills
Libraries/APIs
PySpark, REST APIs
Tools
SQL Server BI, Microsoft Power BI, Visual Studio, Azure App Service, Azure Logic Apps
Languages
SQL, T-SQL (Transact-SQL), Python
Platforms
Azure, Databricks, Azure SQL Data Warehouse, Visual Studio Code (VS Code), Dedicated SQL Pool (formerly SQL DW), Azure Event Hubs
Paradigms
ETL, Scrum, Agile, Kimball Methodology, Azure DevOps, DSDM
Storage
Azure SQL, Data Lakes, Data Pipelines, SQL Server Analysis Services (SSAS), SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), SQL Server DBA, JSON
Frameworks
Apache Spark, Django
Other
Azure Data Factory, Architecture, Cloud, Data Engineering, Data Modeling, Data Architecture, Azure Analysis Services, Azure Data Lake, Domain-driven Design (DDD), Cloud Infrastructure, Azure Resource Manager (ARM), Big Data, Data Analytics, Distributed Systems, Azure Virtual Machines, Data Mesh