Updated 09.11.2025


Premium customer
40 % available
Senior Data Architect and Data Engineer
Berlin, Germany
Worldwide
Bachelor of Science in Computer Science
About me
Over 25 years of experience designing and delivering large-scale data architectures and strategies, analytics solutions, and cloud transformations, with extensive hands-on experience architecting and implementing large-scale data platforms in the cloud and on-premises.
Skills
Data & Analytics · Data Warehouse Architecture · Big Data Architecture · BI / Data Management · AWS (Amazon Web Services) · Oracle · MSSQL Server · MySQL · PostgreSQL · Data Engineering · Database Administration · Database Architect · Data Architecture · Spark Streaming/Kafka · Spark · Kinesis · Kafka · AWS Glue · Python · Data Warehouse Development · DWH Data Modeling · DWH Automation · DWH Expert · Sales Presentations · Presales Technical Support · Sales Engineering · Customer Engagement · Data Warehousing · Greenplum · Post-Sales Support · Data Infrastructure · Data Solution Architecture · Customer Service · Pre-Sales Technical Consulting · Data Migration · Data Visualization · Data Strategies · Technical Presentations · Data Modeling · Data Governance · Data Streaming · Cloud Migration · dbt · Apache Airflow · Snowflake · Azure Databricks · Databricks
• Senior Data Architect and Data Engineer with over 25 years of experience in designing and delivering large-scale data architectures and strategies, analytics solutions, cloud transformations and migrations.
• Extensive hands-on experience in architecting and implementing large-scale data platforms in cloud and on-premises.
• Strong proficiency in data architecture: Data Warehouse, Operational Data Store, Data Lake, Lakehouse, Data Mesh.
• Deep expertise in data modelling: Relational, Dimensional (Star, Snowflake, Galaxy), Data Vault, Data Marts, 3NF.
• Extensive experience with Massively Parallel Processing (MPP) and Distributed Data systems.
• Significant knowledge of and experience with Software-as-a-Service and Data-as-a-Service (SaaS/DaaS), AI/ML analytics, and Business Intelligence.
• High proficiency in designing and implementing high-performance ETL/ELT and streaming data pipelines.
• Strong background in Data Governance, Data Lineage, Data Quality and Data Security.
• Expert in optimizing and administering high-volume OLTP/OLAP databases in mission-critical 24x7 environments, improving performance, scalability, and availability.
Languages
German: business fluent
English: business fluent
Project history
Led end-to-end architecture, design, and delivery of enterprise data and analytics solutions across multiple customers, providing strategic advisory on cloud data platforms, data modernization, and advanced analytics. Architected large-scale AWS data platform modernization initiatives, implementing centralized medallion architectures, dimensional models, modular pipelines with Airflow/dbt, and Snowflake performance optimization. Directed major Oracle-to-AWS Redshift migration programs, designing cloud-native models, integrating S3-based data lakes, orchestrating automated processing with Lambda, and enabling multi-region, multi-cluster data sharing; additionally set up SageMaker-based ML environments integrated with DWH and Data Lake.
Delivered Azure-based BI and analytics platforms using Databricks, DLT, Debezium CDC pipelines, and centralized semantic models in Azure Analysis Services, while optimizing Spark performance and cluster efficiency. Enhanced enterprise Data Vault implementations through performance tuning, PIT/Bridge design, model refinement, and query optimization. Led MDM migration from IBM InfoSphere to Informatica CP4D, improving data quality, benchmark performance, and operational consistency.
Provided strategic architectural assessments, roadmaps, executive presentations, and cross-functional program leadership across cloud transformation and analytics initiatives.
Engaged part-time by strategic enterprise customers to manage end-to-end data architecture and solution design, with a focus on cloud migration, data platform modernization, transformation, and ML/AI analytics implementation. Provided strategic advice on cloud migration, performance tuning, and data engineering best practices.
• Private Cloud Data Platform Implementation: Architected and deployed a private cloud data platform on a VMware vSphere cluster, tailored to the customer’s infrastructure and AI/ML workload requirements. Designed and implemented a multi-tier reference architecture integrating a Greenplum MPP data warehouse for large-scale analytical processing, Apache Kafka for real-time event streaming, Kubernetes for orchestration of containerized data workloads, and Apache Solr for distributed text search and indexing. Developed real-time data ingestion and transformation pipelines using Kafka Connect and Schema Registry. Optimized Kafka on Kubernetes by tuning partitions, replication factors, and broker configurations. Implemented observability and monitoring with Prometheus/Grafana.
• Enterprise Data Warehouse Migration and Optimization: Led an Oracle Exadata-to-Greenplum cluster migration, rearchitected data models, and optimized storage for high-performance queries. Implemented RabbitMQ with Debezium for real-time change data capture (CDC) and streaming. Implemented a vector database to support generative AI and large language model (LLM) use cases for advanced data search and retrieval.
• Cloud Migration Proof of Concept: Designed and executed multi-cloud migration PoC, assessing AWS, Azure, and GCP for compatibility with enterprise data and analytics workloads. Defined success KPIs (e.g., data transfer throughput, query latency, storage performance, operational cost efficiency, and scalability benchmarks) to objectively assess each platform. Executed end-to-end data migration tests including bulk data transfer. Validated analytics and streaming/real-time processing for performance and integration with existing pipelines. Delivered architecture recommendations for full-scale adoption, multi-cloud integration patterns and data lake/data warehousing layer strategies.
• Data Platform Modernization and Advisory: Assessed legacy on-premises data infrastructure and designed modern, cloud-native data platforms using Greenplum MPP DWH and containerized microservices (Kubernetes). Advised on scalability, disaster recovery, and high-availability architectures.
Skills: Data Warehousing · Big Data · Data Modeling · Sales Presentations · VMware vSphere · RabbitMQ · Apache Kafka · Data Governance · Data Architecture · Data Warehouse Architecture · Data Migration · Data Engineering · Kubernetes · Pre-Sales Technical Consulting · Data Visualization · Data Strategies · Technical Presentations · Data Solution Architecture · Customer Support · Cloud Migration · Greenplum · Pre-Sales Support · Data Security · Data Streaming · Data Infrastructure · Post-Sales Support · Sales · Customer Engagement · PostgreSQL · Data Quality · Sales Engineering · Presales Technical Support
Architected and implemented an AWS-based data warehouse and integrated real-time data streaming with Apache Kafka and Amazon Aurora (PostgreSQL). Designed and implemented ETL pipelines using AWS Glue, Spark, and PySpark. Created a data lake and data warehouse with Amazon Redshift/Spectrum and Amazon S3. Created master data management and a data dictionary using the AWS Glue Data Catalog. Implemented data governance policies, security protocols, and access controls.
Skills: Data Warehousing · Amazon Aurora · Amazon Web Services (AWS) · Data Modeling · Apache Kafka · Business Intelligence (BI) · Data Analytics · Data Architecture · Data Migration · Data Visualization · AWS Glue · Data Streaming · Amazon Redshift · Apache Spark · Data Infrastructure
Certificates
Administering Microsoft SQL Server 2012
TÜV Rheinland Group, 2014
Oracle Certified Professional (DBA)
Oracle, 2010