09.11.2025 aktualisiert


100 % verfügbar
Cloud Data Engineer
RANDBURG, Südafrika
Weltweit
Über mich
Cloud-native Data Engineer (4+ yrs) skilled in building scalable, automated data pipelines across Azure, Snowflake & Databricks. Experienced in ML workflows, CI/CD, and transforming large datasets into production-ready insights. Passionate about real-time data & performance optimization.
Skills
Data AnalysisAutomatisierungMicrosoft AzureBusiness IntelligenceCloud ComputingInformation EngineeringDateninfrastrukturETLDatenvisualisierungDevopsGithubPythonMachine LearningPower BiDataOpsSQLWorkflowsYAMLData ScienceAzure Data FactorySnowflakeDeep LearningGitData LakePysparkScikit-learnData LineageEchtzeitdatenPlotlyDatenmanagementAzure Synapse AnalyticsErkennung von AnomalienAzure Resource ManagerDatabricks
Cloud Data Platforms
Azure Data Factory, Synapse, Databricks, Data Lake Gen2, and Snowflake expertise for scalable data infrastructure
Data Engineering & ETL
ETL/ELT processes, Delta Lake, PolyBase, DataOps, Data Lakehouse, and Data Lineage implementation
Machine Learning & Analytics
Python, SQL, PySpark, Scikit-Learn, PCA, Anomaly Detection, and Deep Learning for ML workflows
DevOps & Automation
Azure DevOps, CI/CD pipelines with YAML, GitHub Actions, Git, and ARM Templates
Data Visualization
PowerBI and Plotly for reporting and business intelligence dashboards
Cloud Certifications
Microsoft Azure Fundamentals, Azure Data Engineer Associate, and Snowflake Advanced Workshops
Data Science Specialization
IBM Data Science Professional certificate and Machine Learning Specialization
Real-time Data Processing
Event-driven ingestion, transformation, and real-time analytics frameworks
Azure Data Factory, Synapse, Databricks, Data Lake Gen2, and Snowflake expertise for scalable data infrastructure
Data Engineering & ETL
ETL/ELT processes, Delta Lake, PolyBase, DataOps, Data Lakehouse, and Data Lineage implementation
Machine Learning & Analytics
Python, SQL, PySpark, Scikit-Learn, PCA, Anomaly Detection, and Deep Learning for ML workflows
DevOps & Automation
Azure DevOps, CI/CD pipelines with YAML, GitHub Actions, Git, and ARM Templates
Data Visualization
PowerBI and Plotly for reporting and business intelligence dashboards
Cloud Certifications
Microsoft Azure Fundamentals, Azure Data Engineer Associate, and Snowflake Advanced Workshops
Data Science Specialization
IBM Data Science Professional certificate and Machine Learning Specialization
Real-time Data Processing
Event-driven ingestion, transformation, and real-time analytics frameworks
Sprachen
EnglishMutterspracheFrenchGrundkenntnisse
Projekthistorie
Designed and deployed scalable CI/CD pipelines for ADF and Synapse using YAML, GitHub Actions, and DevOps best practices. Refactored ETL architecture, migrating from Synapse to Snowflake & ADF, increasing data delivery speed 5x. Enabled anomaly detection ML pipelines for cloud cost forecasting using Scikit-learn and PySpark.
Built cloud-hosted ML pipelines for house price prediction with 94%+ model accuracy. Developed fingerprint biometric authentication system reducing access errors by 62%. Designed end-to-end data pipelines for insurance claim analytics, reducing processing time by 30%.
Built predictive maintenance models for mechanical failure modes using time-series analytics. Created real-time Power BI dashboards integrating Python forecasts for production reporting. Initiated enterprise-wide data infrastructure with standardized governance and access controls.