06.09.2022 aktualisiert

**** ******** ****
100 % verfügbar

Research Scientist | Teaching | Deep Learning | Machine Learning |Speaker

Heidelberg, Deutschland
Weltweit
Masters in Cloud Computing
Heidelberg, Deutschland
Weltweit
Masters in Cloud Computing

Profilanlagen

CV
Stanford University ML
AI awards sponsored by Microsoft and SAP
AWS certification
Cloudera Certification

Skills

XGBoost, Deep Learning, Neural Network, Stacked Ensemble model, Hyperparameter, Machine Learning, NLP, Random Forest, DT, GBM, Ensemble, Grid Search, DFS, Pandas, Numpy, Matpolitlib, Plotly, Scikit, Spacy, Pytorch, Tensorflow, ScikitLearn, JAVA, Python, UNIX, Bash Scripting, R, Big Data, Hadoop, Spark, Map-Reduce, HBase, Hive, Sqoop, Impala, NoSQL, MapR, Cloudera, Cloud, AWS, IBM Blumix, MySQL, Oracle, Version Control, GIT, Gradle, IDEs, Jupyter Notebook, VS Code, PyCharm, Docker, Tableu, Windows, Linux, Mac OS, Scikit-learn,  feature engineering, Bigdata, Kaspersky, Algorithm, CI/CD, Jenkins, POC,  algorithms, extract, transform and load, caching, Amazon AWS, MapReduce, DB, bash, map reduce, RDBMS, HDFS

Sprachen

DeutschverhandlungssicherEnglischMuttersprache

Projekthistorie

Data Scientist

Softgarden

 

 

 Deployed H20, Scikit-learn, and PyTorch based salary prediction models for recruiting managers 

 Improved the model performance via Distribution Analysis like Histogram, Boxplot, Correlation Plot by removing outliers 

 Increased the final machine learning model performance by 5 % using ensemble models of Deep Leaning, XGBoost, GBM 

 Increased the machine learning model performance by creating new 300 Dimension Smooth Inverse Frequency Vectors 

 Working closely with Product owners and stake holders to build new machine learning use cases for customers 

 Developing NLP and Facebook Fast Text based work force recruitment Recommendation model 

 Statistical Analysis to find out the relationship and to build better feature engineering for ML models 

 Cleaning Vast amount of structured and unstructured data for machine learning models using NLP and Bigdata tools 

 Creating Machine Leaning Pipe line for Production with Flake8 and Pylint convention 

Data Scientist

Kaspersky Lab R&D
* Prevention of Domain Generation Algorithm using Random Forest and Deep Learning
* Creating Fail fast machine learning models as POC to understand Model Performance and Feature Contributions
* Worked with lateral movement detection algorithm using neural network
* CI/CD using Jenkins and Docker
* Note : Since they shut down the whole Ireland office, everyone had to leave the company.

Data Scientist (R&D)

First Data Corporation, R&D
* Developed deep learning based credit and debit card fraud detection model using historic banking data
* Improved machine learning model performance by 7% using Hyper parameter grid search to obtain best model
* Statistical correlation analysis and plots to find outliers and feature importance with targeted data
* Deployed NLP based Obfuscation Detection Algorithm to prevent fraudsters using sentence embedding
* Created POC to reach out clients by a web-based machine learning model for credit risk analysis
* Developed fail fast machine learning algorithms and stacked ensemble models using Automatic Machine Learning
* Automating Hadoop Spark jobs to collect, extract, transform and load to ML models from EDH to Local

Kontaktanfrage

Einloggen & anfragen.

Das Kontaktformular ist nur für eingeloggte Nutzer verfügbar.

RegistrierenAnmelden