18.05.2025 aktualisiert

Machine Learning Engineer / Data Scientist / Data Engineer / Data Science Project Manager

Fürth, Deutschland

Weltweit

Fürth, Deutschland

Weltweit

Profilanlagen

Skills

Python TensorFlow PyTorch Data Science Big Data Deep Learning Cloud Computing (AWS/GCP) Machine Learning Research & Development Artificial Intelligence Künstliche Intelligenz Data Scientist machine learning data science NLP Computer Vision LLM GPT

Highly skilled and experienced freelance machine learning engineer/consultant with a deep business understanding specialized in state of the art deep learning, machine learning and data science with a proven track record of delivering high-quality results in a fast-paced and production-ready environment.

I have worked on projects for various clients in different industries, using my expertise to help the organisation improve efficiency, reduce costs, and increase revenue through the use of data-driven solutions.

Frameworks:

Keras, PyTorch, scikit-learn, TensorFlow, XGBoost
Conda/Anaconda, Jupyter, Matplotlib, NumPy, openCV, pandas, plotly, Poetry
MLflow, SageMaker, Vertex AI

Applications:

Anomaly Detection, Audio Analysis and Synthesis, Clickstream Analysis, Computer Vision, Content Understanding, Data Analysis, Data Mining, Data Visualisation, Deep Learning, Dynamic Pricing, Fraud Detection, Image Processing, Image Recognition/Classification, Machine Learning, Natural Language Processing (NLP), Natural Language Understanding, Product Similarities, Recommendation Systems, Speech Recognition

Algorithms:

Deep Neural Networks, Convolutional Neural Networks, LSTM, (Variational-)Autoencoder, Transformers
Hyperparamer Tuning, Transfer Learning
Model/Feature Analysis using SHAP
Dimensionality-Reduction (PCA, t-SNE, LDA, Autoencoder, UMAP)

Programming Skills:

Python
C/C++, Java, MATLAB/GNU Octave, PHP
Clean Code, PyTest, Static Code Analysis, Unittest
Bamboo, Bitbucket, Jenkins, Git, GitHub, GitLab
Software Development and Software Architecture
Linux, macOS, Windows

Database skills:

Apache Spark, BigQuery, Elasticsearch, Exasol, Graylog, Kibana, MS-SQL, MySQL, Oracle DB

Big Data:

Amazon Web Services (AWS), EMR, SageMaker, Apache Spark
Google Cloud Platform (GCP), BigTable, BigQuery, Vertex AI
Hadoop, PySpark
FFmpeg for Video Processing

Virtualisation:

Docker
Kubernetes

Agile-Tools:

Confluence, Jira, Miro, Slack, Teams, Trello

Sprachen

DeutschMutterspracheEnglischverhandlungssicherUngarischverhandlungssicher

Projekthistorie

Machine Learning Engineer / Data Scientist for Search Engines

OTTO

Konsumgüter und Handel

>10.000 Mitarbeiter

As a machine learning engineer and data scientist in the search team at OTTO, my main task is to use
state of the art machine learning techniques to improve the search experience for our customers.
The Solr search engine, which processes 1.000 queries per second and supports around 20 million
product variants 24/7, is central to OTTO's e-commerce platform. All improvements are extensively
tested and validated through online experiments.

Learning to Select: Improved query precision by filtering out irrelevant results through
comprehensive data-driven solutions on clickstream data. Also identified and removed fraudulent
and bot-generated queries to improve model performance and data integrity.

Hybrid Search: Collaborated with two teams to develop a system that integrates both lexical and
semantic search approaches to provide more relevant search results.

Advanced Spell Check: Designed, implemented, validated and brought to production a leading-edge
spell checking system. This solution not only corrects customer spelling errors but also guides them
towards the most relevant products.

Query Intent Detection: I also led the development of a customer query intent detection approach
to identify non-product and navigation queries, and to recognize brand names and their context
within search queries (Named entity recognition and classification).

Toolkit: AWS, GCP, BigQuery, Clickstream Data, FastText, Huggingface Transformers, MLflow, OpenAI
API, SageMaker, AirFlow, Docker, Jenkins, Terraform, Grafana, Prometheus, Elasticsearch, Kibana,
Confluence, Jira, Miro, Agile/Scrum, FastAPI, Poetry, Python, PyTorch, GitHub, Online
Experiments/Testing, Solr, Pair Programming

Large Language Model (LLM) Integration Consultant for Start-Ups

Divers

As an external consultant, I helped startups to use GPT and other large language models (LLMs). I
provided training, evaluated use cases, assessed limitations such as security, performance, accuracy
and explored options/alternatives to the OpenAI API.

Toolkit: Haystack, Hugging Face models, LangChain, Ollama, OpenAI API, Python

Data Product Owner & Solutions Architect / Machine Learning Consultant

RTL Deutschland

Medien und Verlage

5000-10.000 Mitarbeiter

As a freelance consultant and expert in machine learning applications for content understanding, I
supported the RTL Data team in building the next generation multi-purpose platform "RTL+" in
cooperation with Deezer, using visual (video), audio and text data. An integral part of my role was to
manage and balance the needs and expectations of the various stakeholders involved in the project.

The primary goal of this project is to derive and provide additional metadata from the raw content
that can be used by downstream applications such as search, recommendation, and personalization.
The key challenge is to establish a clean, reliable, scalable, and production-ready state-of-the-art
solution for a large number of building blocks and to create an efficient execution pipeline on top of
it.

Video based models: Aesthetic Ranking, Dominant Color Extraction, End Credits Detection, Face Detection, Image Quality Detection, Logo Detection, Mood Detection, Object detection and Recognition, Place Prediction, Scene and Shot-Boundary Detection, Shot Type Detection by using and optimizing both pre-trained and self-trained models.

Audio based models and solutions: Speech-to-Text transcriptions using Google’s Speech-to-Text API and Whisper from Open-AI on Podcasts and other audio sources and music identification.

NLP solutions: language detection (fastText), festivity detection, kids content detection, adult content detection, topic modeling (BERTopic), keyword extraction (KeyBERT) and text summarization.

Toolkit: Argo Workflows, Confluence, Docker, Elasticsearch, FFmpeg, GitLab CI/CD, Google BigQuery, Google Cloud Platform (GCP), Google Data Studio, Grafana, Hugging Face models, Jira, Jupyter/JupyterLab, Kafka, Kibana, Kubernetes, MLflow, NumPy, pandas, Poetry, Pub/Sub, Python, PyTorch, Scrum, spaCy, SQL, Streamlit, TensorFlow, Terraform

Zertifikate

Kontaktanfrage

Einloggen & anfragen.

Das Kontaktformular ist nur für eingeloggte Nutzer verfügbar.

Registrieren Anmelden