Skip to content
View mhaegeman's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report mhaegeman

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mhaegeman/README.md
English Français Español Dansk


Typing SVG


Profile Views LinkedIn Portfolio Medium

📍 Copenhagen, Denmark  |  Open to Data Engineering & ML Engineering roles


class Maxime(DataScientist, DataEngineer):
    """
    Developer bridging the gap between models and production,
    architecting scalable data & ML pipelines.
    """
    def __init__(self):
        self.role    = "Senior Data / ML Engineer"
        self.base    = "Copenhagen, Denmark"
        self.exp     = 5  # years of experience

    def mission(self) -> str:
        return "From messy data to reliable predictions, at scale."

🔧 Technical Skills

languages   Python SQL Bash YAML

cloud_infra  Azure Databricks GCP AWS Docker RAG GitHub Actions

data_eng   PySpark Airflow Snowflake Delta Lake CI/CD Data Governance

ml_stack   TensorFlow PyTorch HuggingFace spaCy MLflow XGBoost

monitoring  Grafana Tableau PowerBI

project_mgmt  Agile Scrum Kanban JIRA Confluence OKRs


💼 Experience

Data EngineerMassive Entertainment (Ubisoft Studio) | Malmö, Sweden | Mar 2024 – Present

  • Technical reference for ML and analytics pipelines; define coding standards and conduct code reviews.
  • Led the migration from a legacy data platform to Azure Databricks, improving scalability and compute efficiency.
  • Designed real-time monitoring pipelines with Grafana for live game analytics.
  • Built and maintained a core internal Python library shared across the data team.

Data Scientist / ML EngineerFollo | Dongen, Netherlands | Jan 2023 – Feb 2024

  • Built Python back-end services powering internal analytics tooling used daily by the marketing team.
  • Designed end-to-end Data Warehouse architectures on GCP (BigQuery + GCS) for external clients.
  • Delivered NLP products for customer review sentiment analysis and automated text generation.
  • Built ETL pipelines with Airflow and Pandas for multi-source data collection and preprocessing.

Data ScientistAnteriad | Madrid, Spain | Apr 2022 – Dec 2022

  • Built a global CRM data pipeline (Python + SQL) to collect, validate, and clean data at scale.
  • Deployed propensity models reducing outbound call volume by 25% through precise targeting.
  • Fine-tuned a Transformer model for postal address country classification — 83% accuracy across 21 countries.
  • Developed a speech-to-text analysis tool to extract insights from customer service calls using NLP.

Junior Data Scientist — BECQUET | Armentières, France | Sep 2020 – Mar 2022

  • Built SQL Server dashboards for sales performance and customer analytics reporting.
  • Developed a customer segmentation & scoring model, reducing marketing costs by 10%.
  • Identified a new target market segment through exploratory data analysis and visualization.
  • Delivered ad-hoc analysis and reporting to support marketing strategy and decision-making.

🚀 Selected Projects

Project Description Stack
seo-content-generator Automated content generation tool optimized for SEO performance Python NLP
openweather Integration with OpenWeather API for real-time meteorological data analysis API Data Integration
scoring-bank-project Credit scoring model and analysis for banking customer data Data Science Scoring

🎓 Education

  • Executive Masters in Deep Learning — MIOTI (2022)
  • Masters in Data Science — OpenClassrooms (2020–2022)
  • BSc in General Engineering — HEI (2016–2019)

🌍 Languages

  • 🇩🇰 Danish: B1 (actively learning)
  • 🇬🇧 English: Fluent
  • 🇫🇷 French: Fluent
  • 🇪🇸 Spanish: Fluent

📊 GitHub Stats

GitHub Stats   Top Languages


If you are hiring in Copenhagen for Data Engineering or ML Engineering roles, I would love to connect.

Let's Connect

Popular repositories Loading

  1. scoring-bank-project scoring-bank-project Public

    A scoring model and the interactive dashboard for Fraud Detection

    Jupyter Notebook 4

  2. seattle-building-energy-forecast seattle-building-energy-forecast Public

    Regression pipeline that predicts annual site energy use (kBtu) of Seattle commercial buildings from the city's public 2015/2016 benchmarking data.

    Jupyter Notebook 3

  3. nutriscore-predictor nutriscore-predictor Public

    Linear regression algorithm to predict a Nutriscore for food products using the open source database OpenFoodFacts

    Jupyter Notebook 2

  4. Python-Object-Clasifier Python-Object-Clasifier Public

    - NLP :-LDA-Text Classification-

    Jupyter Notebook 2

  5. python-client-segmentation python-client-segmentation Public

    https://www.kaggle.com/olistbr/brazilian-ecommerce

    Jupyter Notebook 2

  6. little-one little-one Public

    TypeScript 2