All Projects
Machine Learning

Workforce Absenteeism Prediction Platform (Azure Fabric)

Designed and deployed a production-grade workforce absenteeism prediction system on Azure Fabric, enabling proactive workforce planning through weekly employee-level risk forecasting and decision intelligence dashboards.

Weekly
prediction cadence
Employee-level
risk scoring granularity
Proactive
decision-making shift
System Overview
Workforce Absenteeism Prediction Platform (Azure Fabric) system overview

1The Problem

A large apparel manufacturing organization relied on reactive reporting to manage absenteeism, leading to production disruptions, overtime costs, and inefficient workforce planning. Key challenges: - No forward-looking absenteeism insights - Manual and lagging reporting processes - Inability to link absenteeism to behavioral and operational drivers - Production inefficiencies due to unexpected workforce shortages This resulted in missed production targets, line imbalances, and increased operational costs.

2The Approach

I designed a batch-based decision intelligence system that predicts absenteeism risk at an employee × week level and integrates directly into workforce planning workflows. Key decisions: - Frame the problem as probabilistic risk prediction instead of binary classification - Use a global XGBoost model to capture shared patterns across employees - Implement medallion architecture (bronze → silver → gold) for scalable data modeling - Align prediction cadence with weekly production planning cycles - Integrate predictions directly into Power BI dashboards for decision-making

Technical Architecture

Architecture Diagram
Workforce Absenteeism Prediction Platform (Azure Fabric) architecture diagram
1

Data Sources: HRIS, attendance, leave, and shift systems

2

Data Ingestion: Azure Fabric Data Factory pipelines into OneLake

3

Data Modeling: Lakehouse architecture (Bronze → Silver → Gold layers)

4

Feature Engineering: Fabric Notebooks generating behavioral and temporal features

5

ML Training: XGBoost models trained within Fabric notebooks

6

Model Management: Versioning and governance within Fabric workspace

7

Batch Inference: Weekly prediction pipelines generating employee-level risk scores

8

Consumption Layer: Power BI dashboards for HR, factory leadership, and management

9

Monitoring Layer: Model performance tracking and workforce impact analysis

Results

  • Enabled proactive identification of high-risk absenteeism patterns

  • Reduced production disruptions through better workforce planning

  • Improved alignment between workforce availability and production demand

  • Decreased reliance on reactive overtime planning

  • Increased adoption of data-driven HR decision-making

Key Insights

Framing predictions as risk scores enables flexible decision thresholds

Global models improve stability across employees with limited data

Aligning ML outputs with operational cycles increases business impact

Medallion architecture simplifies reuse across analytics and ML workloads

Embedding ML outputs directly into dashboards drives adoption

Tech Stack

PythonXGBoostAzure FabricOneLakeSynapse Data EngineeringPower BIMachine Learning

Interested in a similar project?

I'd love to discuss how we can apply this kind of approach to your specific business challenge.

Let's Talk