Learn how to clean and prepare CMMS exports for analysis, including SAP PM, Maximo, Maintenance Connection, Fiix, and more. Step-by-step guide and checklists.

If you want accurate maintenance insights, you must first clean and prepare CMMS exports effectively. Whether you use SAP PM, IBM Maximo, Maintenance Connection, Fiix, UpKeep, or MEX, the exported CSV is only as good as the structure you impose on it. Raw CMMS data is often messy: duplicated assets, inconsistent naming, empty fields, date format issues, and "free text chaos" that hides real trends.
This guide offers a battle-tested, field-proven process used by planners, reliability engineers, and CI/lean teams to transform raw CMMS data into clean, analysis-ready datasets — without spending hours sanitising spreadsheets.
Most maintenance systems were not designed for analytics. Operators type free-text comments. Planners update fields inconsistently. Vendors configure systems differently. Over time, exports become littered with:
Without cleaning, your downtime analysis, backlog analysis, PM compliance reports, and cost breakdowns are unreliable. Worse, they lead to bad decisions about labour, spares, and corrective strategy.
This is the exact method used by high-performing maintenance and reliability teams.
Every CMMS platform exports differently:
Start by aligning everything to a universal schema.
Recommended standard fields (minimum viable analysis set):
| Category | Field Name |
|---|---|
| Work Order | work_order_id |
| Work Order | status |
| Work Order | priority |
| Work Order | work_type |
| Time | requested_date |
| Time | scheduled_date |
| Time | completed_date |
| Time | downtime_minutes |
| Time | labour_hours |
| Asset | asset_id |
| Asset | asset_name |
| Asset | location |
| Asset | department |
| Failure | failure_code |
| Failure | cause_code |
| Failure | remarks |
| Cost | material_cost |
| Cost | labour_cost |
| Cost | total_cost |
| Metadata | site |
| Metadata | planner |
| Metadata | is_pm |
| Metadata | is_overdue |
Remove duplicates. Rename inconsistent headings. Fill missing but required headings with blank columns.
This is the number one issue across all CMMS platforms.
Common problems:
20250118, Maximo as 18/01/25, Fiix as 2025-01-18T14:33Z.Best practice:
YYYY-MM-DD.Asset naming chaos is one of the biggest blockers to accurate Pareto analysis.
Common inconsistencies:
To clean:
PUMP, PMP, PMP- → PUMP).Result: Your top 10 downtime assets become actual insights, not noise.
Every CMMS has legacy status codes:
Create a normalisation table:
| Raw Status | Standard Status |
|---|---|
| COMP, COMPLETE, C | Completed |
| OPEN, OPN, O | Open |
| CAN, CANCEL, X | Cancelled |
Repeat the same process for priorities and work types.
This ensures apples-to-apples comparison across departments and sites.
Most CMMS datasets contain:
You have three options:
Option A — Standardise manually Slow but accurate. Best for small datasets.
Option B — Use controlled dictionaries Map common terms ("bearing", "seal", "sensor", "belt") to standard categories.
Option C — Use AI-assisted classification (recommended) Modern AI and machine learning models can classify thousands of rows rapidly and accurately. LeanReport uses this method.
Before you begin any Pareto analysis, backlog analysis, or cost modelling, confirm:
YYYY-MM-DD.AUFK, AFVC).A tissue manufacturing plant using SAP PM exported 18 months of breakdown work orders. Initial analysis showed:
After cleaning:
This is the power of proper cleaning.
You can automate:
LeanReport already automates all of this across major CMMS platforms.
PUMP 04 → PUMP-04Map everything to:
Map everything to:
A clean file has:
Once the dataset meets these criteria, CMMS data analysis becomes:
This is the foundation for reliability engineering, lean programmes, and CapEx justification.
Cleaning and preparing CMMS exports is the highest-leverage activity a maintenance planner can undertake. It transforms messy CMMS data into reliable insights that drive decisions on labour, spares, PM reviews, failure modes and capital planning.
The process may seem tedious, but once standardised — or automated — it unlocks the real value trapped in your maintenance history.
LeanReport automatically cleans and prepares CMMS exports from SAP PM, Maximo, MEX, Maintenance Connection, Fiix, UpKeep and more. It standardises fields, fixes dates, normalises asset names, deduplicates work orders, and classifies free-text failures — all before analysis begins.
Upload your file, choose the dataset type, and LeanReport does the rest.
Start your first report at: https://www.leanreport.io
Standardise headings, fix date formats, normalise asset names, unify status codes, resolve duplicates and classify failure text. Automation tools can accelerate this dramatically.
SAP, Maximo and MEX usually require heavy normalisation due to inconsistent naming and free-text fields.
Use rule-based mapping (e.g., "PMP", "PUMP", "PMP-") and convert everything to a standard format such as PUMP-004.
Use ISO format: YYYY-MM-DD. This removes ambiguity across sites and international teams.
Fill blanks with NULL. Never use placeholders like "N/A", as they break automated analysis tools.

Founder - LeanReport.io
Rhys is the founder of LeanReport.io with a unique background spanning marine engineering (10 years with the Royal New Zealand Navy), mechanical engineering in process and manufacturing in Auckland, New Zealand, and now software engineering as a full stack developer. He specializes in helping maintenance teams leverage AI and machine learning to transform their CMMS data into actionable insights.
Mobile technology is reshaping how maintenance teams capture, report and act on data right where the work happens. Learn how mobile reporting drives faster response, better accuracy and stronger decisions.
Learn how to build a strategic annual maintenance report with clear templates, meaningful KPIs and consistent structure that drives reliability and supports decision-making.
Learn a practical, data-driven continuous improvement framework for maintenance teams to reduce downtime and boost reliability using CMMS data.