Home
Softono
Units_of_Measure_Harmonization-intelligence-platform

Units_of_Measure_Harmonization-intelligence-platform

Open source Apache-2.0 PowerShell
812
Stars
754
Forks
0
Issues
21
Watchers
3 weeks
Last Commit

About Units_of_Measure_Harmonization-intelligence-platform

The Units of Measure Harmonization Intelligence Platform is a production-grade machine learning system designed to automate the detection and correction of Unit of Measure errors in manufacturing and procurement data. Built on KNIME Analytics Platform 4.5 or higher, the system achieves 88 to 92 percent accuracy with 94 percent operational autonomy. It addresses critical issues such as order fulfillment disasters, inventory chaos, supply chain disruptions, financial losses, and compliance violations caused by misplaced decimals or incorrect units. The platform utilizes an ML classification engine powered by XGBoost with over 60 engineered features and processes up to 3,300 records per minute. It incorporates NIST-compliant physics-based validation rules and a Q-learning reinforcement learning agent for high-level automation. A key feature is the interactive visual dashboard integrated directly into the workflow, which provides real-time error detection, confidence scoring, auto-correction tracking, root cause

Platforms

Web Self-hosted

Languages

PowerShell

Links



Units of Measure Harmonization Intelligence Platform

Production-Grade ML System for Automated UOM Error Detection

License GitHub release GitHub stars GitHub forks KNIME

Quick Start | Demo | Features | Docs | Contribute


Manufacturing and procurement organizations lose millions annually due to Unit of Measure (UOM) errors.

A single misplaced decimal or wrong unit causes:

  • Order fulfillment disasters (50kg vs 50lbs)
  • Inventory chaos (overstocking/understocking)
  • Supply chain disruptions (wrong quantities shipped)
  • Financial losses (incorrect billing, waste)
  • Compliance issues (regulatory violations)

See It In Action

Visual Dashboard Demo

Real-time visual dashboard showing UOM error detection, classification, and correction in action

Dashboard Features

The interactive visual dashboard provides:

  • Real-time error detection - See UOM issues as they're identified
  • Confidence scoring - ML probability (0-100%)
  • Auto-correction tracking - Watch the system fix errors
  • Root cause analytics - Understand WHY errors occur
  • Intuitive visualizations - Color-coded, charts, statistics
  • Performance metrics - Speed, accuracy, autonomy

Built into the KNIME workflow - zero additional setup needed!

Full Dashboard Guide


System Architecture

KNIME Workflow Architecture

Complete KNIME workflow showing data pipeline, ML engine, and automation components

Key Components

  • Data Ingestion - CSV/Excel with validation
  • ML Classification Engine - 60+ features, XGBoost
  • Physics Validation - NIST-compliant conversion rules
  • Reinforcement Learning - Q-learning autonomy agent
  • Interactive Dashboard - Real-time visualization

Detailed Architecture


Quick Start

Get running in under 5 minutes:

# 1. Clone repository
git clone https://github.com/JulietMirambo/Units_of_Measure_Harmonization-intelligence-platform.git
cd Units_of_Measure_Harmonization-intelligence-platform

# 2. Import into KNIME Analytics Platform (4.5+)
# File -> Import KNIME Workflow -> Select 'workflow' folder

# 3. Execute with sample data
# Right-click workflow -> Execute -> Select data-sample/sample_10k.csv
# Results in 2-3 minutes!

Detailed Installation Guide


Technology Stack

  • ML Engine: 60+ engineered features, XGBoost classifier, 5-fold cross-validation
  • Processing: 3,300 records per minute throughput
  • Validation: NIST-compliant physics-based conversion engine
  • Autonomy: Q-learning reinforcement learning agent (94% automation)
  • Dashboard: Interactive JavaScript visualization with real-time updates
  • Platform: KNIME Analytics 4.5+

What Makes This Special

  • Visual Intelligence: Watch errors being caught and corrected in real-time
  • Enterprise-Ready: Handles millions of records with consistent performance
  • Self-Learning: ML model improves accuracy over time with feedback
  • Zero Configuration: Works out of the box with sensible defaults
  • Production-Tested: Battle-hardened on real manufacturing data
  • Open Source: Free for commercial use under MIT license

Usage

Basic Workflow

  1. Import your data (CSV/Excel with UOM column)
  2. Execute the workflow (one-click execution)
  3. View results in the interactive dashboard
  4. Export corrections to apply to your system

Example Results

Input:  "50 KG" (should be "50 EA")
Output: Detected | Corrected | Confidence: 94%

Processing: 10,000 records
Time: 3 minutes
Errors Found: 847 (8.47%)
Auto-Corrected: 796 (94%)
Manual Review: 51 (6%)

Supported Formats

  • CSV files (UTF-8, any delimiter)
  • Excel files (.xlsx, .xls)
  • Tab-separated values
  • Pipe-delimited files

Error Types Detected

  • Decimal Errors - 50.0 vs 50 EA
  • Unit Mismatches - KG vs EA, LBS vs KG
  • Conversion Issues - Imperial/Metric confusion
  • Format Problems - Spacing, capitalization
  • Missing Units - Blank or null UOM fields

More Examples & Use Cases


Documentation

Getting Started

In-Depth Guides

Support


Use Cases

This platform solves UOM problems across industries:

Manufacturing

  • Production Planning - Prevent material ordering errors
  • Inventory Management - Clean SKU master data
  • Bill of Materials - Standardize component units

Supply Chain

  • Order Fulfillment - Fix quantity discrepancies
  • Demand Forecasting - Ensure data consistency
  • Multi-vendor Integration - Harmonize supplier data

Procurement

  • Purchase Orders - Validate unit specifications
  • Contract Management - Standardize terms
  • Spend Analysis - Accurate cost calculations

Data Quality

  • Data Migration - Clean legacy systems
  • Healthcare - Standardize medical units
  • Research - Ensure measurement accuracy

ROI: Organizations report:

  • 60-80% reduction in UOM errors
  • 50%+ time savings on data quality tasks
  • 90%+ reduction in order fulfillment issues
  • Significant cost savings (millions in prevented losses)


Academic Citation

License

Apache License 2.0 - Free for commercial use!

This means you can:

  • Use commercially without restrictions
  • Modify and distribute freely
  • Distribute your own versions (with attribution)
  • Use privately in your organization
  • Grant sublicenses as needed
  • Patent use is explicitly granted

Key conditions:

  • Must include a copy of this license in distributions
  • Modified files must carry prominent change notices
  • Copyright and license notices must be preserved

Full License Text

@software{mirambo2025uom,
  author = {Mirambo, Juliet Bosibori},
  title = {Units of Measure Harmonization Intelligence Platform},
  year = {2025},
  publisher = {GitHub},
  url = {https://github.com/JulietMirambo/Units_of_Measure_Harmonization-intelligence-platform}
}

Roadmap

Current Version (v1.0)

  • ML-powered error detection
  • Interactive dashboard
  • KNIME workflow automation
  • Sample datasets

Upcoming Features

  • API endpoint for integration
  • Multi-language support
  • Mobile dashboard
  • Advanced RL algorithms
  • Cloud deployment options

View Full Roadmap


Back to Top


Repository Topics: machine-learning | data-quality | knime | automation | manufacturing | supply-chain | data-cleaning | unit-conversion | artificial-intelligence | production-ready