Built by logistics data engineers who got tired of writing the same integration code for every WMS deployment.
Ryan Nakamura spent six years as a data engineer at a 3PL network handling fulfillment for mid-market retailers. Every time a new client onboarded, the answer was the same: write a Python script to pull from their WMS, clean the schema mismatches by hand, and load it into the shared data warehouse. Repeat for every schema version change. Repeat when the WMS vendor released a patch. Repeat when the client added a new facility.
By 2022, Ryan's team had accumulated 43 bespoke integration scripts, none of them monitored, half of them running on servers nobody owned anymore. When a major client's inventory counts drifted by 12% because a WMS upgrade silently renamed a column, it took three days to trace the cause.
MLPipeLab started as an internal tool in early 2023 — a framework for schema-aware pipeline management with drift detection built in. By the time Ryan had used it across 8 client integrations, it was clear the problem wasn't specific to his 3PL. It was endemic to logistics data operations at any organization running more than two software systems.
We incorporated in late 2023, brought on a small engineering team, and shipped the first commercial version in Q1 2024.
Managing 11 client accounts, each with a different WMS configuration. The data engineering team spent 40% of their time maintaining integration scripts instead of building analytics.
MLPipeLab replaced 9 bespoke scripts with certified connectors. Schema drift alerts now catch WMS version mismatches before they corrupt the shared warehouse.
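In sketch form, a drift check of this kind compares the schema a pipeline expects against what the source actually exposes before anything is loaded. The column names, expected schema, and renamed column below are invented for illustration; this is not MLPipeLab's actual connector code.

```python
# Hypothetical expected schema for a WMS inventory feed (invented names).
EXPECTED_WMS_SCHEMA = {
    "inventory_id": "INTEGER",
    "sku": "TEXT",
    "qty_on_hand": "INTEGER",
    "facility_code": "TEXT",
}

def detect_schema_drift(observed: dict[str, str]) -> list[str]:
    """Return human-readable drift findings; empty list means schemas match."""
    findings = []
    for col, col_type in EXPECTED_WMS_SCHEMA.items():
        if col not in observed:
            findings.append(f"missing column: {col}")
        elif observed[col] != col_type:
            findings.append(f"type change on {col}: {col_type} -> {observed[col]}")
    for col in observed:
        if col not in EXPECTED_WMS_SCHEMA:
            findings.append(f"new column: {col}")
    return findings

# A WMS patch that silently renames a column surfaces as two findings
# instead of corrupting the warehouse downstream.
drift = detect_schema_drift({
    "inventory_id": "INTEGER",
    "sku": "TEXT",
    "quantity_on_hand": "INTEGER",   # renamed by the upgrade
    "facility_code": "TEXT",
})
```

Running the check before the load, rather than reconciling counts afterward, is what turns a three-day investigation into a pre-load alert.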
Running SAP EWM alongside a legacy TMS from a vendor that hadn't updated its EDI documentation in 4 years. Month-end inventory reconciliation took 3 days because no automated pipeline linked the two systems.
MLPipeLab's SAP EWM connector and the legacy TMS generic adapter normalized both schemas into a shared staging layer. Reconciliation now runs automatically overnight.
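The staging-layer idea can be shown in miniature: each source gets a field map into one shared schema, and reconciliation becomes a comparison of like-named fields. The field names and maps here are invented examples, not the product's real SAP EWM or TMS mappings.

```python
# Invented field maps from two source systems into a shared staging schema.
SAP_EWM_MAP = {"MATNR": "sku", "LGNUM": "warehouse", "VERME": "qty"}
LEGACY_TMS_MAP = {"item_no": "sku", "site": "warehouse", "on_hand": "qty"}

def to_staging(record: dict, field_map: dict[str, str]) -> dict:
    """Rename source-specific fields to the shared staging schema."""
    return {field_map[k]: v for k, v in record.items() if k in field_map}

sap_row = to_staging({"MATNR": "A-100", "LGNUM": "W01", "VERME": 40}, SAP_EWM_MAP)
tms_row = to_staging({"item_no": "A-100", "site": "W01", "on_hand": 38}, LEGACY_TMS_MAP)

# Once both rows share a schema, reconciliation is a plain comparison
# that can run unattended overnight.
qty_mismatch = sap_row["qty"] != tms_row["qty"]
```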
Processing X12 214 status updates from 140 active carrier relationships. Each carrier's EDI feed had minor variations, and the operations team maintained a manual exception log for feeds that failed validation each week.
MLPipeLab's EDI normalization layer absorbed carrier-side variations. The exception log went from 18 entries per week to 2, both traceable to carriers with documented non-standard implementations.
Every pipeline run produces a validation report. We don't consider a feature done until it logs what it did and why it made each decision. Silent failures are unacceptable in production logistics data.
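One way to read "logs what it did and why": every step appends a structured decision record with an explicit reason, and the run report serializes the whole trail. The class and field names here are a sketch, not MLPipeLab's actual report format.

```python
import json
from dataclasses import dataclass, field, asdict

@dataclass
class RunReport:
    """Illustrative run report: each decision carries its own reason."""
    pipeline: str
    decisions: list = field(default_factory=list)

    def record(self, step: str, action: str, reason: str) -> None:
        self.decisions.append({"step": step, "action": action, "reason": reason})

    def to_json(self) -> str:
        return json.dumps(asdict(self), indent=2)

report = RunReport("wms_inventory_sync")
report.record("schema_check", "aborted load",
              "column qty_on_hand missing from upstream feed")
```

Because the reason travels with the action, a failed run explains itself instead of requiring a multi-day trace.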
We build for logistics specifically. A general-purpose ETL tool doesn't know that Manhattan Associates uses "LPN" where SAP uses "HU". We do. That knowledge is baked into the connector layer.
The ML mapping engine speeds up schema reconciliation, but field mappings that affect production data should have a human sign-off. We don't hide the confidence score — it's always visible before you approve a mapping.
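The sign-off rule reduces to a simple invariant: a suggested mapping carries its confidence score visibly, but no confidence value, however high, substitutes for approval. This is a sketch of the principle, with invented names, not the product's actual approval API.

```python
from dataclasses import dataclass

@dataclass
class SuggestedMapping:
    """Illustrative ML-suggested field mapping (names invented)."""
    source_field: str
    target_field: str
    confidence: float       # always visible to the reviewer
    approved: bool = False  # set only by an explicit human sign-off

def apply_to_production(mapping: SuggestedMapping) -> bool:
    """Only explicitly approved mappings may touch production data."""
    return mapping.approved  # no auto-approval path, regardless of confidence

m = SuggestedMapping("QTY_OH", "qty_on_hand", confidence=0.93)
apply_to_production(m)   # False: high confidence alone is not enough
m.approved = True
apply_to_production(m)   # True only after sign-off
```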
Every configuration option we add is a potential source of user error. We try to eliminate options by having sensible defaults for logistics-specific patterns — and to document clearly when you actually need to override them.
Connect us to your WMS and we'll run a live schema discovery in 30 minutes. With agent mode, no data leaves your environment during the demo.
Request a Demo