Data Transformation

Custom Search

Get Free Six Sigma Tools Here

Data Transformation


Purpose

Data transformation is used to transform data that is not normally distributed into data that follows a normal distribution. Thus allows us to calculate basic statistics and valid probabilities related to the population (mean, standard deviation, Z values, probabilities for defects, yield, etc.).


Anatomy

how to transform data

Reference: Juran's Quality Control Handbook

Terminology

A. Mathematical transformations - Prior to using these transformations consult the Application Cookbook.

B. Range of variable studied.

C. Original distribution of the variable.

D. Resulting distribution after applying a mathematical transformation.


Major Considerations

Before using a mathematical transformation apply steps 1 and 2 of the Application Cookbook.

Application Cookbook

If the data is not normally distributed (e.g. fails the normality test in Minitab) conduct the following steps:

1. Examine the data to see if there is a nonstatistical explanation for the unusual distribution pattern. For example, if data is collected from various sources (similar machines or individuals performing the same process) and each one has a different mean or standard deviation, then the combined output of the sources will have an unusual distribution such as a mixture of the individual distributions. In this case, separate analyses could be made for each source (individual, machine, etc.).

2. Analyze the data in terms of averages instead of individual values. Sample averages closely follow a normal distribution even if the population of individual values from which the sample averages came is not normally distributed. If conclusions on a characteristic can be made based on the average value proceed but remember these only apply to the average value and not to the individual values in the population.

3. If steps 1 and 2 do not provide with reliable estimates, use the Weibull distribution. Consult the Application Cookbook of the tool Distribution - Weibull. The resulting straight line can provide estimates of the probabilities for the population.

4. If all above steps fail in providing reliable estimates, use one of the most common mathematical transformations which include:

From Data Transformation to Free Six Sigma Tools.

Recent Articles

  1. Quality Control Plan - Simple Method To Create Effective Plans

    May 10, 16 09:24 PM

    A Quality Control Plan is a documented description of the activities needed to control a process or product. The objective of a QCP is to minimize variation.

    Read More

  2. Get Free Six Sigma Tools

    May 10, 16 08:49 PM

    The Largest Collection of Free Six Sigma Tools and Training on the Web!

    Read More

  3. Weibull Distribution

    May 10, 16 07:28 PM

    The Weibull distribution is applicable to make population predictions around a wide variety of patterns of variation.

    Read More