Links to specific topics

Thursday, February 4, 2010

Reading data into WarpPLS: An easy and flexible step

Through Step 2, you will read the raw data used in the SEM analysis. While this should be a relatively trivial step, it is in fact one of the steps where users have the most problems with other SEM software. Often an SEM software application will abort, or freeze, if the raw data is not in the exact format required by the SEM software, or if there are any problems with the data, such as missing values (empty cells).

WarpPLS employs an import "wizard" that avoids most data reading problems, even if it does not entirely eliminate the possibility that a problem will occur. Click only on the “Next” and “Finish” buttons of the file import wizard, and let the wizard to the rest. Soon after the raw data is imported, it will be shown on the screen, and you will be given the opportunity to accept or reject it. If there are problems with the data, such as missing column names, simply click “No” when asked if the data looks correct.

Raw data can be read directly from Excel files, with extension “.xls” (or newer Excel extensions), or text files where the data is tab-delimited or comma-delimited. When reading from an “.xls” file, make sure that the spreadsheet file that has your numeric data is in the first worksheet; that is, if the spreadsheet has multiple worksheets. Raw data files, whether Excel or text files, must have indicator names in the first row, and numeric data in the following rows. They may contain empty cells, or missing values; these will be automatically replaced with column averages in a later step (assuming that the default missing data imputation algorithm is used).

One simple test can be used to try to find out if there are problems with a raw data file. Try to open it with a spreadsheet software (e.g., Excel), if it is originally a text file; or try to create a tab-delimited text file with it, if it is originally a spreadsheet file. If you try to do either of these things, and the data looks messed up (e.g., corrupted, or missing column names), then it is likely that the original file has problems, which may be hidden from view. For example, a spreadsheet file may be corrupted, but that may not be evident based on a simple visual inspection of the contents of the file.


Tashera said...


I am importing my raw file into WarpPLS. I already have values assigned for missing data (-9, -8, etc), but during standardization, those values are changed. How do I identify those as missing values prior to standardization? Or, do I have to create a new file that has blanks for all missing values prior to importing into WarpPLS?

Ned Kock said...

Regarding missing values, simply leave the corresponding cells empty.

Đào Duy Tùng said...

hello, I new beginner, I cannot import data from file xlsx, it always showing message "file contains uninterpretable data". I already checked many time but dont know how to correct the data.

If I use this file to smartpls, the software can read very well.

Any advice?

Ned Kock said...

Hi Dao. Please check this video:

If you still have problems, you may want to contact Support and send them your data file: