TY - JOUR

T1 - The statistical prediction of offshore winds from land-based data for wind-energy applications

AU - Walmsley, J.L.

AU - Barthelmie, R.J.

AU - Burrows, W.R.

PY - 2001

Y1 - 2001

N2 - Land-based meteorological measurements at two locations on the Danish coast are used to predict offshore wind speeds. Offshore wind-speed data are used only for developing the statistical prediction algorithms and for verification. As a first step, the two datasets were separated into nine percentile-based bins, with a minimum of 30 data records in each bin. Next, the records were randomly selected with approximately 70% of the data in each bin being used as a training set for development of the prediction algorithms, and the remaining 30% being reserved as a test set for evaluation purposes. The binning procedure ensured that both training and test sets fairly represented the overall data distribution. To base the conclusions on firmer ground, five permutations of these training and test sets were created. Thus, all calculations were based on five cases, each one representing a different random selection from the same data, but maintaining the (approximate) 70-30 split in each bin. This procedure served to ensure that conclusions were not based on a single randomly-selected case. Two statistical methods are employed: multiple linear regression (MLR), and Classification and Regression Trees (CART). MLR produces excellent results using only land-based predictors. The CART results are similar to those from MLR, and tend to be slightly better.

AB - Land-based meteorological measurements at two locations on the Danish coast are used to predict offshore wind speeds. Offshore wind-speed data are used only for developing the statistical prediction algorithms and for verification. As a first step, the two datasets were separated into nine percentile-based bins, with a minimum of 30 data records in each bin. Next, the records were randomly selected with approximately 70% of the data in each bin being used as a training set for development of the prediction algorithms, and the remaining 30% being reserved as a test set for evaluation purposes. The binning procedure ensured that both training and test sets fairly represented the overall data distribution. To base the conclusions on firmer ground, five permutations of these training and test sets were created. Thus, all calculations were based on five cases, each one representing a different random selection from the same data, but maintaining the (approximate) 70-30 split in each bin. This procedure served to ensure that conclusions were not based on a single randomly-selected case. Two statistical methods are employed: multiple linear regression (MLR), and Classification and Regression Trees (CART). MLR produces excellent results using only land-based predictors. The CART results are similar to those from MLR, and tend to be slightly better.

KW - Vindenergi og atmosfæriske processer

U2 - 10.1023/A:1019231625128

DO - 10.1023/A:1019231625128

M3 - Journal article

SN - 0006-8314

VL - 101

SP - 409

EP - 433

JO - Boundary-Layer Meteorology

JF - Boundary-Layer Meteorology

IS - 3

ER -