TY - JOUR
T1 - The statistical prediction of offshore winds from land-based data for wind-energy applications
AU - Walmsley, J.L.
AU - Barthelmie, R.J.
AU - Burrows, W.R.
PY - 2001
Y1 - 2001
N2 - Land-based meteorological measurements at two locations on the Danish coast are used to predict offshore wind speeds. Offshore wind-speed data are used only for developing the statistical prediction algorithms and for verification. As a first step, the two datasets were separated into nine percentile-based bins, with a minimum of 30 data records in each bin. Next, the records were randomly selected with approximately 70% of the data in each bin being used as a training set for development of the prediction algorithms, and the remaining 30% being reserved as a test set for evaluation purposes. The binning procedure ensured that both training and test sets fairly represented the overall data distribution. To base the conclusions on firmer ground, five permutations of these training and test sets were created. Thus, all calculations were based on five cases, each one representing a different random selection from the same data, but maintaining the (approximate) 70-30 split in each bin. This procedure served to ensure that conclusions were not based on a single randomly-selected case. Two statistical methods are employed: multiple linear regression (MLR), and Classification and Regression Trees (CART). MLR produces excellent results using only land-based predictors. The CART results are similar to those from MLR, and tend to be slightly better.
AB - Land-based meteorological measurements at two locations on the Danish coast are used to predict offshore wind speeds. Offshore wind-speed data are used only for developing the statistical prediction algorithms and for verification. As a first step, the two datasets were separated into nine percentile-based bins, with a minimum of 30 data records in each bin. Next, the records were randomly selected with approximately 70% of the data in each bin being used as a training set for development of the prediction algorithms, and the remaining 30% being reserved as a test set for evaluation purposes. The binning procedure ensured that both training and test sets fairly represented the overall data distribution. To base the conclusions on firmer ground, five permutations of these training and test sets were created. Thus, all calculations were based on five cases, each one representing a different random selection from the same data, but maintaining the (approximate) 70-30 split in each bin. This procedure served to ensure that conclusions were not based on a single randomly-selected case. Two statistical methods are employed: multiple linear regression (MLR), and Classification and Regression Trees (CART). MLR produces excellent results using only land-based predictors. The CART results are similar to those from MLR, and tend to be slightly better.
KW - Vindenergi og atmosfæriske processer
U2 - 10.1023/A:1019231625128
DO - 10.1023/A:1019231625128
M3 - Journal article
SN - 0006-8314
VL - 101
SP - 409
EP - 433
JO - Boundary-Layer Meteorology
JF - Boundary-Layer Meteorology
IS - 3
ER -