964 VOLUME 14J O U R N A L O F C L I M A T E q 2001 American Meteorological Society Quality Control and Homogeneity of Precipitation Data in the Southwest of Europe J. FIDEL GONZÁLEZ-ROUCO Departamento de Astrofı́sica y Ciencias de la Atmósfera, UCM, Madrid, Spain, and GKSS Forschungzentrum, Geesthacht, Germany J. LUIS JIMÉNEZ Departamento de Astrofı́sica y Ciencias de la Atmósfera, UCM, Madrid, Spain VICENTE QUESADA Departamento de Estadı́stica e Investigación Operativa, UCM, Madrid, Spain FRANCISCO VALERO Departmento de Astrofı́sica y Ciencias de la Atmósfera, UCM, Madrid, Spain (Manuscript received 19 July 1999, in final form 19 April 2000) ABSTRACT A quality control process involving outliers processing, homogenization, and interpolation has been applied to 95 monthly precipitation series in the Iberian Peninsula, southern France, and northern Africa during the period 1899–1989. A detailed description of the procedure results is provided and the impact of adjustments on trend estimation is discussed. Outliers have been censored by trimming extreme values. Homogeneity adjustments have been developed by applying the Standard Normal Homogeneity Test in combination with an objective methodology to select ref- erence series. The spatial distribution of outliers indicates that they are due to climate variability rather than measurement errors. After carrying out the homogeneity procedure, 40% of the series were found to be homogeneous, 49.5% became homogeneous after one adjustment, and 9.5% after two adjustments. About 30% of the inhomogeneities could be traced to information in the scarce history files. It is shown that these data present severe homogeneity problems and that applying outliers and homogeneity adjustments greatly changes the patterns of trends for this area. 1. Introduction In the last few decades there has been an increasing concern regarding the changes in climate caused by an- thropogenic emissions of greenhouse gases and aerosols (Houghton et al. 1992, 1995). This interest has been mainly based on the results of climate models. In a similar fashion, there has been a rise in the activity directed toward climate change detection in observed data (Santer et al. 1995; Hegerl et al. 1996). It is es- sential, for this purpose, to use high quality observa- tions. Thus outliers and homogenization arise as im- portant issues. Corresponding author address: J. Fidel González-Rouco, GKSS Forschungzentrum, GUA, GEB. 38, Max-Planck-Str. 1, 21502 Gees- thacht, Germany. E-mail: fidel@gkss.de Historically, the identification of outliers has been the primary emphasis of quality control work (Filippov 1968; Grant and Leavenworth 1972). Outliers are ob- servations very distant from the mean value that can be due to measurement errors or to extreme meteorological events. Several approaches that focus on temporal and/ or spatial variability can be applied in order to identify outliers and diagnose whether they are erroneous or not (Barnett and Lewis 1994; Eischeid et al. 1995; Peterson et al. 1998a). When outliers are undoubtedly erroneous measurements those extreme data can be rejected and the problem is converted into one of missing data treat- ment. When outliers have a physical background the question arises whether they should be corrected or not (Barnett and Lewis 1994). On the one hand, extreme data carry very valuable climatological information that should not be dismissed. On the other hand, many sta- tistical techniques of common use are not resistant, that is, they are sensitive to the presence of outliers that can 1 MARCH 2001 965G O N Z Á L E Z - R O U C O E T A L . affect the estimation of sample statistics (Lanzante 1996). Such is the case of all moments-based statistics (Peterson et al. 1998a). The last issue can be approached in several ways. The most objective approach would be always to use tech- niques that are resistant (Lanzante 1996). Unfortunately this is not the case for many of the available methods. An alternative and rather radical solution would be re- moving all outliers. This would allow a safe use of nonresistant statistics at the expense of losing interesting climatological information. Another alternative that would combine keeping the information of extreme data and showing consideration for nonresistant methods would be to censor outliers by means of replacing them by some threshold value that keeps the information of an extreme event and yet does not have such an im- portant influence on nonresistant statistics (Barnett and Lewis 1994). The original value of the outliers can be later restored for specific studies concerning extreme values. This last subjective approach has been the one used herein as a preprocessing for the nonresistant ho- mogenization techniques employed in this work. A homogeneous climate series is defined as one where variations are caused only by changes in weather and climate (Conrad and Pollak 1950). This would mean that the measurements have been consistently registered by the same method, with the same instrumentation, at the same time of day and place and in the same envi- ronment. Unfortunately, this is seldom the case and at most old observatories, instruments, location, observer, or other environmental factors have been altered (Jones et al. 1985; Karl and Williams 1987; Peterson et al. 1998a). In the case of precipitation, progressive im- provements of instrumentation can also introduce arti- ficial systematic increases (Hanssen-Bauer and Førland 1994), and thus long-term variations and trends should be interpreted cautiously (Houghton et al. 1992). Ac- curate long-term climate analysis requires homogeneous data, thus, inhomogeneities must be detected and in- homogeneous series be excluded or adjusted. Several statistical tests have been developed that al- low detection of inhomogeneities in time series. Some depend on the metadata (Karl and Williams 1987; Rhoades and Salinger 1993) and others use them as additional information when the station history is poorly documented or missing at all (Buisband 1982; Alex- anderson 1986). Some tests are univariant, while others use information from reference time series that are com- pared with the test series to decide upon its quality. A comprehensive review on this topic has been presented by Peterson et al. (1998b). Considerable research has been accomplished on the characteristics of low-frequency climate variability of precipitation in the western Mediterranean (Maheras 1988; Valero et al. 1993; Zorita et al. 1992; Fernández- Mills 1995; Valero et al. 1996b; Rodriguez-Puebla et al. 1998; Serrano et al. 1999; González-Rouco et al. 2000). However, homogeneity assessments of historical precipitation data during the twentieth century have sel- dom been performed (Almarza et al. 1996; Garrido et al. 1996; Esteban-Parra et al. 1998). Thus, a detailed description of this problem from the perspective of an objective methodology using all the information from a high number of stations is still to come. In this work, we focus on the detection and treatment of outliers and inhomogeneities in a dataset of precip- itation in the southwest of Europe. The adjustment of outliers has been performed by taking into account dis- persion in the time series by means of windsorization, that is, trimming the values that exceed a certain thresh- old (Barnett and Lewis 1994). For detection and pro- cessing of inhomogeneities we have used a procedure based upon the Standard Normal Homogeneity Test de- veloped by Alexanderson (1986). This method assumes invariance of the ratio of the values of the test series and those of a reference series built using information from several neighboring stations. An inhomogeneity in the test series will be revealed by a systematic change in this ratio. However, if there exists no a priori knowl- edge of which ones are the homogeneous time series, the test has to be applied several times to decide which stations have good quality records and can be used as a reference. We have followed a procedure divided in six steps that allows identifying reference series and adjusting inhomogeneous ones. The first five steps fol- low those of Hanssen-Bauer and Førland (1994) and the sixth one has been added to account for series with two inhomogeneities. It is worthwhile making a comment on the limits of the corrections applied in this work. The procedures herein are applied with the intention of improving the quality of the data, but not of providing an error-free dataset. The use of the term ‘‘correct’’ should not be taken to imply a level of perfection in the adjustments that does not exist in any dataset. Future inclusion of new data and/or the use of other methods will hopefully lead to further improvements. Data are described in the next section. A description of the methodology is presented in section 3. Results concerning outliers and homogenization are presented in the following section as well as a description of their impact on trends. Finally conclusions are summed up and discussed. 2. Data We have used long time series of monthly precipi- tation totals (mm) covering most of the Iberian Penin- sula, south of France and north of Africa and spanning the period 1899–1989. Iberian rain gauge data were provided by the Instituto Nacional de Meteorologia (Spain). The Carbon Dioxide Information Analysis Cen- ter (United States) and Metéofrance (France) supplied African and French precipitation data, respectively. Fig- ure 1 shows the spatial distribution of the 95 observa- tories; the corresponding names and lengths of the as- 966 VOLUME 14J O U R N A L O F C L I M A T E FIG. 1. Spatial distribution of precipitation stations. Numbers refer to those of Table 1. Circles point time series in which information from secondary sites was used. sociated time series are listed in Table 1. We have se- lected the period 1899–1989 for this first version of the data. Noncircled numbers in Fig. 1 stand for sites in which time series were used unchanged to apply the procedures described in the next section. In some cases (circled identifiers in Fig. 1) long data gaps were present and/ or temporal extension of the time series was desirable. In order to improve the quality of these records and also the spatial coverage of the dataset additional information from 43 secondary sites (nonnumbered rows in Table 1) was used. Thus, initial corrections following the ratio method (Mitchell et al. 1966) were calculated and ap- plied using data from the overlapping periods between the base stations and the additional observatories. The lower threshold value for correlation between the base station time series and the time series of a candidate secondary site to be used for data interpolation was 0.8. 3. Methodology In this section a brief description is provided on the development of outliers and homogeneity corrections. a. Outliers Historically, the identification of outliers has been the primary emphasis of quality control work (Grant and Leavenworth 1972). Outliers can be erroneous mea- surements or correct extreme values. In this version of the dataset the approach has not been searching for er- roneous measurements, but reducing the size of the dis- tribution tails in order to make a safer use of the non- resistant homogenization techniques used later. This has been performed by trimming extreme data by a thresh- old value. This extreme value keeps the information of an extreme event, but its reduced magnitude in com- parison with the original value has less impact on non- resistant statistics (Lanzante 1996). In the next section the spatial distribution of values identified as outliers will be described showing that in most cases the trimmed values have a physical meaning. Outliers were identified as those values trespassing a maximum threshold for each time series (Trenberth and Paolino 1980; Peterson et al. 1998a) defined by Pout 5 q0.75 1 3IQR, (1) where q0.75 is the third quartile and IQR the interquartilic range. The IQR has been used in quality control of climate data (Eischeid et al. 1995) because it is resistant to outliers. Values over Pout were substituted by this limit. This way of proceeding reduces the bias caused by outliers and yet keeps the information of extreme events (Barnett and Lewis 1994). b. Homogenization 1) THE STANDARD NORMAL HOMOGENEITY TEST The homogenization procedure is based on the ap- plication of the Standard Normal Homogeneity Test (Al- exanderson 1986). This test is based on the assumption that precipitation amounts at the station being tested (test station) and some regional average values are pro- portional to each other. This relationship is expressed in terms of the ratio q between the test station normal- ized precipitation values and those of a regional time series defined as a weighted average of several neigh- boring reference stations. Thus, the ratio q in a specific year may be denoted as Fiq 5 , i 5 1, . . . , n, (2)i Gi n being the number of time steps, and Fi and Gi functions of the precipitation at the test and reference stations that are defined as ki Qij yO j QP j51 jiF 5 G 5 , (3)i i kiP yO j j51 Pi being the precipitation at the test station, Qij the pre- cipitation at the jth reference station, and y j a weight factor for the jth reference station. In this work y j was defined as the square of the correlation coefficient be- tween the test series and the jth reference series. Here ki is the number of reference sites used in time step i that varies as a result of different length of time series and the existence of missing data. Overbars denote av- erages throughout the observational period. In order to apply the test the qi values are standardized by their sample mean, q , and standard deviation, sq: (q 2 q )iz 5 .i sq This series has zero mean and unit standard deviation 1 MARCH 2001 967G O N Z Á L E Z - R O U C O E T A L . and is assumed to be normally distributed, N(0, 1). An inhomogeneity in the test series would affect the series of ratios and thus zi. Therefore, the homogeneity test is based on the definition of the following hypotheses. Null hypothesis, (H0): The test series is homogeneous. Any subset of zi is distributed as N(0, 1). Alternative hypothesis, (H1): The test series is inho- mogeneous. There is an inhomogeneity in year m such that the m first years of the zi have mean value m1 and the (n 2 m) last years have mean value m2, with the standard deviation 1 in both temporal segments. These definitions enable the test to detect and adjust only one inhomogeneity in the original test series. A test statistic, Tm, is computed for each of the n 2 1 possible change points: Tm 5 m 1 (n 2 m) , m 5 1, . . . , n 2 1,2 2z z1 2 (4) where z 1 and z 2 are the mean values of zi during the m first and the (n 2 m) last years, respectively. A high T value in year m implies that m1 and m2 depart signifi- cantly from zero, making H0 unlikely. The maximum Tm value in the time series will be denoted by Tx. The probability that Tx exceeds a certain value if H0 is correct depends only on the length of the time series. The critical T values for the 5% and 10% significance levels (T95 and T90, respectively) are given by Alexanderson (1986). In practice, the test is applied making use also of the existing information about the history of the station, the metadata. A precipitation se- ries is then classified as inhomogeneous when (i) it con- tains an inhomogeneity significant at the 5% level 5 yr or more from either end of the series, and (ii) the series contains an inhomogeneity significant at the 10% level, which is explainable by metadata. The interaction with the metadata information at the 10% level reduces the probability of rejecting homo- geneous series, thus increasing the power of the test. The reason for not accepting unexplained inhomoge- neities close to the ends of the series is an increased probability for high T values near the ends (Hawkins 1977). Once a series is identified as inhomogeneous, the data before the inhomogeneity date are corrected through multiplying them by qaf 5 , (5) qb where qa and qb are the mean values of qi after and before the inhomogeneity. When the date of a change in the historical records was close to the year corre- sponding to Tx, the historical date was used for the correction. Otherwise, the year corresponding to Tz was assumed. Equations (2) and (3) can also be used for interpo- lating missing data (Alexanderson 1986; Valero et al. 1997a). Assuming q(t) . q 5 1 for missing data, the estimations of the missing Fi values, F̂i, can be obtained from F̂i 5 qiGi . q Gi 5 Gi, (6) and thus precipitation for the test station can be esti- mated from P̂i 5 PGi. (7) 2) SELECTION OF REFERENCE STATIONS The above description states how factors can be cal- culated to produce an homogeneous test series by using information for several reference stations. However, usually no a priori knowledge exists on which ones should form the reference series and the test has to be run in an iterative way several times to decide upon which are the best quality series. We have followed the same procedure as Hanssen-Bauer and Forland (1994) for this purpose. We have added one step to their pro- cedure in order to account also for series with two in- homogeneities. Figure 2 highlights the steps of the meth- od. The three first stages are essentially devoted to im- prove the selection of reference stations: Step 1: A first homogeneity assessment. In this step the test is applied to each time series. The reference series will be those that correlate best with the test sta- tion. Since reference stations were not necessarily ho- mogeneous, the result is a first approach to make a clas- sification of homogeneous (H1; see Fig. 2) and inho- mogeneous (I1) series. Step 2: Using adjusted reference series. Corrections are made to the I1 set following Eq. (5). Subsequently the test is applied to each one of the original time series again though using an improved dataset as reference series: the H1 group and the corrected I1 series. Since this correction reduces the amount of inhomogeneities in the reference set this produces a new, somewhat op- timized, classification into homogeneous (H2) and in- homogeneous (I2) series. The only difference with step 1 is using adjusted reference series. Step 3: Applying the test to corrected series. The I2 group is corrected and then the test is applied only to these adjusted time series; the pool of possible reference stations is built with the H2 set and the adjusted I2 stations. The result is a group of homogeneous time series that have been corrected once (HC3) and a subset of inhomogeneous series with more than one inhomo- geneity (IC3). At this stage of the procedure all time series can be grouped into an homogeneous group (H2 and HC3) and an inhomogeneous one (IC3). The next two steps revise this classification by using as a reference only time se- ries that have proved to be homogeneous after applying the test. Step 4: Testing again using only homogeneous series. All series are checked again: the homogeneous H2 group and the corrected I2 group. The reference series are only those that have proved their homogeneity so far, namely, the H2 and HC3 sets. The H2 set provides two new subsets after applying the test, a group of homogeneous 968 VOLUME 14J O U R N A L O F C L I M A T E TABLE 1. Names and periods of the time series used in the com- pilation of the precipitation dataset. Numbers match those in Fig. 1. Correlation values between main sites and additional secondary ob- servatories are also shown. No. Name Period Correlation values 1 Lyon 1841–1994 2 Grenoble 1833–1976 3 Bourdeaux 1899–1994 4 Gap 1899–1992 5 Bagnols Les Bains 1899–1993 6 Joyeuse 1900–93 Service des crues 1900–30 0.98 7 Cahors 1899–1993 8 Avignon 1899–1993 9 Saint Server 1899–1993 10 Nice 1899–1993 Rue Gioffredo 1891–1920 0.90 11 Toulouse 1899–1994 12 Peirehorade 1851–1992 13 Montpellier 1899–1993 14 Gijon 1913–94 Cabo Peñas 1914–22 0.76 15 Castropol 1907–94 Tapia de Casariego 1907–25 0.80 Castropol 1923–72 0.95 16 Santander 1912–91 17 Marselle 1899–1994 18 Revel 1899–1973 19 Coruña 1899–1993 Bilbao sondica 1899–1993 Guernica 1925–30 0.86 Larrasquitu 1925–46 0.86 Machichaco 1913–30 0.77 Basauri 1940–50 0.93 21 San sebastian 1899–1993 22 Toulon 1899–1994 23 Lugo 1913–94 Mundı́n 1913–21 0.92 Brigós 1922–39 0.94 Sarria 1952–66 0.91 24 Reinosa 1911–91 25 Camporredondo 1917–92 26 Santiago 1906–93 Universidad 1899–1950 0.89 27 Cerv. pisuerga 1912–86 28 Pamplona 1899–1993 29 Canfranc 1910–94 30 Perpignan 1899–1993 Pont Rouge 1899–1923 0.98 31 Oña 1900–94 32 Leon 1904–93 33 Capdella 1915–91 34 Logroño 1911–94 Instituto 1911–47 0.85 35 Pontevedra 1899–1994 Salcedo 1949–85 0.96 36 Burgos 1899–1991 37 Orense (Granja) 1899–1994 Santuario Molas 1924–25 0.90 Beariz 1926–35 0.87 Villamayor 1937–50 0.90 Ginzo 1969–72 0.91 38 Huesca 1899–1994 39 Palencia 1913–88 40 Gerona 1911–91 Instituto 1911–69 0.89 41 Soria 1899–1994 42 Zaragoza 1899–1994 TABLE 1. (Continued) No. Name Period Correlation values Facultad de ciencias 1899–1941 0.89 43 Valladolid 1899–1994 44 Lerida 1913–91 45 Vid de aranda 1899–1994 Col. Franciscanos 1914–39 0.85 Aranda del Duero 1943–94 0.84 46 Zamora 1909–94 47 Barcelona 1899–1994 Universidad 1899–1937 0.91 48 Daroca 1899–1993 49 Segovia 1899–1994 50 Salamanca 1899–1993 Observatorio 1899–1944 0.94 51 Tortosa 1899–1991 52 Avila 1900–94 53 Guadalajara 1912–91 Atienza 1937–45 0.81 54 Madrid 1899–1994 55 Teruel 1899–1994 Ariño 1936–43 0.90 56 Hervas 1913–94 Bejar 1913–79 0.90 Aldea del Camino 1951–54 0.90 57 Coimbra 1899–1989 58 Cuenca 1908–90 59 Castellón 1911–93 60 Talavera 1913–90 Candeleda 1936–40 0.90 Robledillo de la Vera 1940–42 0.91 Marrupe 1944–47 0.87 Alcañizo 1959–62 0.96 Cabañuelas 1989–90 0.85 61 Mahón 1899–1994 62 Toledo 1908–91 63 Palma mallorca 1899–1993 64 Caceres 1904–91 65 Valencia 1899–1994 66 Ciudad real 1904–91 67 Albacete 1899–1991 68 Badajoz 1899–1994 69 Alange 1913–94 Valencia del Ventoso 1913–34 0.89 Miajadas 1917–50 0.91 70 Almaden 1913–94 Pozoblanco 1936–94 0.84 71 Lisboa 1899–1989 72 Alicante 1899–1993 73 Cehegim 1913–94 74 Murcia 1899–1993 75 Cazorla 1911–94 76 Aracena 1913–94 Galaroza 1960–94 0.95 Valdeazufre 1964–94 0.96 77 Cordoba 1911–94 78 Jaen 1899–1993 79 Huelva 1903–94 80 Granada 1899–1993 Alhendim 1977–90 0.93 81 Sevilla 1899–1994 82 Almeria 1910–94 Estación sismológica 1909–32 0.89 Ciudad Jardı́n 1933–67 0.89 83 Grazalema 1912–94 84 Malaga 1900–95 85 Dar-el-beida 1838–1994 86 Algiers 1889–1994 1 MARCH 2001 969G O N Z Á L E Z - R O U C O E T A L . TABLE 1. (Continued) No. Name Period Correlation values 87 San Fernando 1899–1989 88 Constantinopla 1899–1994 89 Gibraltar 1899–1994 90 Tanger 1899–1989 91 Oran 1899–1994 92 Tlemcem 1899–1989 93 Biskra 1902–94 94 Djelfa 1899–1994 95 Casablanca 1902–89 FIG. 2. Diagram showing the six steps of the homogenization procedure [after Hanssen-Bauer and Førland (1994)]. Test series are tested by comparing with reference series and sorted into output groups at each step of the procedure for subsequent use. The output groups are identified by the result of the test (H for homogeneous and I for inhomogeneous), by the number of corrections applied to the input test series (C for one correction and CC for two corrections) and by the number of the step in which the test is performed. Independently of this convention symbols indicate the number of corrections applied to any input series. series (H4) and a group of series that contain at least one inhomogeneity (I4). The adjusted I2 is divided into a group of homogeneous series (HC4) and a set of series that have at least two inhomogeneities (IC4). Step 5: Last corrections for one inhomogeneity. The inhomogeneous group I4 is corrected and checked for new inhomogeneities. The reference series are taken from the just updated homogeneous (H4) and homo- geneous after one correction (HC4) sets. In steps 3–5 corrections for series with one inho- mogeneity are applied. Step 6 has been incorporated to correct two inhomogeneities. Step 6: Correcting two inhomogeneities. The test is applied to parts of the series containing only one in- homogeneity. First, the data before the first inhomo- geneity in series belonging to the groups IC4 and IC5 are omitted and the remaining data are corrected using as reference series those classified as homogeneous in the five preceding steps (H4, HC4, HC5); second, the test is run with all the data in order to correct the first inhomogeneity. 4. Results a. Distribution of outliers Figure 3a shows the distribution of Pout values [Eq. (1)]. This plot describes the variability of the data that reaches maximum values in the northwest and south of the Iberian Peninsula and most of the Pyrenees. Data dispersion achieves its lowest values in the central Pla- teau and along the Mediterranean coast where the fre- quency of extreme events is higher (Linés 1970; Valero 970 VOLUME 14J O U R N A L O F C L I M A T E FIG. 3. (a) Spatial distribution of Pout (mm) values. (b) Spatial distribution of the absolute number of outliers in each station. Contour interval is 100 mm. (c) Time evolution of the monthly absolute num- ber of outliers in the area of interest. Contour interval is 5 units. TABLE 2. Number and type of series involved in each step of the homogenization procedure. Step Input series Test Refer- ence Output series H HC HCC I IC ICC 1 2 3 4 5 6 95 95 53 95 4 10 95 95 95 87 82 85 43 42 — 38 — — — — 45 44 3 — — — — — — 9 52 53 — 4 — — — — 8 9 1 — — — — — — 1 et al. 1997b). This agrees with the distribution of outliers in Fig. 3b that shows a clear maximum in the Mediter- ranean area of influence. The total number of corrected values was 519: 12% of them were produced in spring, 3% in summer, 52% in autumn, and 33% in winter. Figure 3c shows the corresponding monthly distribution, characterized by a maximum in October and a minimum in the summer. The spatial distributions of the number of outliers for each season (not shown here) reveals that maxima are located in the east coast in spring and autumn in a similar fashion as Fig. 3b. In winter they spread over the west and south of the Iberian Peninsula and for the summer months there is no apparent structure but isolated cases in the central plateau. The overall agreement of the areas and timing of out- liers with those of extreme events supports the idea of physical mechanisms as causes for the detected anom- alous data rather than human-induced errors. b. Homogenization For this version of the dataset, homogeneity correc- tions were developed from the annually averaged time series and subsequently applied to monthly data. Month- ly corrections can also be incorporated through aligning the whole series as a sequence of 12n values, n being the number of years or through considering individual monthly series (Alexanderson 1986). In the former case, results support those obtained using annual values, al- though somewhat more significant breaks are obtained. Also, there are cases of dry months giving very uncer- tain ratios that distort the test. This is also the situation when treating individual monthly series. This is relevant when applying the homogenization process described in the last section because it is a non- robust method (distribution dependent). The qi ratios in Eq. (2) are assumed to follow a Gaussian distribution and deviations from it could damage the performance of the test. The problem of very dry months, a frequent one in the southern stations, is reduced when using annual data. After applying a x2 test for the goodness of the fit (Sne- decor and Cochran 1972) of the qi to a normal distri- bution, it was found that for a significance level of 0.1, 88% of the qi series in step 1 [see section 3b(2)] met the Gaussian hypothesis. Since the non-Gaussian 12% is in the range of the significance level, the approxi- mation was made that the set of qi were normally dis- tributed. Table 2 shows the number and type of series involved in each step (column 1) of the homogenization proce- dure. The number of time series tested in each step is shown in column 2 and the number of series making part of the reference group appears in column 3. The quantity of resulting homogeneous (H) or inhomoge- 1 MARCH 2001 971G O N Z Á L E Z - R O U C O E T A L . FIG. 4. (a) Type of time series after applying the homogenization procedure: homogeneous (H), homogeneous after one (HC) or two (HCC) corrections, and inhomogeneous after two corrections (ICC). Numbers in the legend stand for the step of the procedure. Solid symbols indicate agreement with metadata. Isolines indicate the num- ber of reference stations used. Contour interval is 2 units. (b) Cor- relation values between F and G [see Eq. (3)]. Contour interval is 0.1. (c) Distribution of correction factors within the studied period. Crosses stand for inhomogeneities of unknown origin, circles cor- respond to relocations or interpolations of data from distant sites, and triangles to changes in height, the histogram of inhomogeneity dates both for the detected changes (black bars) and those registered in the metadata (white bars). neous (I) time series are indicated in columns 4–9, either after one (HC, IC) or after two (HCC, ICC) corrections. In the two first steps almost the same results are ob- tained. The most relevant changes are shifts in the clas- sification of some series that switch from homogeneous in step 1 (H1) to inhomogeneous in step 2 (I2; stations 42, 51, 72, and 90) or vice versa (stations 13, 23, 34, 79, and 94). In the third stage the 53 inhomogeneous series obtained in step 2 are corrected and tested pro- ducing a subset of 45 homogeneous after one correction (HC3) and 8 inhomogeneous (IC3) series. Following the methodology previously described, in the fourth step the subset of 42 homogeneous series in step 2 (H2) was tested and provided 38 purely homo- geneous series without any corrections (H4) and 4 series that contain at least one inhomogeneity (I4). The 53 I2 series were corrected and tested again and provided 44 series homogeneous after one correction (HC4) and 9 series with at least two inhomogeneities (IC4). During the fifth stage of the method the 4 I4-type series obtained in step 4 were corrected and tested. The output was a set of 3 homogeneous series (HC5) and 1 with at least 2 inhomogeneities (IC5). In the last step, the procedure was applied to the 10 series with more than one inhomogeneity (IC4 and IC5) being 9 of them successfully corrected (HCC6) and only one (station 69, Alange), which showed more than two inhomogeneities (ICC6). This last station was rejected from the dataset. Here 40% of the series were homo- geneous, an additional 49.5% became homogeneous af- ter one correction, and 9.5% more after two corrections. Figure 4a describes the type (symbols) of inhomo- geneity of each time series. It can be appreciated that there is no evident spatial structure in the distribution of inhomogeneities. The same figure shows the number of reference stations used for the analysis of each series (solid lines). This number was imposed to be always between 5 and 10 and was chosen as a function of the correlation (always higher than 0.5) between the test station and any candidate reference station. There is a clear structure, with the highest values in the west of the Iberian Peninsula and the south of France where correlations between pairs of time series are also higher than in any other area. This also corresponds with the areas of highest correlation values between the F and the G series [Eq. (3)] that are shown in Fig. 4b. The high correlations between F and G also support our criterion for selecting reference stations. Figure 4c shows the temporal distribution of correc- tion factors. Inhomogeneities that are not supported by metadata are identified with a cross sign and those that agree with the history of the station with a circle or a triangle corresponding to a relocation or a change in height, respectively. The histogram shows the distri- bution of the changes during the century, both for the detected inhomogeneities (black bars) and for the chang- es registered in the metadata (white bars). More complete history files of stations would be de- 972 VOLUME 14J O U R N A L O F C L I M A T E sirable to discuss the potential influence of changes in instrumentation, observer, and other factors upon the quality of the dataset. Unfortunately, only relocation and changes in the height of a few stations could be related to identified inhomogeneities (30% of the total). The cases in which data from external sites were used to fill gaps in the time series, both at the early stages of this work and by the original sources (see section 2) have also been considered as a type of relocation in the com- parison of inhomogeneities and metadata. Relocation can lead to inhomogeneities as a result of the difference in annual precipitation in both the old and the new po- sition as well as differences in sheltering conditions. In 70% of the inhomogeneities identified as relocations an increase of catchment (factor . 1) has been produced. However, the mean of all correction factors in Fig. 4c does not depart significantly from one, therefore, there is no bias in the occurrence of inhomogeneities either to increase or decrease precipitation means on the whole dataset. This is due to the fact that unknown changes (crosses in Fig. 4c) have led to a decrease in catchment (factor , 1) in 66% of the cases. Nevertheless, changes in specific sites have played a decisive role as will be shown later. The three changes in height (triangles) seem to agree somehow in the type of inhomogeneity. The two first ones occur in La Coruña (1916) and San Sebastián, Spain (1918) and reveal an increase in catchment related to a new position of the pluviometer in a lower and probably more sheltered location. The third one (Gi- braltar in 1935) corresponds to a change of the pluvi- ometer to a higher elevation, in principle less sheltered, and thus favoring a decrease in catchment. Some of the inhomogeneities lie on the 1.0 value line. All of them correspond to cases in which two inho- mogeneities were detected, but only the data between them had to be corrected, thus in such cases the first inhomogeneity is identified in Fig. 4c with value 1.0. Finally it is also worth pointing out that 80% of the inhomogeneities occur before 1950. Such a bias is also present in the histogram of the medatata (white bars). Though the metadata archives are incomplete they can be taken as an indication that there have been fewer changes in the more recent decades. This supports the smaller amount of inhomogeneities after 1950. Figure 5 gives some typical examples of the appli- cation of the Standard Normal Homogeneity Test to several types of time series showing values for the T statistic [Eq. (4)] and the series of ratios qi [Eq. (2)]. Figure 5a displays results for Murcia, Spain (station number 74), classified as homogeneous in the fourth step of the procedure (H4) and shows no significant changes in the T statistic; there is a secondary maximum in 1955 that seems to agree with a change in the position of the rain gauge registered in the metadata. The series of ratios, q, does not show apparent displacements from the mean value. Palma de Mallorca, Spain (Fig. 5b), is an example of homogeneous after one correction series (HC4). In step 2 of the procedure (dots), the test reveals a wide T maximum centered in 1940, possibly related to a change in the position of the rain gauge in 1937. This maximum disappears after the correction factor is applied to the values before 1937. The q values also reveal this inhomogeneity: noncorrected values in dots cluster above (below) 1 for most years before (after) the change while for the corrected ones (continuous line) this steplike behavior is reduced. Another example (Fig. 5c) of homogeneous series after applying one correction is La Coruña (number 19). In this case the test statistic shows several significant maxima. The most important one takes place in 1915 and agrees with a change in the height of the rain gauge; the second peak occurs in 1942 and does not match with any entry in the historical records. The first peak dis- appears and the second one is not significant after ap- plying the correction factor to the data before 1915. The q series shows some displacement from the mean value before 1915 (dots in the oval), which also disappears after the correction is applied. Hervás (number 56), one of the series reconstructed for the dataset, is yet another example of HC4-type series. The T statistic shows (Fig. 5d) a low-amplitude maximum in 1983 not supported by the available metadata that drops out in step 4 of the procedure. Bilbao, Spain (number 20), described by Fig. 5e is an example of a series that became homogeneous after two corrections. The values of the T statistic in the second step (hollow dots) show a maximum centered in 1946 that agrees with a reference in the metadata in 1945. After the correction of this inhomogeneity, a sec- ond maximum rises over the significance level in the fourth step (solid dots). This peak agrees with a change in the position of the rain gauge in 1920. In the sixth step both inhomogeneities are corrected and the T sta- tistic shows no important peaks. The q values also show these changes: it can be noticed how noncorrected data (dots) depart more strongly from unity than the cor- rected ones (solid line), particularly during the period between both inhomogeneities (highlighted with the oval). Finally, results for Alange, Spain (number 69), are shown also in Fig. 5f. This is the only station which remained inhomogeneous after two corrections (ICC). The T values exhibit significant maxima in years 1934, 1917, and 1955 corresponding to the steps 2, 4, and 6 of the procedure, respectively. The series of q ratios show an appreciable displacement from the mean value only in step 2 (hollow dots in the oval). The inhomo- geneity in 1917 is supported by only a single peak value and the maximum in step 6 hardly reaches the T95 sig- nificance level. This accounts for the low magnitude of these inhomogeneities and their lack of evidence in the series of ratios for steps 4 and 6. The three inhomo- geneities are probably related to data interpolations from the secondary sites. The correction factors deduced from the analysis of 1 MARCH 2001 973G O N Z Á L E Z - R O U C O E T A L . FIG. 5. Values of T and q for (a) Murcia, (b) Palma de Mallorca, (c) La Coruña, (d) Hervás, (e) Bilbao, and (f ) Alange. Dates of metada- ta are indicated with arrows in the horizontal axis. Here q data before the inhomogeneities are highlighted in the oval. 974 VOLUME 14J O U R N A L O F C L I M A T E FIG. 6. Temporal distribution of the annual percentage of missing data. FIG. 7. Trends and outputs of a centered 5-yr moving average for monthly precipitation values in La Coruña before (dashed line) and after (solid line) the homogeneity correction. the annual data were applied to the monthly data and interpolation for missing values was also accomplished following Eqs. (6) and (7). Figure 6 shows the temporal distribution of missing values in the dataset. The highest percentages are reached at the beginning of the century due to the lower amount of stations at this time. There is a secondary maximum in the 1970s and 1980s due to the shut down of measurements in some stations. A small maximum close to 1940 related to the Spanish Civil War (1936–39) can also be appreciated. After the interpolation was accomplished Camporredondo, Spain (number 25), and Algiers, North Africa (number 86), were eliminated from the dataset because of the long data gaps present and neighboring stations that can de- scribe the local climate features were available in both cases. c. Trends This section provides a description of some effects of the adjustments. The effects of outliers adjustments on trends and on the homogenization procedure are de- scribed as well as the effects of homogenization cor- rections on trends. A more complete description of trends, including a discussion of connections to general circulation changes, is beyond the scope of the present work. Some results with the same dataset are presented by González-Rouco et al. (2000). Trends are obtained through the calculation of least square linear regressions of precipitation data against time. A significance t test (Snedecor and Cochran 1972) for constant precipitation is applied. The effective sam- ple size and autocorrelation in the residuals are also taken into account. As precipitation time series often show high autocorrelation, values are not independent of each other and the effective number of data has also been used in the testing of significance (Trenberth 1984; Zwiers and von Storch 1995). Also, residuals resulting from the least square regression are usually assumed to be white noise, but this is not often the case. When residuals show autocorrelation both the slope and its error are badly estimated. We have used a Durbin–Wat- son method (Durbin and Watson 1971; Valero et al. 1996a) to account for the existence of autocorrelation in the residuals. An AR(1) model has been assumed for the errors of the least square regression whenever sig- nificant autocorrelations were detected. Figure 7 shows the effect of the homogeneity cor- rection for the time series from La Coruña. The solid lines correspond to a low-pass filter output and a linear fit to the corrected data; the dashed line is similarly obtained from the original series. The linear fit to non- corrected data shows a significant (a 5 0.05) slope of 3.6 mm decade21 that is reduced to a nonsignificant value of 0.9 mm decade21 after the correction. Figure 8 shows trends for the dataset in different stag- es: for the original data, without corrections (Fig. 8a), after applying outliers corrections (Fig. 8b), after ap- plying outliers and homogeneity corrections (Fig. 8c), and after interpolating data in the homogenized dataset for the period 1899–1989. In Figs. 8a–c results for each time series correspond only to the period of existence of data within the time interval 1899–1989 (i.e., Gre- noble, France: 1899–1976, see Table 1), therefore, in- terpreting spatial structures from these plots should be done with caution since the time span of the time series can change from site to site. In the noncorrected data (Fig. 8a) 30% of the trends are significant. In general, positive trends seem to dom- inate, particularly in the north of the Iberian Peninsula. Figure 8b shows trends after the corrections of outliers were applied. Nearly the same pattern is obtained, ex- cept in some specific cases where changes in the mag- nitude and significance of slopes arise: Perpignan, France (number 30), Palencia (39) and Leon, Spain (32), as the only change in sign, and Castellón, Spain (50). In order to check the influence of outliers corrections in the homogenization procedure, step 1 in Fig. 2 was rerun with the original data without outliers corrections. The test results were different in eight cases: stations 1 MARCH 2001 975G O N Z Á L E Z - R O U C O E T A L . FIG. 8. Trends in the (a) original dataset, (b) after correcting outliers, (c) after correcting outliers and inhomogeneities. Annual means for the periods of existence of data in each station have been used for the calculations. (d) Trends in the corrected dataset after interpolating missing data for the period 1899–1989 (inputs were also annual means). A solid line for value zero is plotted for easy visualization of different regions. 38, 43, 47, 71, and 84 switched from inhomogeneous to homogeneous and 72, 87, and 94 from homogeneous to inhomogeneous. In all these cases T values were very close to the significance level. If outliers were not trimmed the series were pushed to the significance or nonsignificance side depending on the specific charac- teristics of the time series. Outliers cause an increase in the mean of the period in which they occur. If this rise in mean sharpens (shortens) the differences between the period before and after a given change, it will in- crease (decrease) the T values and the significance of the change. The authors consider that it is safer to take decisions upon the quality of the time series that depend mostly on its overall characteristics rather than on the extreme behavior of a few data. This is also valid for the calculation of trends, since we are interested in re- sults that describe the long-term behavior of the time series and are not so sensitive to individual data. There- fore we think that the use of trimmed data is justified. Figure 8c shows the trends after the data have been corrected for outliers and inhomogeneities. If we com- pare it to Fig. 8b we can describe the changes as dra- matic. Significance is reached in only eight sites and the picture is more uniform. In most sites the value of trends is negligible and there are almost no negative trends left; positive trends in the north of the Iberian Peninsula still remain. Figure 8d shows trends of the corrected and inter- polated data for the period 1899–1989. A solid zero line is plotted for easier visualization of regions of different sign. Though significance is reached only at a few sta- tions, a highly coherent spatial behavior arises in this 976 VOLUME 14J O U R N A L O F C L I M A T E FIG. 9. Trends in the final dataset for (a) winter, (b) spring, (c) summer, and (d) fall. (e) Winter trends for the period 1920–89. Monthly data for each season were used as inputs. Full symbols highlight significant trends (95% significance level). A solid line for value zero is plotted for easy visualization of different regions. plot. All stations close to the Atlantic coast show pos- itive trends as do most French stations. On the east coast of the Iberian Peninsula, positive values also arise. The Pyrenees, the French Mediterranean coast, north of Af- rica, and quite a few stations in the middle and south of Spain show negative values. These annual trends result from the cumulative effects of seasonal trends. Some insight into the latter is ob- tained from Fig. 9. Slopes have been obtained using monthly anomalies for each season (winter: December– February; spring: March–May; summer: June–August; autumn: September–November). Anomalies were cal- culated by subtracting the long-term monthly means to filter out the mean annual cycle. Trends in winter (Fig. 9a) show positive values over nearly all the area except for the north of Africa. Sig- nificance is reached in 40% of the trends. In spring (Fig. 9b) the positive trends lose significance in the northern half of the Iberian Peninsula and give way to negative trends in the southern half. In the summer (Fig. 9c) the pattern alternates areas of positive and negative values. In autumn (Fig. 9d), negative, often significant, precip- itation trends are obtained nearly everywhere except for some sites on the east coast and in the northwest. The decrease of precipitation in autumn and spring and in- crease in winter indicates strengthening of the annual cycle. These results agree with those of other authors that report a significant downward trend during this cen- tury, both in the intensity of the zonal circulation in the North Atlantic and in the amplitude of its annual cycle, 1 MARCH 2001 977G O N Z Á L E Z - R O U C O E T A L . these being ascribed by changes in the North Atlantic oscillation (NAO) index (Lamb and Peppler 1987; Hur- rell 1995). When the NAO index annual amplitude is less than normal, that is, when smaller autumn–winter and/or greater NAO for summer is observed (Makro- giannis et al. 1991) the amplitude of the annual wave in precipitation in the Iberian Peninsula increases (Val- ero et al. 1996b). It is worth remarking on the possible influence of missing data on trends. Figure 6b shows that changes in the early part of the century can have a considerable influence on trends. According to Fig. 6a, 40% of the data are missing during the first decade of the century and are interpolated using information from the existing 60%. There is a possibility that this interpolation biases the magnitude and sign of the trends toward spatial con- tinuity. This has been explored through calculating trends for the period 1920–89, thus avoiding the prob- lem of long missing data gaps in some stations. Figure 9e shows the result for the wintertime. Though the mag- nitude of trends is not so high as in Fig. 9a, regional agreement is still the general feature. The magnitude of trends is smaller than in Fig. 9a, but this could also be due to natural variability. In addition to the interpolation method we suggest that the homogenization procedure could influence some regional agreement. The funda- mental hypothesis in the Standard Normal Homogeneity Test states that the ratios between a test station and some regional precipitation average should be constant. It is possible that this assumption leads to removing distinct local trends in stations imbedded in microclimates and where changes in long-term trends are not necessarily related to homogeneity problems. 5. Discussion and conclusions Outliers and inhomogeneities have been taken into account as potential quality problems in a dataset of 95 precipitation time series in southwestern Europe. Outliers adjustments have been carried out by trim- ming very extreme data with the aim of reducing large distribution tails as a means of preprocessing previous to homogeneity corrections. There are, however, caveats to the use of this subjective approach since it involves dismissing interesting climatological information about extreme events and their temporal and spatial distri- bution. Nevertheless, it is worth noting that this does not mean a ‘‘point of no return’’ since the original value of the extreme data can be returned to the dataset at any time, also incorporating the homogeneity corrections. The outliers adjustments applied herein lead to chang- es of minor importance in trend assessment as shown in the previous section. Concerning the impact of out- liers adjustments in the homogeneity test, this becomes evident in cases were the T values are very close to the significance level. In such situations the effect of a few extreme values can lead to significance or not signifi- cance of an inhomogeneity, depending on the case. Though it might seem contradictory to support correc- tions that have such a limited effect, it is the authors’ opinion that the use of trimmed data is legitimate be- cause decisions concerning trends and especially ho- mogeneity are then taken with regard to the long-term behavior of the series rather than the marginal extreme properties of a few values. Analysis of the homogeneity of the time series found 40% were homogeneous, an additional 49.5% were ho- mogeneous after applying one correction factor and 9.5% more after applying two correction factors. In 30% of the cases the inhomogeneities could be linked to some historical change in the rain gauge, in spite of the me- tadata being scarce. Examples have been presented that provide evidence that precipitation records from the area of study show severe homogeneity problems. There is considerable influence of homogeneity corrections in the estimation of trends, emphasizing that the results of any long-term variability study from this area are very dependent upon the use of homogenized or nonhomo- genized data. Trends for the corrected data present great regional agreement. Winter values show positive and usually sig- nificant trends over most of the study area. Significant negative trends show up mainly in autumn. These trends seem to agree with those described in the literature for the zonal circulation in the North Atlantic (Makrogian- nis et al. 1991; Valero et al. 1996b). A brief description of trends in relationship to the outliers and homogeneity adjustments made on the dataset has been provided. A deeper insight into long-term trends and their relation- ship to general circulation is beyond the scope of this work. The Standard Normal Homogeneity Test is based on the hypothesis that the ratios between the test-station values and some regional precipitation average should be constant. It is possible that this underlying assump- tion leads to removing distinct local trends in stations embedded in microclimates and not necessarily related to inhomogeneity problems. It should also be pointed that the output of such a quality control procedure is not expected to be an error-free dataset, but is one where the quality of the time series has been significantly im- proved. On the other hand, different methodologies can lead to partially different results, since not only the error associated with any homogeneity test, but also the var- ious methods and criteria that can be chosen for the selection of reference stations add uncertainty. Different countries and institutions have chosen methodologies that specifically suit their purposes (Peterson et al. 1998b). In this sense, work intercomparing different methods is desired and still to come. Acknowledgments. This work was supported by Pro- ject Number CLI97-0558, Comunidad de Madrid, and Fundación Ramón Areces. The authors also thank P. Brandt, P. Cuesta, M. Widmann, and E. Zorita for their comments on the original version of the manuscript. We 978 VOLUME 14J O U R N A L O F C L I M A T E thank also F. Zwiers and the two anonymous reviewers for their valuable comments and suggestions. REFERENCES Alexanderson, H. A., 1986: A homogeneity test applied to precipi- tation data. J. Climatol., 6, 661–675. Almarza, C., J. A. López, and C. Flores, 1996: Homogeneidad y variabilidad de los registros históricos de precipitación de Es- paña. Tech. Rep. A-143, Instituto Nacional de Meteorologı́a, Madrid, Spain, 318 pp. [Available from Instituto Nacional de Meteorologı́a, Madrid, Spain.] Barnett, V., and T. Lewis, 1994: Outliers in Statistical Data. 3d ed. J. Wiley and Sons, 604 pp. Buishand, T. A., 1982: Some methods for testing the homogeneity of rainfall records. J. Hydrol., 58, 11–27. Conrad, V., and C. Pollak, 1950: Methods in Climatology. 2d ed. Harvard University Press, 459 pp. Durbin, J., and G. S. Watson, 1971: Testing for serial correlation in least squares regression, III. Biometrika, 37, 1–19. Eischeid, J., C. B. Baker, T. Karl, and H. F. Dı́az, 1995: The quality control of long-term climatological data using objective data analysis. J. Appl. Meteor., 34, 2787–2795. Esteban-Parra, M. J., F. S. Rodrigo, and J. Castro-Diez, 1998: Spatial and temporal patterns of precipitation in Spain for the period 1880–1992. Int. J. Climatol., 18, 1557–1574. Fernández-Mills, G., 1995: Principal component analysis of precip- itation and rainfall regionalization in Spain. Theor. Appl. Cli- matol., 50, 117–131. Filippov, V. V., 1968: Quality control procedures for meteorológical data. Tech. Rep. 26, WMO, Geneva, Switzerland, 38 pp. Garrido, J., J. A. Garcia, and V. L. Mateos, 1996: Homogeneidad y variabilidad natural de series largas de precipitación. An. Fı́s. 92, 19–31. González-Rouco, J. F., H. Heyen, E. Zorita, and F. Valero, 2000: Agreement between observed rainfall trends and climate change simulations in the southwest of Europe. J. Climate, 13, 3057– 3065. Grant, E. L., and R. S. Leavenworth, 1972: Statistical Quality Con- trol. McGraw-Hill, 764 pp. Hanssen-Bauer, I., and J. Førland, 1994: Homogenizing long Nor- wegian precipitation series. J. Climate, 7, 1001–1013. Hawkins, P. M., 1977: Testing a sequence of observations for a shift in location. J. Amer. Stat. Assoc., 72, 180–186. Hegerl, G. C., H. von Storch, K. Hasselmann, B. D. Santer, U. Cu- basch, and P. D. Jones, 1996: Detecting anthropogenic climate change with an optimal fingerprint method. J. Climate, 9, 2281– 2306. Houghton, J., B. A. Callendar, and S. K. Varney, Eds., 1992: Climate Change: The Supplemntary Report to the IPCC Scientific Ass- esment. Cambridge University Press, 205 pp. , L. G. Meira Filho, B. A. Callander, N. Harris, A. Kattenberg, and K. Maskell, Eds., 1995: Climate Change 1995: The Science of Climate Change. Cambridge University Press, 572 pp. Hurrell, J. W., 1995: Decadal trends in the North Atlantic Oscillation: Regional temperatures and precipitation. Science, 269, 676–679. Jones, P., and Coauthors, 1985: A grid point surface air temperature data set for the northern hemisphere. Tech. Rep. TRO22, U.S. Department of Energy, Carbon Dioxide Research Division, Washington, DC, 251 pp. Karl, T. R., and C. N. Williams, 1987: An approach to adjusting climatological time series for discontinuous inhomogeneities. J. Climate Appl. Meteor., 26, 1744–1763. Lamb, P. J., and R. Peppler, 1987: North Atlantic Oscillation: Concept and application. Bull. Amer. Meteor. Soc., 68, 1218–1225. Lanzante, J. R., 1996: Resistant, robust and non-parametric tech- niques for the analysis of climate data: Theory and examples, including applications to historical radiosonde station data. Int. J. Climatol., 16, 1197–1226. Linés, A., 1970: The Climate of the Iberian Peninsula. Climates of Northern and Western Europe, C. Wallen, Ed., World Survey of Climatology, Vol. 5, Elsevier, 195–239. Maheras, P., 1988: Changes in precipitation conditions in the western Mediterranean over the last century. J. Climatol., 8, 179–189. Makrogiannis, T. J., H. S. Sahsamanoglou, A. A. Flocas, and A. Bloutsos, 1991: Analysis of the monthly zonal index values and longterm changes in the North Atlantic and Europe. Int. J. Cli- matol., 11, 493–503. Mitchell, J. M., B. Dzerdzeevski, H. Flohn, W. L. Hofmeyr, H. H. Lamb, K. N. Rao, and C. C. Wallen, 1966: Climate change. Tech. Rep. 79, WMO, Geneva, Switzerland, 79 pp. Peterson, T. C., R. Vose, R. Schmoyer, and V. Razuvaev, 1998a: Global Historical Climatology Network (GHCN) quality control of monthly temperature data. Int. J. Climatol, 18, 1169–1179. , and Coauthors, 1998b: Homogeneity adjustments of in situ atmospheric climate data: A review. Int. J. Climatol., 18, 1493– 1517. Rhoades, D. A., and M. J. Salinger, 1993: Adjustment of temperature and rainfall records for site changes. Int. J. Climatol., 13, 899– 913. Rodriguez-Puebla, C., A. H. Encinas, S. Nieto, and J. Garmendia, 1998: Spatial and temporal patterns of annual precipitation var- ibility over the Iberian Peninsula. Int. J. Climatol., 18, 299–316. Santer, B. D., K. E. Taylor, T. M. Wigley, J. E. Penner, P. D. Jones, and U. Cubash, 1995: Towards the detection and attribution of an anthropogenic effect on climate. Climate Dyn., 12, 77–100. Serrano, A., J. A. Garcı́a, V. L. Mateos, M. L. Cancillo, and J. Garrido, 1999: Monthly modes of variation of precipitation over the Ibe- rian peninsula. J. Climate, 12, 2894–2919. Snedecor, G. W., and W. G. Cochran, 1972: Statistical Methods. 6th ed. Iowa University Press, 593 pp. Trenberth, K. E., 1984: Some effects of finite sample size and per- sistence on meteorological statistics. Part I: Autocorrelations. Mon. Wea. Rev., 112, 2359–2368. , and D. A. Paolino, 1980: The Northern Hemisphere sea-level pressure data set: Trends, errors and discontinuities. Mon. Wea. Rev., 108, 856–872. Valero, F., J. A. Garcı́a-Miguel, M. L. Martin, and Y. Luna, 1993: Detecting long-term variations in precipitation in the souteastern Iberian Peninsula. Ann. Geophys., 11, 837–845. , J. F. González, F. J. Doblas, and J. A. Garcı́a-Miguel, 1996a: A method for the reconstruction and temporal extension of cli- matological time series. Int. J. Climatol., 16, 213–227. , F. J. Doblas, J. F. González, and J. A. Garcı́a-Miguel, 1996b: On long-term evolution of seasonal precipitation in southwestern Europe. Ann. Geophys., 14, 976–985. , R. M. Rodrı́guez, and J. A. Garcı́a-Miguel, 1997a: Filling data algorithms in urban air pollution monitoring networks. J. En- viron. Eng. Div. ASCE, 103, 395–403. , M. Y. Luna, and M. Martin, 1997b: An overwiew of a heavy rain event in Southeastern Iberia: The role of large-scale me- teorological conditions. Ann. Geophys., 15, 494–502. Zorita, E., V. Kharin, and H. von Storch, 1992: The atmospheric circulation and sea surface temperature in the North Atlantic area in winter: Their interaction and relevance for Iberian pre- cipitation. J. Climate, 5, 1097–1108. Zwiers, F. W., and H. von Storch, 1995: Taking serieal correlations into account in tests of the mean. J. Climate, 8, 336–351.