|
ABSTRACT
To analyze the effect of the oceans and atmosphere on land climate, Earth Scientists have developed climate indices, which are time series that summarize the behavior of selected regions of the Earth's oceans and atmosphere. In the past, Earth scientists have used observation and, more recently, eigenvalue analysis techniques, such as principal components analysis (PCA) and singular value decomposition (SVD), to discover climate indices. However, eigenvalue techniques are only useful for finding a few of the strongest signals. Furthermore, they impose a condition that all discovered signals must be orthogonal to each other, making it difficult to attach a physical interpretation to them. This paper presents an alternative clustering-based methodology for the discovery of climate indices that overcomes these limitiations and is based on clusters that represent regions with relatively homogeneous behavior. The centroids of these clusters are time series that summarize the behavior of the ocean or atmosphere in those regions. Some of these centroids correspond to known climate indices and provide a validation of our methodology; other centroids are variants of known indices that may provide better predictive power for some land areas; and still other indices may represent potentially new Earth science phenomena. Finally, we show that cluster based indices generally outperform SVD derived indices, both in terms of area weighted correlation and direct correlation with the known indices.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
L. Ertöz, M. Steinbach, and V. Kumar. Finding topics in collections of documents: A shared nearest neighbor approach. In Proceedings of Text Mine'01, First SIAM International Conference on Data Mining, Chicago, IL, USA, 2001.
|
| |
3
|
L. Ertöz, M. Steinbach, and V. Kumar. A new shared nearest neighbor clustering algorithm and its applications. In Workshop on Clustering High Dimensional Data and its Applications, SIAM Data Mining 2002, Arlington, VA, USA, 2002.
|
| |
4
|
L. Ertöz, M. Steinbach, and V. Kumar. Finding clusters of different sizes, shapes, and densities in noisy, high dimensional data. In Proceedings of Second SIAM International Conference on Data Mining, San Francisco, CA, USA, May 2003.
|
| |
5
|
M. Ester, H.-P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD 1996, pages 226--231, 1996.
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
B. Lindgren. Statistical Theory. CRC Press, January 1993.
|
| |
10
|
C. Potter, S. A. Klooster, and V. Brooks. Inter-annual variability in terrestrial net primary production: Exploration of trends and controls on regional to global scales. Ecosystems, 2(1):36--48, August 1999.
|
| |
11
|
N. Saji, B. Goswami, P. Vinaychandran, and T. Yamagata. A dipole mode in the tropical indian ocean. Nature, 401:360--363, 1999.
|
| |
12
|
P. Smyth, K. Ide, and M. Ghil. Multiple regimes in northern hemisphere height fields via mixture model clustering. Journal of Atmospheric Science, 56:3704--3723, 2000.
|
| |
13
|
M. Steinbach, P.-N. Tan, V. Kumar, S. Klooster, and C. Potter. Temporal data mining for the discovery and analysis of ocean climate indices. In Proceedings of the KDD Temporal Data Mining Workshop, Edmonton, Alberta, Canada, August 2002.
|
| |
14
|
M. Steinbach, P.-N. Tan, V. Kumar, C. Potter, and S. Klooster. Data mining for the discovery of ocean climate indices. In Mining Scientific Datasets Workshop, 2nd Annual SIAM International Conference on Data Mining, April 2002.
|
| |
15
|
M. Steinbach, P.-N. Tan, V. Kumar, C. Potter, S. Klooster, and A. Torregrosa. Clustering earth science data: Goals, issues and results. In Proceedings of the Fourth KDD Workshop on Mining Scientific Datasets, San Francisco, California, USA, August 2001.
|
| |
16
|
H. V. Storch and F. W. Zwiers. Statistical Analysis in Climate Research. Cambridge University Press, July 1999.
|
| |
17
|
P.-N. Tan, M. Steinbach, V. Kumar, S. Klooster, C. Potter, and A. Torregrosa. Finding spatio-termporal patterns in earth science data. In KDD Temporal Data Mining Workshop, San Francisco, California, USA, August 2001.
|
| |
18
|
G. H. Taylor. Impacts of the el nino/southern oscillation on the pacific northwest. Technical report, Oregon State University, Corvallis, Oregon, 1998.
|
| |
19
|
J. C. Tilton. Image segmentation by region growing and spectral clustering with a natural convergence criterion. In Proc. of the 1998 International Geoscience and Remote Sensing Symposium (IGARSS'98), Seattle, WA, 1998.
|
| |
20
|
N. Vivoy. Automatic classification of time series (acts): a new clustering method for remote sensing time series. International Journal of Remote Sensing, 2000.
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
|