PROCESSING OCEANOGRAPHIC DATA BY PYTHON LIBRARIES NUMPY, SCIPY AND PANDAS
Abstract
The study area is located in western Pacific Ocean, Mariana Trench. The aim of the data analysis is to analyze the potential influence of how various geological and tectonic factors may affect the geomorphological shape of the Mariana Trench. Statistical analysis of the data set in marine geology and oceanography requires an adequate strategy on big data processing. In this context, current research proposes a combination of the Python-based methodology that couples GIS geospatial data analysis. The Quantum GIS part of the methodology produces an optimized representative sampling dataset consisting of 25 cross-section profiles having in total 12,590 bathymetric observation points. The sampling of the geospatial dataset are located across the Mariana Trench. The second part of the methodology consists of statistical data processing by means of high-level programming language Python. Current research uses libraries Pandas, NumPy and SciPy. The data processing also involves the subsampling of two auxiliary masked data frames from the initial large data set that only consists of the target variables: sediment thickness, slope angle degrees and bathymetric observation points across four tectonic plates: Pacific, Philippine, Mariana, and Caroline. Finally, the data were analyzed by several approaches: 1) Kernel Density Estimation (KDE) for analysis of the probability of data distribution; 2) stacked area chart for visualization of the data range across various segments of the trench; 3) spacial series of radar charts; 4) stacked bar plots showing the data distribution by tectonic plates; 5) stacked bar charts for correlation of sediment thickness by profiles, versus distance from the igneous volcanic areas; 6) circular pie plots visualizing data distribution by 25 profiles; 7) scatterplot matrices for correlation analysis between marine geologic variables. The results presented a distinct correlation between the geologic, tectonic and oceanographic variables. Six Python codes are provided in full for repeatability of this research.
Keywords
References
- Beazley, D.M. (2009). Python essential reference. Addison-Wesley Professional. Available at www.python.org (accessed 18.03.2019)
- Bogdanov, I., Huaman, D., Thovert, J.-F, Pierre, G., Adler, P.M. (2011). Tectonic stresses seaward of an aseismic ridge Trench collision zone. A remote sensing approach on the Loyalty Islands, SW Pacific. Tectonophysics, 499, 77-91.
- Hunter, J.D. (2007). Matplotlib: A 2D graphics environment. Computing in Science & Engineering, 9(3), 90-95. Bogolepov, K. V., Chikov, B. M. Geologiya dna okeanov (Geology of the ocean floor). Russian. Ed. by Saks, V. N., Fotiadi, E. E. Moscow: Nauka, 1976, 246.
- Brune, S. (2016). Plate Boundaries and Natural Hazards. AGU Geophysical Monograph 219. ed. by Duarte, J. C. & Schellart, W. P. Sydney, Australia: AGU. Chap. Rifts and rifted margins: A review of geodynamic processes and natural hazards, 1-21.
- Jones, E., Oliphant, T., Peterson, P. (2014). SciPy: open source scientific tools for Python. Available at www.scipy.org (accessed 23.03.2019)
- NumPy community (2019). NumPy Reference. Release 1.16.1. 1372 p.
- Contreras-Reyes, E., Carrizo, D. (2011). Control of high oceanic features and subduction channel on earthquake ruptures along the Chile-Peru subduction zone. Physics of the Earth and Planetary Interiors 186, 49-58.
- Crameri, F. (2018). Geodynamic diagnostics, scientific visualisation and StagLab 3.0. Geoscientific Model Development, 11, 2541-2562.
Details
Primary Language
English
Subjects
Hydrobiology , Maritime Engineering
Journal Section
Research Article
Authors
Publication Date
April 1, 2019
Submission Date
March 24, 2019
Acceptance Date
April 8, 2019
Published in Issue
Year 1970 Volume: 2 Number: 2