The protocol extracts information from light curves of exoplanets and constructs their surface maps. It uses light curves of Earth, which serves as a proxy exoplanet, to demonstrate the approach.
Spatially resolving exoplanet features from single-point observations is essential for evaluating the potential habitability of exoplanets. The ultimate goal of this protocol is to determine whether these planetary worlds harbor geological features and/or climate systems. We present a method of extracting information from multi-wavelength single-point light curves and retrieving surface maps. It uses singular value decomposition (SVD) to separate sources that contribute to light curve variations and infer the existence of partially cloudy climate systems. Through analysis of the time series obtained from SVD, physical attributions of principal components (PCs) could be inferred without assumptions of any spectral properties. Combining with viewing geometry, it is feasible to reconstruct surface maps if one of the PCs are found to contain surface information. Degeneracy originated from convolution of the pixel geometry and spectrum information determines the quality of reconstructed surface maps, which requires the introduction of regularization. For the purpose of demonstrating the protocol, multi-wavelength light curves of Earth, which serves as a proxy exoplanet, are analyzed. Comparison between the results and the ground truth is presented to show the performance and limitation of the protocol. This work provides a benchmark for future generalization of exoplanet applications.
Identifying habitable worlds is one of the ultimate goals in astrobiology1. Since the first detection2, more than 4000 exoplanets have been confirmed to date3 with a number of Earth analogs (e.g., TRAPPIST-1e)4. These planets have orbital and planetary properties similar to those of Earth, and therefore are potentially habitable. Evaluating their habitability from limited observations is essential in this context. Based on the knowledge of life on Earth, geological and climate systems are critical to habitability, which can therefore serve as biosignatures. In principle, features of these systems could be observed from a distance even when a planet could not be spatially resolved better than one single point. In this case, identifying geological features and climate systems from single-point light curves is essential when assessing the habitability of exoplanets. Surface mapping of these exoplanets becomes urgent.
Despite the convolution between viewing geometry and spectral features, information of an exoplanet’s surface is contained in its time-resolved single-point light curves, which can be obtained from a distance, and derived with sufficient observations. However, two-dimensional (2D) surface mapping of potentially habitable Earth-like exoplanets is challenging due to the influence of clouds. Methods of retrieving 2D maps have been developed and tested using simulated light curves and known spectra5,6,7,8, but they have not been applied to real observations. Moreover, in the analyses of exoplanet observations now and in the near future, assumptions of characteristic spectra may be controversial when the planetary surface compositions are not well-constrained.
In this paper, we demonstrate a surface mapping technique for Earth-like exoplanets. We use SVD to evaluate and separate information from different sources that is contained in multi-wavelength light curves without assumptions of any specific spectra. Combined with viewing geometry, we present the reconstruction of surface maps using timely resolved but spatially convoluted surface information. For the purpose of demonstrating this method, two-year multi-wavelength single-point observations of Earth obtained by the Deep Space Climate Observatory/Earth Polychromatic Imaging Camera (DSCOVR/EPIC; www.nesdis.noaa.gov/DSCOVR/spacecraft.html) are analyzed. We use Earth as a proxy exoplanet to assess this method because currently available observations of exoplanets are not sufficient. We attach the code with the paper as an example. It is developed under python 3.7 with anaconda and healpy packages, but the mathematics of the protocol can also be done in other programming environments (e.g., IDL or MATLAB).
1. Programming setup
2. Obtaining multi-wavelength light curves and viewing geometry from observations
3. Extract surface information from light curves
4. Construct planetary surface map
5. Estimate uncertainty of the retrieved map
We use multi-wavelength single-point light curves of Earth to demonstrate the protocol, and compare the results with the ground truth to evaluate the quality of surface mapping. Observation used here is obtained by DSCOVR/EPIC, which is a satellite located near the first Lagrangian point (L1) between Earth and Sun taking images at ten wavelengths of the sunlit face of Earth. Two years (2016 and 2017) of observations are used for this demonstration, which are the same as those in Jiang et al. (2018)12 and Fan et al. (2019)13, where more details about the observations are presented. A sample observation at 9:27 UTC, 2017 February 8 is shown in Figure 1. Images of Earth are integrated to single points to simulate light curve observations obtained by Aliens, distant observers, who could not spatially resolve Earth better than one pixel. Therefore, multi-wavelength single-point exoplanet light curves with ~10,000 time steps are generated, which are the input data of this protocol.
Following step 3, we find two dominant PCs in the multi-wavelength light curves, and the second PC (PC2) contains surface information. Derived as step 3.5, time series of PC2 shows more regular morphology with an approximately constant daily variation, and its power spectrum shows stronger diurnal cycle than the first PC (PC1, Figure 2). Therefore, a surface map of this proxy exoplanet is constructed following step 4 (Figure 3a), which consists of the value of PC2 at each pixel. Compared with the ground truth of Earth (Figure 3b), the reconstructed map recovers all major continents, despite some disagreements in the southern hemisphere where clouds partially prevent surface information from being observed. Uncertainty of each pixel value obtained according to step 5 (Figure 3c) is on the order of 10% of that in the retrieved map, suggesting a good quality of the surface mapping and a positive result.
Figure 1: Reflectance images of Earth’s sunlit hemisphere.
The observations are taken by DSCOVR/EPIC at ten wavelengths and at 9:27 UTC, 2017 February 8. Please click here to view a larger version of this figure.
Figure 2: Time series and power spectra of the two dominant PCs.
(a) Time series of PC1. Daily maximum and minimum are denoted by black lines. (b) Power spectrum of PC1’s time series. Annual, semi-annual, diurnal and half-daily cycles are denoted as black dashed line. (c) and (d) are identical to (a) and (b), but correspond to PC2. This figure is taken from Fan et al. (2019)13. Please click here to view a larger version of this figure.
Figure 3: Reconstruction of Earth’s surface.
(a) Surface map of Earth, which serves as a proxy exoplanet, reconstructed from multi-wavelength light curves. Colors in the map are the values of PC2 at each pixel. The contour of median value is denoted as the black line. (b) Ground truth of Earth’s surface map. (c) Uncertainty of the reconstructed map shown in (a). This figure is modified from Fan et al. (2019)13. Please click here to view a larger version of this figure.
Figure 4: Result of finding the optimal regularization parameter.
The optimal value of the regularization parameter λ is 10-3.153 (dashed line) when χ2 of the reconstruction (solid line) reaches its minimum. Please click here to view a larger version of this figure.
Figure 5: Sensitivity test of observation noise.
(a) Correlation coefficient between PC2 and land fraction in the field of view (solid line) derived from observations with different signal to noise ratios (S/Ns). The original correlation from light curves without noise is shown as the dashed line. (b) The importance of each PC on the land fraction derived with different observation S/Ns. The importance is computed using Gradient Boosted Regression Trees (GBRT) models as described in Fan et al. (2019)13. Please click here to view a larger version of this figure.
Supplemental Figure 1: Time series of reflectance of Earth at ten wavelengths. Please click here to download this figure.
Supplemental Figure 2: (a) Time series of latitude of the sub-observer point. (b) Same as (a), but for longitude. (c) and (d) are identical to (a) and (b), but correspond to the sub-stellar point. Please click here to download this figure.
Supplemental Figure 3: Time series of normalized reflectance of Earth at ten wavelengths. Please click here to download this figure.
Supplemental Figure 4: Time series of the ten PCs, columns of U. Please click here to download this figure.
Supplemental Figure 5: Singular values corresponding to each PC, diagonal elements of Σ. Please click here to download this figure.
Supplemental Figure 6: Normalized reflectance spectra of ten PCs, columns of V. Please click here to download this figure.
Supplemental Figure 7: Power density spectra of time series of ten PCs. Please click here to download this figure.
Supplemental Figure 8: Pixelization of retrieving map, filled with random pixel values. Please click here to download this figure.
Supplemental Figure 9: Reconstructed map of Earth using different regularization parameters of (a) 10-2, (b) 10-3, and (c) 10-4. Please click here to download this figure.
Supplemental Figure 10: Covariance matrix of x. Please click here to download this figure.
Supplemental Figure 11: Square root of the diagonal elements of the covariance matrix of x, mapped onto the retrieved surface map. Please click here to download this figure.
Video S1: Pixel weights for observations at each time frame in 2016 and 2017.
Supplemental Files. Please click here to download these files.
One critical requirement of the protocol is the feasibility of extracting surface information from light curves, which depends on the cloud coverage. In step 3.5.1, the relative values of the PCs may be different among exoplanets. In the case of Earth, the first two PCs dominate the light curve variations, and correspond to surface-independent clouds and surface (Fan et al. 2019)13. They have comparable singular values so that the surface information can be separated following steps 3.5.2 and 3.5.3. For a future observation of exoplanet, in extreme cases of either an entirely cloudy or a cloud-free exoplanet, only one dominant PC would appear in the SVD at step 3.3. Spectral analysis is necessary in this case to interpret the meaning of this PC, as compositions of clouds and surface are different. If the dominant PC corresponds to the surface, step 4 and 5 could still be followed; if it corresponds to clouds, a conclusion can be drawn that surface information is blocked by clouds and therefore cannot be extracted using light curves at given wavelengths. In this case, surface mapping is not feasible. A third or even fourth comparable dominant PC may also exist, which can correspond to another layer of clouds or large-scale hydrological processes, and would not invalidate the following steps of the method as long as the surface information is extracted.
Degeneracy resulting from the convolution of geometry and spectrum is the dominant factor that constrains the quality of the retrieved map, as discussed in Cowan & Strait (2013)14 and Fujii et al. (2017)15. As the time series of dominant PCs cover only a small portion of the PC plane, there is always a trade-off between spatial and spectral variations. In other words, the retrieved map (Figure 2a) could not be much improved even with an infinite number of time steps and perfect observations, as long as using light curves at the same wavelengths. We introduce the regularization to partially relieve the degeneracy. The optimal value of the regularization term λ in step 4.4 is determined using observations synthesized by the ground truth, where the observed uj is replaced by weighted and scaled land fractions in the field of view (FOV). To generate the synthetic observation, we use the equation at step 4.3 and replace x with the ground truth land fractions of each pixel, y. y is scaled to the same range with x using the strong linear correlation between PC2 of observation, u2, and the averaged FOV land fraction13. Due to the degeneracy, y cannot be perfectly recovered from the linear regression in step 4.4, so we determine the optimal value of λ by finding the minimum of χ2, squared residual scaled by the variance of each pixel. The latter is estimated by the absolute value of each pixel. This is similar to the L-curve criterion in Kawahara & Fujii (2011)16. In the particular case of this paper where T=9739 and P=3072, the optimal value of λ is 10-3.153 (Figure 4).
Observation noise, another factor that influences the mapping quality, may corrupt the SVD analysis of light curves in practical. We test the robustness of the protocol by introducing different levels of observation noise to the original light curves. They are assumed to contain all noise sources (e.g. sky background, dark current and readout noise), and follow Gaussian distribution. In the original light curves without noise, PC2 shows strong (r2=0.91) linear correlation with the FOV land fraction13, so its time series is used for the surface mapping. With increasing noise level, the correlation between PC2 and surface becomes weaker (Figure 5). The correlation coefficient, r2, becomes below 0.5 when the signal to noise ratio (S/N) is less than 10 (Figure 5a), although the importance of PC2 is still dominant (Figure 5b). We suggest a minimum S/N of 30 for confidently applying the protocol in future generalization. It is worth noting that S/N here is the ratio of exoplanet signal to the observation noise, with the signal of parent star being removed.
Viewing geometry is assumed to be known at step 4.2, as precisely deriving viewing geometry from exoplanet observations is beyond the scope of this work. Besides the orbital elements that can be derived from light curve observations, and rotation period from power density spectra (Figure 2b and 2d), there are only two quantities, summer/winter solstice and obliquity, that are required for surface mapping. Summer/winter solstice usually coincide with extremum of time series of the surface corresponding PC, as long as there exists noticeable asymmetry between northern and southern hemispheres. Obliquity of the exoplanet can be inferred from its influence on the amplitude and frequency of light curves17,18. All these derivations require observations sampling frequency at least higher than that of planetary rotation, which is currently rarely satisfied for exoplanets.
The authors have nothing to disclose.
This work was partly supported by the Jet Propulsion Laboratory, California Institute of Technology, under contract with NASA. YLY acknowledge support by the Virtual Planetary Laboratory at the University of Washington.
Python 3.7 with anaconda and healpy packages | Other programming environments (e.g., IDL or MATLAB) also work. |