We describe a new method for counting fishes, and estimating relative abundance (MaxN) and fish density using rotating stereo-video camera systems. We also demonstrate how to use distance from camera (Z distance) to estimate species-specific detectability.
The use of video camera systems in ecological studies of fish continues to gain traction as a viable, non-extractive method of measuring fish lengths and estimating fish abundance. We developed and implemented a rotating stereo-video camera tool that covers a full 360 degrees of sampling, which maximizes sampling effort compared to stationary camera tools. A variety of studies have detailed the ability of static, stereo-camera systems to obtain highly accurate and precise measurements of fish; the focus here was on the development of methodological approaches to quantify fish density using rotating camera systems. The first approach was to develop a modification of the metric MaxN, which typically is a conservative count of the minimum number of fish observed on a given camera survey. We redefine MaxN to be the maximum number of fish observed in any given rotation of the camera system. When precautions are taken to avoid double counting, this method for MaxN may more accurately reflect true abundance than that obtained from a fixed camera. Secondly, because stereo-video allows fish to be mapped in three-dimensional space, precise estimates of the distance-from-camera can be obtained for each fish. By using the 95% percentile of the observed distance from camera to establish species-specific areas surveyed, we account for differences in detectability among species while avoiding diluting density estimates by using the maximum distance a species was observed. Accounting for this range of detectability is critical to accurately estimate fish abundances. This methodology will facilitate the integration of rotating stereo-video tools in both applied science and management contexts.
Along the U.S. Pacific Coast, many of the species important to commercial and recreational groundfish fisheries (e.g., the rockfish complex (Sebastes spp.) and Lingcod (Ophiodon elongatus)) are strongly associated with high-relief, hard-bottom habitats1,2,3,4,5. Stereo-video drop cameras are an attractive non-extractive tool to use in rocky habitats due to the relative ease and simplicity of operation. A variety of stereo-video camera systems have been developed and deployed in southern-hemisphere, shallow-water ecosystems6,7,8,9,10, and recently, video drop-cameras have gained traction as a management tool for deep water rocky-reef environments along the Pacific Coast11,12,13. We sought to modify these existing stereo-camera designs by using a stereo-video camera system (hereafter referred to as "Lander") to more efficiently characterize fish populations in high-relief seafloors along the central Pacific Coast (see Table of Materials). The Lander used was different than existing video systems because cameras were mounted to a central rotating bar, which allowed for a full 360° of coverage of the seafloor at the drop location14. The Lander completed one full rotation per minute, which allowed us to rapidly characterize the abundance and community composition of an area and achieve the same level of statistical power with fewer Lander deployments. (See Starr (2016)14 for greater detail on the specifics of the Lander configuration). Preliminary tests in the study system suggested that eight rotations of the cameras in our surveys were sufficient to characterize species abundance and richness. This determination was made by an observation of diminishing returns in species abundance and fish density over longer drops. We recommend that a pilot study including longer soak times be conducted in any new system to determine the optimal soak time for a given ecosystem/study species.
By using paired stereo cameras, both total survey area and absolute fish density can be calculated for each video survey; however, the use of rotating cameras necessitated the modification of traditional fish count metrics. Stationary video systems most often use "MaxN" as a conservative count of fishes on a deployment6,10. Traditional MaxN describes the maximum number of fish of a given species observed together in a single video frame, in order to avoid double counting a fish that has left and returned to frame. MaxN has therefore been an estimate of the minimum number of fish known to be present and may underestimate true fish abundance6,10. The MaxN metric was redefined to represent the greatest number of fish seen in each full rotation of the cameras.
The second modification to previous stereo video methods was to account for the fact that species of various sizes, color, and shapes have different maximum distances of reliable identification. For example, large species such as O. elongatus have a distinct elongated shape and can reliably be identified at much greater distances compared with small and cryptic species such as the Squarespot Rockfish (Sebastes hopkinsi). These different maximum ranges of detectability change the effective area sampled by the Lander for each species. Because the stereo cameras allow us to place every fish in three-dimensional space with a high degree of accuracy, one can determine the distance from the cameras that each fish was measured (i.e., the "Z distance", named for the "z-axis" which is perpendicular to the straight line drawn between the cameras). For each species, the distance within which 95% of all individuals were observed (hereafter "95% Z distance") was considered to be the radius of the survey area, and was used to calculate the total area surveyed. In addition to species-specific characteristics, identifiability will be impacted by environmental conditions such as water turbidity. Because these factors can vary in time and space, it is important to use the 95% Z statistic only in aggregate. While it will be highly accurate for large samples, any one individual survey may vary in area surveyed.
The protocol detailed below provides guidance on how to create and use these metrics. Though the focus was to characterize deep-water rocky habitat along the Pacific Coast, the methodology described for modified MaxN count is readily applicable to any rotating drop-camera system. The number of camera rotations needed to characterize fish populations will depend on local ecosystem dynamics, but the conceptualization of the modified MaxN will remain the same. Similarly, whereas we used 3D photogrammetric software to analyze stereo video, the techniques described herein are easily applied across software platforms, as long as the precise location of fish in three-dimensional space is possible. Additionally, the approach of applying a 95% Z distance value could be considered in future studies with stereo-cameras to account for species-specific ranges of detectability and to more accurately calculate fish abundance.
NOTE: Screenshots of software steps are included as Supplementary Files. Please note that the software steps described below are specific to the chosen software (see the Table of Materials). The overall approach can be extended to any stereo software platform.
1. Prepare Stereo-camera Footage for Analysis
NOTE: Calibration using a calibration cube is recommended. A calibration cube is a three-dimensional aluminum-frame with precisely positioned reflective dots on the surface. When used in conjunction with calibration software, a calibration cube leads to greater precision and accuracy than checkerboard approaches9.
2. Generate Point Counts and Calculate MaxN
NOTE: Each fish is initially marked with a 2D point to the lowest possible taxonomic resolution. Fish with uncertain ID should be marked for later review.
3. 95% Z distance procedure for species-specific survey areas
NOTE: The 95% Z distance is an estimate of the average distance a species could reliably be identified in a given study while excluding cases of exceptional conditions of water clarity or lighting. This calculation takes into account the average oceanographic conditions for a given study and will need to be re-calculated for each new study.
Table 1: Example MaxN summary table. The selection of MaxN for each species is demonstrated with red and bold text. Note that a conservative MaxN for unidentified Sebastes spp. was determined by the rotation with the most Sebastes identified to species (rotation 3). Also, while this study used eight camera rotations, only four rotations are displayed in Table 1 for simplicity. The process for selecting MaxN is identical regardless of the number of rotations.
Figure 1: Stereo video Lander. Key hardware is numbered (1) 300 m umbilical, (2) two digital video recorders (DVR) with removable 32GB storage cards inside waterproof bottle, (3) two LED lights outputting 3,000 lumens at a color temperature of 5,000 K, and (4) two cameras with 620 TV line (TVL) resolution. Please click here to view a larger version of this figure.
Figure 2: Calibration cube (500 mm x 500 mm x 300 mm). Example of a calibration with a 'calibration cube' shown in two different orientations: (A) the right side of the cube is pushed out towards cameras, and (B) the face of the cube is parallel to the face of the cameras. Red dots denote the reference points used in this particular calibration method and must always be identified in the numbered order. Please click here to view a larger version of this figure.
Figure 3: 3D measurement placed on Sebastes miniatus. The tip of the snout and end of the tail were identified in each camera frame to allow for stereo measurement. Please click here to view a larger version of this figure.
Figure 4: Area surveyed by the Lander tool. Effective area surveyed by the Lander tool was bounded by the minimum Z distance, and the 95% Z distance for each species. Note that this area created a 'donut' shaped survey volume around the Lander. Please click here to view a larger version of this figure.
Between 2013 and 2014, we conducted 816 surveys with the rotating stereo-video Lander (Figure 1) along the central California coast and collected MaxN and 95% Z distance (Figure 4) data on more than 20 species. There were clear patterns in the effective detectable range of species observed, likely due to the interaction of species' size, shape, and coloration (Figure 5). For instance, the Flag Rockfish (Sebastes rubrivinctus) has distinct banding on its sides allowing for confident identification at greater distances than other species of comparable size. Similarly, Canary Rockfish (Sebastes pinniger) are relatively large bodied, but have a pigmentation that is similar to other species, thus making it more difficult to identify at distance (Figure 5).
We use two species to demonstrate the calculations of both MaxN and 95% Z distance values: Pygmy Rockfish (Sebastes wilsoni) and Lingcod (O. elongatus). The former is a small-bodied fish that can be difficult to identify at distance; whereas O. elongatus is relatively large, has a distinct shape, and is more easily identifiable. From 2013 – 2014, 1,191 measurements for S. wilsoni and 1,222 measurements for O. elongatus were collected. Then, the 95% quantiles of distances at which these species were observed: the 95% Z distances were 2.65 m for S. wilsoni and 3.96 m for O. elongatus (Figure 5) were calculated. These 95% Z distances translate into effective survey areas of 18.6 m2 and 46.0 m2 for S. wilsoni and O. elongatus, respectively. A simple bootstrap analysis confirmed that sufficient sample sizes were obtained for characterization of 95% Z distance values. For both species, the estimate of 95% Z distance stabilized when greater than 50 surveys containing these species were sampled, providing strong evidence that the chosen sample sizes were more than adequate to characterize the effective Lander sample area for these species (Figure 6).
MaxN counts per survey were then converted into densities (number of fish/m2). We used density estimates from the 816 surveys to test the hypothesis that Lingcod and Pygmy Rockfish would be observed primarily on high relief habitats. For both species, there were significantly greater densities over high and medium relief compared with low relief habitats (Kruskal-Wallis, p <<.001; Figure 7). These results were consistent with previously reported habitat associations for both species15. There were no differences between medium and high relief habitat for either species.
To understand how the rotating Lander compared with traditional stationary camera systems, we estimated differences in density and variability estimates between a rotating and a simulated stationary Lander. We assumed a typical stationary single-camera Lander would have a 90-degree field of view. The rotating Lander has a 60-degree field of view, and requires 5 seconds of rotation to complete a 90-degree view. Using 261 surveys, we selected fish observation data from the middle 5 seconds of Lander rotations to establish MaxN. Density estimates for the pseudo-stationary Lander were standardized by using the reduced areas of coverage (i.e., approximately ¼ the area of the rotating Lander). Differences in mean density and coefficient of variation between rotating and pseudo-stationary Landers were evaluated with Welch's t-test. Mean densities obtained by the rotating camera were 18% greater than those obtained with stationary cameras (Welch's t21.7, p = 0.081, Figure 8A). Additionally, the coefficient of variation was 1.8 times greater with the stationary camera compared to rotating cameras (Welch's t15.1, p <0.001, Figure 8).
Figure 5: Z distances observed for select species. Red vertical bars denote the minimum Z distance (0.81 m from cameras) on the left and the 95% Z Distance value on the right. Note that this represents the average effective survey area around the Lander for each species. Please click here to view a larger version of this figure.
Figure 6: Bootstrapped Z distance values. Bootstrapping to increase the sample size for (A) S. wilsoni and (B) O. elongatus observations. Sample sizes ranging from 3 – 300 were bootstrapped 1,000 times each to calculate the mean 95% Z distance and verify the sample sizes were adequate. Note that the y-axis values range from 2.0 – 2.6 m for S. wilsoni and from 2.6 – 4.0 m for O. elongatus. Please click here to view a larger version of this figure.
Figure 7: Habitat differences for two select species. Average densities (± SE) of (A) S. wilsoni and (B) O. elongatus measured on low, medium, and high relief rock habitat. Please click here to view a larger version of this figure.
Figure 8: Differences between rotating and pseudo-stationary landers. Both estimates of (A) mean density (fishes/m2 ± SE) and (B) mean coefficient of variation (CV) ± SE for 261 surveys are presented. Please click here to view a larger version of this figure.
Supplementary Video 1: Calibration verification. Calibration can be verified before use in the field by measuring targets of known sizes at varying distances. Please click here to view this video. (Right-click to download.)
Supplementary Video 2: Underwater Survey Footage. Please click here to view this video. (Right-click to download.)
The traditional MaxN metric is predicated on the idea of counting a guaranteed minimum number of individuals present during a survey. If a certain number of fish are simultaneously visible in a single video frame, there cannot be any fewer present, but because fish are mobile and heterogeneously distributed, the likelihood of seeing all individuals simultaneously during a single video frame is low. It is therefore likely that traditional MaxN underestimates true fish abundance16,17. Additionally, it has been demonstrated that traditional MaxN may display non-linear negatively-biased relationships with increasing fish abundances16,18. This may be related to the phenomenon of gear saturation whereby relative abundance indices fail to detect true increases in abundance19,20. Conversely, the apparent stability of an index with truly declining fish abundance has been termed 'hyperstability', and may ultimately lead to the crash of fish populations21,22. A recent study reported that instability in MaxN could be alleviated by increasing the surveyed field of view16. In that study, the relationship between MaxN and true abundance became increasingly linear as the field of view approached 100% (i.e., 360 degrees).
The results from the stationary camera simulation indicate congruence with these previous results, and suggest that the MaxN value may better characterize fish abundance. For example, the estimated mean coefficient of variance was reduced among density estimates derived from the rotating Lander compared with the pseudo-stationary Lander. This is likely due to the fact that fish are heterogeneously distributed, and that stationary cameras are more likely to 'miss' the fish present if the Lander faces the wrong direction. Rotating Landers maximize sampling effort by surveying the full 360 degrees around the tool, and the net effect is reductions to both sampling cost and variance, and an overall increase in the statistical power of the study. Future studies could better address this issue by directly testing a rotating Lander with a separate stationary Lander in a paired survey design. Similarly, we were unable to directly test for the relationship between MaxN and true abundance in this study, and future studies might directly test this using either simulation or controlled environments, as was done in Campbell (2015)16.
A possible criticism of the modified MaxN approach is the possibility of double counting individuals. Because the Lander made one full rotation per minute, and the benthic species of interest in the ecosystem tend to be relatively sedentary and slow moving under most conditions, we believe the risk of double counting was low. Additionally, cases where fish would enter or leave the survey area over the course of the eight rotations were observed. Additional precautions to avoid double counting such as using the rotation with the greatest number of individuals of a given Genus to count unidentified species were taken. Other metrics have been proposed as indices of fish abundance such as Mean Count; however, these too have been shown to consistently underestimate true abundance while increasing variability among density estimates16. MaxN is therefore recommended as a more precise metric of fish abundance. While our modified MaxN metric does not guarantee a conservative estimate of absolute minimum number of individuals, we are overall confident that this modified MaxN approach provides better estimates of true fish abundance, and that over-counting fish is of relatively low concern.
Many side-viewing video-transect surveys use a fixed transect width to estimate density for all species. Similarly, one approach to using stereo-video Landers would be to use one maximum distance-from-camera to calculate both area surveyed and fish density. Both may lead to an underestimate of species which are only reliably identifiable to smaller distances than the fixed transect width estimates23. The distance to which a species is reliably identified is caused by the interaction of factors such as size, shape, coloration pattern, fish behavior, as well as environmental factors. The 95% Z distance method is particularly advantageous in that it accounts for the interaction of all these factors simultaneously. For example, O. elongatus was the species that we are able to identify to the greatest distance, likely as a result of its distinct, large, elongate body shape and behavioral tendency to lay on the seafloor. Rosy Rockfish (Sebastes rosaceus) had one of the shortest Z distances, likely because, as a member of the Sebastomus subgenus, it has several congeners that look very similar and are difficult to distinguish at increased distances. By allowing for species-specific areas surveyed by the Lander, we may be able to more accurately estimate fish abundance. The bootstrap approach to sample size verification is simple and readily implemented in other surveys, and we believe the method of 95% Z distance could be further adapted to accommodate line transect survey design. 95% Z distance would then represent a horizontal distance of reliable detection for species observed with submersible or remotely operated vehicle (ROV) tools. In the future, researchers may investigate using distance sampling theory to model density as a function of detectability with distance23,24.
As there is greater use of no-take reserves in fisheries management25,26,27, there is an increasing need for non-extractive sampling techniques, especially in deep water habitats not accessible to diver surveys. However, it is also necessary that those techniques provide accurate, reliable data on fish length, abundance, and species composition. Video Landers are a relatively new monitoring tool that have a low cost, can be operated on relatively small vessels of opportunity, and are logistically simpler to operate than ROVs and submersibles while requiring fewer and less skilled personnel. While not discussed in these methods, stereo-camera Landers are capable of accurate length measurements with error less than 2%. Additionally, Landers can be rapidly deployed over large geographic areas, increasing statistical inference. We expect the interest in video monitoring tools to increase as research agencies look to tighten budgets and more efficiently spread sampling effort. Our modification of MaxN and 95% Z distance should be considered in future ecological studies utilizing rotating video Landers.
The authors have nothing to disclose.
This work was funded by The Nature Conservancy and private donors, Resources Legacy Fund Foundation, Gordon and Betty Moore Foundation, Environmental Defense Fund, California Sea Grant Program, the NMFS National Cooperative Research Program, and a NOAA Saltonstall-Kennedy Grant #13-SWR-008. Marine Applied Research and Exploration (Dirk Rosen, Rick Botman, Andy Lauerman, and David Jefferies) developed, constructed and maintained the video Lander tool. We thank Jim Seager and SeaGIS™ software for technical support. Captain and commercial fisherman Tim Maricich and crew onboard the F/V Donna Kathleen provided support in deploying the Lander from 2012-2015. Thank you to all who participated in video data collection or analysis (Anne Tagini, Donna Kline, Lt. Amber Payne, Bryon Downey, Marisa Ponte, Rebecca Miller, Matt Merrifield, Walter Heady, Steve Rienecke, EJ Dick, and John Field).
calibration cube | SeaGIS | http://www.seagis.com.au/hardware.html | 1000x1000x500 mm is the preferred dimensions. Other methods of calibration are available. |
CAL calibration software | SeaGIS | http://www.seagis.com.au/bundle.html | |
EventMeasure stereo measurement software | SeaGIS | http://www.seagis.com.au/event.html | |
Statistical software | R Core Team 2017 (v. 3.4.0) | Bootstrapping code can be found: https://github.com/rfields2017/JoVE-Bootstrap-Function | |
Spreadsheet Software | Microsoft Excel | ||
2 waterproof cameras | Deep Sea Power and Light | HD quality preferred | |
2 depth rated, waterproof lights | Deep Sea Power and Light : 3000 lumen LED with 5000k color temperature | ||
DVR recorder | Stack LTD DVR | ||
standard PC | Windows 10 preferred OS | ||
rotating Lander platform | Marine Applied Research and Engineering (MARE) |