Background:
Effective research and monitoring require accurate, interoperable and representative data. Video-based methods are commonly used to survey fish and are rapidly expanding globally due to their cost-effectiveness, ability to provide accurate body size measurements, non-destructive sampling approach and capacity to create permanent data records. However, the effectiveness of video-derived data depends on standardised approaches that produce reliable, reproducible, and error-free data. A national synthesis of video-based fish survey datasets identified numerous errors in data collection and annotation, many of which are also relevant to other survey methods.
Aims:
To develop an open-source toolkit, CheckEM, for quality control checks on fish survey data.
Development:
The CheckEM web application and R package identify errors in metadata and cross-check annotations of fish against taxonomic databases, expected spatial distributions, and maximum body sizes. CheckEM flags species observed beyond their known range, outdated scientific names, and body size outliers. CheckEM standardises, cleans, and visualises datasets, offering interactive plots and tables in a user-friendly interface. Downloadable summary data and error reports support iterative checks and improvement of data quality.
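The three annotation checks described above (outdated names, out-of-range observations, and body size outliers) can be illustrated with a minimal sketch. It is written in Python for illustration only and does not use or reproduce the CheckEM R package API. The species names, regions, maximum length, synonym table and the `check_annotation` function are all hypothetical placeholders, not values or functions from CheckEM or any taxonomic database.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Optional


@dataclass(frozen=True)
class SpeciesReference:
    """Reference record for one accepted species (hypothetical structure)."""
    max_length_mm: float
    regions: FrozenSet[str]


# Hypothetical reference data; a real workflow would load these from
# a taxonomic database and a species distribution list.
REFERENCE: Dict[str, SpeciesReference] = {
    "Exemplar novus": SpeciesReference(
        max_length_mm=800.0,
        regions=frozenset({"Region A", "Region B"}),
    ),
}

# Hypothetical map from outdated names to currently accepted names.
SYNONYMS: Dict[str, str] = {"Exemplar antiquus": "Exemplar novus"}


def check_annotation(name: str, length_mm: Optional[float], region: str) -> List[str]:
    """Return a list of quality-control flags for one fish annotation."""
    flags: List[str] = []

    # 1. Outdated scientific name: resolve it to the accepted name.
    accepted = SYNONYMS.get(name, name)
    if accepted != name:
        flags.append(f"outdated name: use {accepted}")

    ref = REFERENCE.get(accepted)
    if ref is None:
        # Without a reference record, range and size cannot be checked.
        flags.append("name not found in reference list")
        return flags

    # 2. Observation outside the expected spatial distribution.
    if region not in ref.regions:
        flags.append(f"outside known range: {region}")

    # 3. Body size larger than the recorded maximum for the species.
    if length_mm is not None and length_mm > ref.max_length_mm:
        flags.append(f"length {length_mm} mm exceeds maximum {ref.max_length_mm} mm")

    return flags


if __name__ == "__main__":
    for record in [
        ("Exemplar novus", 300.0, "Region A"),
        ("Exemplar antiquus", 900.0, "Region C"),
    ]:
        print(record, check_annotation(*record))
```

Returning a list of flags rather than raising on the first problem mirrors the idea of an error report: each annotation can carry several independent issues that an annotator reviews and corrects before re-running the checks.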
Conclusions and Implications:
CheckEM enhances data accuracy, confidence, interoperability, and reusability, improving collaboration and cross-dataset comparisons to support robust analyses and informed marine resource management.