This applied experimental paradigm replicates circumstances by which an eyewitness to a real crime may create a holistic facial composite of the culprit from memory, and then attempt to identify the culprit from a video line-up containing the culprit or one in which he or she is not present.
The paradigm detailed in this manuscript describes an applied experimental method based on real police investigations during which an eyewitness or victim to a crime may create from memory a holistic facial composite of the culprit with the assistance of a police operator. The aim is that the composite is recognized by someone who believes that they know the culprit. For this paradigm, participants view a culprit actor on video and following a delay, participant-witnesses construct a holistic system facial composite. Controls do not construct a composite. From a series of arrays of computer-generated, but realistic faces, the holistic system construction method primarily requires participant-witnesses to select the facial images most closely meeting their memory of the culprit. Variation between faces in successive arrays is reduced until ideally the final image possesses a close likeness to the culprit. Participant-witness directed tools can also alter facial features, configurations between features and holistic properties (e.g., age, distinctiveness, skin tone), all within a whole face context. The procedure is designed to closely match the holistic manner by which humans’ process faces. On completion, based on their memory of the culprit, ratings of composite-culprit similarity are collected from the participant-witnesses. Similar ratings are collected from culprit-acquaintance assessors, as a marker of composite recognition likelihood. Following a further delay, all participants — including the controls — attempt to identify the culprit in either a culprit-present or culprit-absent video line-up, to replicate circumstances in which the police have located the correct culprit, or an innocent suspect. Data of control and participant-witness line-up outcomes are presented, demonstrating the positive influence of holistic composite construction on identification accuracy. Correlational analyses are conducted to measure the relationship between assessor and participant-witness composite-culprit similarity ratings, delay, identification accuracy, and confidence to examine which factors influence video line-up outcomes.
If the police have no suspect for a crime, an eyewitness, often the victim; may create a facial composite of the culprit from memory with the assistance of a police system operator 1. The aim is that someone familiar with the culprit will recognize that image. An identified suspect — who may not be the actual culprit — may be placed in a line-up to see whether the original eyewitness can identify them or not. Many eyewitnesses make misidentifications. From real police line-ups over 20% identify a foil in the UK 2 and the USA 3, known to be innocent but included in the line-up to provide a test of the witness’ memory of the culprit. Sometimes, witnesses select a factually innocent police suspect. This type of error may be the leading cause of wrongful convictions 4-6. Eyewitnesses created facial composites in 46 of the first 250 US DNA-exoneration miscarriage of justice cases 4, and many subsequently identified the innocent individual from a line-up. There may have been alternative inculpating evidence, so that the facial composite procedure was not necessarily responsible. However, line-up accuracy can be impaired e.g., 7-10, unaffected e.g.,8, or enhanced e.g., 11,12 by composite creation, and the aim of applied research of the type described here is to suggest best practice procedures that may be adopted by the police, when facial composite construction is followed by a line-up.
Police forces in many countries employ computerized holistic facial composite systems e.g., 13-15, replacing the previously ubiquitous feature-based systems (for reviews of composite systems see 16-17). This is mainly because holistic systems more closely match the whole face ‘Gestaltic’ manner in which humans of all ages process and recognize faces e.g., 18-20. With feature-based systems, witnesses are required to first, verbally recall the suspect’s facial features (e.g., eyes, nose, and mouth), second, to locate these individual features from often limited feature databases and third, to assemble them into a whole composite. Each step is cognitively demanding — few people possess the vocabulary to provide a detailed facial description, feature-by-feature facial analysis is prone to error, and verbal recall tends to be less accurate than recognition, possibly due to a mismatch between the modality in which faces are encoded (visual mode), and the retrieval of the face from memory (verbal mode) 21-22. Not surprisingly feature-based composites are often a poor culprit likeness 16-17.
With holistic facial composite systems 13-15, witnesses select from a series of realistic but computer-generated facial arrays, the image that most closely matches their memory of the culprit. Thus the primary retrieval mode is visual, matching the mode in which faces are encoded. The interfaces of the different systems vary, including whether images are in color or not, array numbers, and whether the witness views whole faces, or faces with the external features removed 17. However, with all holistic systems, successive arrays of images are used to achieve a step-by-step improvement in composite-to-culprit likeness. At each step, the faces comprising an array are determined by the witness’ previous selection(s) and an evolutionary algorithm. The witness selects a face from the array, and the evolutionary algorithm breeds those selections to create the new generation of faces in the following array. Additional tools can add clothing, tattoos, facial hair or other individuating marks, manipulate facial feature size and placement, and adjust holistic properties (e.g., age, distinctiveness). The process is complete when the witness is satisfied with their composite. Throughout, the police operator advises but does not influence the procedure.
In conjunction with an initial Cognitive Interview 23-24, with an emphasis on the free recall component which facilitates information quality and quantity, holistic system production promotes whole face recognition, rather than as required with feature-based systems — analyses of isolated facial features. Manipulation of facial features and their configurations always occurs within the context of a whole face, and as a consequence, holistic system composites are often of higher quality, and recognized more readily by people familiar with those depicted than composites created using feature-based systems 17, 25-26. Police field surveys also suggest that holistic system composites generate higher rates of suspect identification e.g., 27-28. Furthermore, even children as young as six-years-of age 8, and adults with intellectual disabilities 29 can understand their task demands. For these reasons, particularly if a witness has difficulty in describing a face, the Association of Chief Police Officers (ACPO) in England and Wales recommend the use of holistic systems 30.
Once a potential suspect has been identified by the police, they may ask the composite-creating witness to view that suspect in a line-up or identity parade. Some research has found that composite creation enhances identification accuracy, when performance is compared with non-composite creating controls e.g., 11-12. Other research, primarily employing feature-based systems has found that composite creation negatively impacts identification performance e.g., 7, 9-10. If a composite is a poor likeness to the culprit, identification accuracy appears most susceptible 8, 10. This suggests that for creating witnesses, a facial composite may provide a more salient memory trace than that of the original suspect. Nevertheless, all other things being equal, the chances of a correct identification should be enhanced by the creation of a holistic system composite, as these are likely to be closer in likeness to the culprit than a feature-based composite.
The research paradigm described here closely replicates a design and procedure used with young adult 8, 11, 31, child 8, and older adult 31 participants. Composite creating participant–witnesses and non-composite creating controls view an initial ‘culprit-actor’ crime scene video. After a delay, the participant-witnesses construct a holistic system facial composite. Subsequently, after a further delay, all participants attempt to identify the suspect from a video line-up, the technology used in virtually all of approximately 110,000 formal identification procedures per annum in the UK 2,32. The mean delay between crime scene video viewing and line-up will be equal for both groups. Based on a large body of empirical research, there is an ongoing debate as to the best procedure for conducting a line-up. Some researchers have argued in favor of sequential line-ups over the alternative simultaneous line-ups e.g., 33-34. Others have an opposing view e.g.,35-36. However, correct identification rates are higher from video line-ups than the often recommended US sequential procedure 37-38. Regardless, the methodology described here has strong ecological validity as it closely matches procedures used in real police investigations, and could be adapted for use with different line-up types. Methods of ensuring that the line-up is ‘fair’ to the suspect are also described 39-40. In addition, the holistic composite system employed is the standard used by the majority of UK police forces, the interviewing techniques are those normally used by police composite operators, and the video line-ups were constructed by the London Metropolitan Police Service as though a real investigation, following guidelines in the Police and Criminal Evidence Act (PACE) Codes of Practice (Code D), which prescribes police identification procedures in England and Wales 41. Furthermore, the design is flexible, allowing for the measurement of alternative variables that might impact on composite quality and identification accuracy (e.g., composite system, culprit-actor gender, age, or ethnicity; delay between procedures).
In making decisions as to whether to publicize a facial composite or not, the police operator will likely ask the creating-witness to assess final composite quality. For this paradigm, following construction, each participant-witness rates their composite for likeness to their memory of the culprit. These ratings are compared to those provided by independent assessors — highly familiar with the person the composite is supposed to depict, so as to provide a more objective recognisability measure. These ratings serve as a proxy for the more ecologically valid method of asking people familiar with the culprit to attempt to name the composite. However, assessor ratings correlate with naming rates suggesting they are essentially measuring the same construct 42. They also positively correlate with adult, but not child participant-witness creator assessments of quality 8.
In summary, this paradigm employs an independent measures design. The first factor is participant role — participants are allocated to either the participant-witness composite creating group or the non-composite creating control group. The second factor is culprit presence — participants are presented with a culprit-present video line-up containing the culprit viewed in the original crime scene video, or a culprit-absent line-up in which the culprit is replaced by an additional foil. A third factor, if varied, is the delay between viewing the crime scene video and the line-up. The primary dependent variable is line-up accuracy. Within the participant-witness composite-creating group, a correlational design measures the relationship between ratings of composite-culprit similarity provided by participant-witnesses and independent assessors, some of whom should be acquaintances of the culprit. The relationship between the composite-culprit similarity ratings and video line-up accuracy is also examined.
Procedures involving human participants were approved by the University Research Ethics Committee of the University of Greenwich, following guidelines issued by the British Psychological Society.
1. Viewing of Crime Scene Video Depicting the ‘Culprit’
Figure 1. Stills from the crime scene video. (A, B) Two stills from the crime scene video depicting full body and facial views of the culprit (see 1.3). Please click here to view a larger version of this figure.
2. Participant-witness Creation of a Facial Composite with the Assistance of an Operator
NOTE: This section of the protocol has been optimized for the holistic facial composite system, EFIT-V, but can be adapted for other software.
1 | How old did the culprit appear to be? | |||||||
2 | What do you remember about the culprit’s hair (length, type, style, color)? | |||||||
3 | What do you remember about the culprit’s face (shape, length, breadth, complexion)? | |||||||
4 | What do you remember about the culprit’s ears (shape, size, position, lobes)? | |||||||
5 | What do you remember about the culprit’s nose (length, tilt, nostrils, shape, ridge)? | |||||||
6 | What do you remember about the culprit’s eyebrows (thickness, space, shape, color)? | |||||||
7 | What do you remember about the culprit’s eyes (shape, size, depth, space, shade, color)? | |||||||
8 | What do you remember about the culprit’s mouth/lips (width, shape, upper, lower)? | |||||||
9 | What do you remember about the culprit’s chin (shape, size, type)? | |||||||
10 | What do you remember about the culprit’s facial hair (beard, moustache, stubble)? | |||||||
11 | Did the culprit wear spectacles? | |||||||
12 | Was there anything distinctive about the culprit (marks or scars)? | |||||||
Note: | The operator should only ask any of these questions, if the description associated with that question is missing from the participant-witness’ free recall account. |
Table 1. Cued Post Cognitive Interview Questions.
Figure 2. Facial composite construction method A: Face shape. At this stage in the facial composite construction procedure, after the operator enters basic description keywords into the holistic composite system, the participant-witness is asked to select an approximate face shape meeting their memory of the culprit from the nine images displayed on the screen, or to reject that array to produce a new display. As with the reminder of the construction process, this stage assesses recognition (see 2.9.1). Please click here to view a larger version of this figure.
Figure 3. Facial composite construction method B: Hairstyle tool. Following selection of face shape, and facial features, the participant-witness is asked to select an approximate hairstyle from the nine images displayed on the screen, or to reject that array to produce a new display. The default hairstyle on all images is grey, until coloring is added (see 2.9.3). Please click here to view a larger version of this figure.
Figure 4. Facial composite construction method C: Shoulders tool. Following selection of face shape, and facial features, the participant-witness is asked to select shoulders from the nine images displayed on the screen, or to reject that array to produce a new display. Clothing color and style can be manipulated and company logos or other idiosyncratic features may be added (see 2.9.4). Please click here to view a larger version of this figure.
Figure 5. Facial composite construction method D: Local attributes tool. After the shoulders are selected, the participant-witness views a series of facial arrays possessing faces of differing variability from each other, although variability reduces in subsequent arrays, as each ‘best’ image is chosen. At this point in creation, the participant-witness may suggest changes to specific facial features, and compare the outcome to the original unmodified image on the screen. Even though changes are made to features, the methodology still accesses holistic processes as changes are made in the context of whole face comparison (see 2.9.7). Please click here to view a larger version of this figure.
Figure 6. Facial composite construction method E: Holistic attributes tool. The participant-witness may also suggest changes to the holistic properties of the selected face (e.g., age, distinctiveness) by using a slider tool. Again, the outcome is compared to the original unmodified image on the screen (see 2.9.8). Please click here to view a larger version of this figure.
Figure 7. Facial composite construction method F: Final image. In a police investigation this image would be printed, and a copy transferred to CD to be retained in the evidence bag (see 2.9.9). Please click here to view a larger version of this figure.
3. Collection of Post Composite Construction, Participant-witness Ratings of Culprit-composite Likeness
4. Culprit-acquaintance Assessment of Culprit-composite Similarity
5. Preparation of the Video Line-ups Containing the Culprit and Foils
NOTE: This section of the protocol has been optimized for the video line-up system PROMAT, although other systems are available.
Figure 8. Video lineup stills. (A,Please click here to view a larger version of this figure. B) Frontal and profile facial image stills of the culprit in the culprit-present video line-up procedure (see 5.2). Please click here to view a larger version of this figure.
6. Mock Witness Paradigm Pilot Study to Test Video Line-up Fairness
7. Presentation of the Video Line-up and Questionnaire
Instructions | |||||
The following is a cued description form, please try to enter comments in each section (if you can) appertaining to the particular aspect of the person (the culprit) that you saw in the original video clip. As describing a person is often a difficult task, it is important that you concentrate and stay focused for the next few minutes. Prior research has also demonstrated the importance of striving for accuracy and reporting only that which you are certain you remember. | |||||
1 | Ethnic appearance | ||||
2 | Altura | ||||
3 | Apparent age | ||||
4 | Gender | ||||
Please circle one or more responses to the following questions | |||||
5 | Build | Fat, Proportional, Thin, Stocky, Athletic, Heavy, Other | |||
6 | Hair color | Dark brown, Light brown, Fair, Blonde, Grey, White, Black, Ginger, Auburn, Other | |||
7 | Hair type | Bald, Thinning, Receding, Straight, Curly, Wavy, Dyed, Short, Collar length, Shoulder, Very long, Wig, Length, Other | |||
8 | Eyes | Blue, Brown, Green, Grey, Cast, Staring, Other | |||
9 | Complexion | Fresh, Pale, Ruddy, Tanned, Fair, Freckled, Dark tone, Mid tone, Light tone, Other | |||
10 | Facial hair | Beard, Moustache, Bushy, Sideburns, Eyebrows, Other | |||
Clothing: Enter brief description (if appropriate) | |||||
11 | Shoes | 12 | Socks | ||
13 | Trousers | 14 | Belt | ||
15 | Shirt | 16 | Jacket | ||
17 | Skirt | 18 | Dress | ||
19 | Jumper | 20 | Topcoat | ||
21 | Jewelry | 22 | Hat | ||
23 | Outro |
Table 2. Cued Description Form.
8. Data Analyses
NOTE: Some of the data collected (e.g., descriptions of the culprit) in this paradigm are included primarily to ensure that procedures conform to normal police practice in England and Wales and not specifically for later analyses. Nevertheless, it would be possible to analyze these data to test for relationships between perhaps description quality and quantity, and composite quality and identification accuracy. However, these would be supplemental analyses, and the data analyses listed here are those that are most likely to be employed to investigate important experimental hypotheses.
The data reported here are a subset of data collected in two studies in which the described experimental paradigm was partially followed 8, 11.
Line-up fairness checks
The pilot mock witness paradigm described in Section 6, is designed to ensure that the line-up is not biased against a suspect, in that they should not stand out in any manner to induce selection more often than would be expected by chance 39, 40. From this procedure, a measure of line-up fairness is calculated by ensuring no member is selected significantly more often than would be expected by chance alone by the mock witnesses (e.g., 1/9 = 11.1%). Tredoux’s E 39 measure of functional size is applied to assess the number of line-up members who are ‘plausible’, and in a real line-up would provide a suitable test of the witness’ memory. Ideally this value should be close to the maximum (e.g., nine). For the representative data reported in 11 the line-up was found to be fair as mock witnesses selected the culprit at close to chance levels (10.9%), and the vast majority of foils were plausible (Tredoux’s E = 7.05).
Matched condition checks
The next analysis uses an independent-measures t-test 43 to ensure that the ratings of prospective confidence in being able to recognize the culprit, collected shortly after the participants view the crime scene video are approximately equal in the two experimental conditions (see 1.6). The outcome should be non-significant. In 11, the responses on this scale described in 1.6 were as expected approximately equal, t(266) = .57, p > .2, indicating the participants were matched prior to any other procedure.
Composite-culprit similarity ratings
To test hypotheses concerning the objectivity of the participant-witness’ self-assessments of their own composites, the second analysis examines whether there is a relationship between the composite-suspect similarity ratings provided by the participant-witnesses to their own composite only (see Section 3), and those provided by the independent culprit-acquaintance assessors (see Section 4). In 8, a Pearson’s correlation test 43 on the ratings provided to all 57 holistic facial composites was non-significant. However, when follow-up analyses were conducted with child- and adult-participant-witness data separated, there was a positive relationship between these ratings to the 26 adult composites, r(26) = .46, p < .05; but not to the 31 children’s composites, r(31) = .01, p > .2; an indication that adult witnesses, but not children, can provide objective assessments of the quality of their own composites.
Video line-up responses
To test hypotheses related to line-up performance, hierarchical loglinear analyses 43, or chi-square tests 43 examine the effects of experimental condition on line-up outcomes (see 7.9). The reported effect size measure for these nominal data analyses is Φ, although odds ratios (OR) are also reported to provide a measure of the association between the two reported variables. As with most eyewitness research, culprit-present and culprit-absent video line-up data are separated. Each participant makes one line-up decision only.
The influence of composite construction on eyewitness identification is measured by comparing the line-up selections of participant-witnesses and controls. Table 3 displays representative results taken from a subset of the data collected in Experiment 1 of 11 in which control line-up outcomes were compared with participant-witnesses who created a holistic facial composite using the system described in this protocol. The delay between viewing the initial culprit crime scene video and the video line-up in this experiment was approximately 2 hr.
Culprit-present line-ups provide an indication of the sensitivity of an identification procedure as measured primarily by suspect identification rates, which in this paradigm are correct culprit identifications. Other outcomes are incorrect foil identifications or incorrect line-up rejections.
Culprit-absent line-ups provide an indication of the fairness of the procedure, as measured by correct line-up rejection rates. Other outcomes are incorrect foil identifications. There was no designated ‘innocent suspect’ in this research, and therefore the first column in Table 3 is blank for culprit-absent trials.
Culprit-present choosing behavior: From the data reported in Table 3 originally presented in Experiment 1 in 11, the first analysis examines whether composite creation influences choosing behavior from a line-up as this may be indicative of a response bias. A 2 (participant role – participant-witness vs. control) x 2 (choosing behavior – chooser: culprit identification or foil identification vs. non-chooser: incorrect line-up rejection) chi-squared test 43 on the rates of selection of each outcome was not significant, χ2(1, n = 108) < 1, p > .2, Φ = .072. Participant-witnesses (80.0%) were roughly equally likely as controls to be line-up choosers (73.1%, OR = 1.09).
Culprit-present correct identifications: The second and most critical culprit-present analysis examines response accuracy only. A 2 (participant role) x 2 (accuracy — correct: culprit identification vs. incorrect: foil identification or line-up rejection) chi-squared test 43 on the data from Table 3 first presented in Experiment 1 in 11 was significant, χ2(1, n = 108) = 5.48, p = .019, Φ = .225. Participant-witnesses (70.0%) made approximately one-and-a-half times more correct culprit line-up selections than controls (44.9%, OR = 1.56).
These results are consistent with a follow-up experiment (Experiment 2 11) in which newly qualified police operators were recruited, the crime scene videos depicted six different culprits, and the mean delay between crime scene video viewing and viewing the video line-up was approximately 30 hr (participant-witness correct culprit identification rates = 48.8%; controls = 35.0%), and a meta-analysis finding positive effects of composite construction on line-up identification 12. However, other research 8, using the same basic experimental paradigm, holistic facial composite system, and line-up type, but with different culprit-actors found no significant differences in correct identification rates between adult composite-creating participant-witnesses (34.6%) and controls (31.7%). Furthermore, in that research 8, child controls between the ages of 6- and 11-years made more correct identifications (42.9%) than child participant-witnesses of the same age (19.4%). The latter result may be a consequence of the children’s facial composite being significantly inferior to the adult’s, children’s initial memory for the culprit being worse or they struggled to understand the use of confidence scales. However, this result is consistent with research finding a positive relationship between composite quality and rates of correct identification from line-ups 10. This explanation is also consistent with most previous research of this type, which has used the often inferior feature-based composite systems, finding that identification accuracy was reduced following composite construction e.g., 7, 9-10.
Culprit-present foil identifications: The third culprit–present analysis examines whether the proportion of foil identifications differs by condition. A 2 (participant role) x 2 (foil or not — foil identification vs. other decision: correct culprit identification or incorrect line-up rejection) chi-squared test 43 on the data reported in Table 3 and 11 was significant, χ2(1, n = 108) = 4.04, p = .045, Φ = .193. Controls (28.2%) made nearly three times as many foil selections as participant-witnesses (10.0%; OR = 2.82).
Culprit-absent lineup rejections: As there are only two outcomes associated with the culprit-absent data from Table 3 as originally reported in 11, only one test is conducted. A 2 (condition) x 2 (accuracy — correct line-up rejection vs. incorrect foil identification) chi-squared test 43 was not significant, χ2(1, n = 100) < 1, p > .2, Φ = .055. There were no differences in correct line-up rejection rates between participant-witnesses (44.4%) and controls (38.4%; OR = 1.16). These results are consistent with previous research finding similar null effects in culprit-absent trials 10.
Relationship between composite quality and participant-witness line-up accuracy
A further analysis examines the relationship between the quality of the facial composites and the likelihood of correct culprit video line-up identifications. The representative data reported here are from the second experiment reported in 11 in which all line-ups were culprit-present. A point biserial correlation test 43 conducted on the relationship between line-up accuracy (1 = correct; 0 = incorrect) and culprit-acquaintance ratings of culprit-composite similarity, was not significant, r(45) = -.05, p > .2, suggesting that unlike some previous research 8, 10, there was no relationship between the quality of the participant-witness’ facial composite and the accuracy of their video line-up responses. This unexpected non-significant finding may be the result of a number of extraneous variables (e.g., delay, multiple culprit-actor variables).
Total | Suspect ID | Foil ID | Line-up rejection | ||||
n | n | % | n | % | n | % | |
Culprit-present | |||||||
Controls | 78 | 35 | 44.9 | 22 | 28.2 | 21 | 26.9 |
Witnesses | 30 | 21 | 70.0 | 3 | 10.0 | 6 | 20.0 |
Culprit-absent | |||||||
Controls | 73 | – | 45 | 61.6 | 28 | 38.4 | |
Witnesses | 27 | – | 15 | 55.6 | 12 | 44.4 |
Table 3. Culprit-present and culprit-absent video line-up outcomes. Number of participants (n) and percentage of each type of line-up outcome as a function of culprit presence, and participant role from the subset of data originally published in Experiment 1 of 11, in which adult participant-witnesses used the same holistic composite system as described in the current protocol.
The creation of a facial composite may provide the first lead in a police investigation 1. A composite constructing witness may subsequently be asked to view a line-up containing the police suspect. The police suspect may in fact be the guilty culprit, in which case the line-up will be culprit-present, or they may be innocent, and the line-up will be culprit-absent. The applied experimental paradigm described here has been employed in research demonstrating the positive influence of holistic facial composite production on correct culprit-actor selections from culprit-present sequential nine-person video line-up procedures, without having any impact on culprit-absent procedures 11. The paradigm possesses strong forensic and ecological validity as it closely replicates police procedures used in England and Wales. Indeed, although not a central component of the analyses, the protocol closely follows the type of interview procedures (e.g., Cognitive Interview 23-24) that the police may employ in these circumstances, although it would be of interest to vary components of the Cognitive Interview in future research to examine this as a separate variable. Furthermore, even though a description is not actually required for holistic composite construction, an operator will still ask a witness to describe the culprit in order to demonstrate good practice, as if the description and appearance of the composite match, it suggests no undue influence on construction from the operator (for instance, concerns might be raised in court if the composite and police suspect possess blond hair, the description dark hair).
Other aspects of the procedure should be followed to enhance forensic validity. For instance, real witnesses would normally be unaware in advance that they will be witnessing a crime, and pre-warning participants that they will either create a facial composite or be asked to make an identification decision may influence the manner in which they attend to the initial video of the culprit. Therefore it is normal to avoid warning participants that the research is investigating eyewitness procedures, and for this purpose the title may often be slightly misleading (e.g., “video analysis study”). Furthermore, to avoid inadvertent experimenter bias, the various experimenter roles (e.g., composite system operator, line-up administrator) should ideally be conducted by different people. For similar reasons, the operator, who should be fully trained in the use of the composite system, and the line-up administrator, should both be unfamiliar with the culprit, and have never seen the culprit videos. Ideally, too, the line-up procedure should be conducted double-blind, to ensure no administrator bias 44-46, although double blind procedures are not prescribed in PACE Code D identification procedures for England and Wales. Similarly, PACE Code D does not require measures of identification confidence to be collected, even though as with this paradigm it is often collected for research purposes. Indeed, confidence may provide a marker for line-up accuracy. As such, there is normally a strong positive relationship between confidence and accuracy in line-up choosers, but not non-choosers 47-49.
The basic paradigm could also easily be adapted for use with different types of facial composite system (see for example 11 in which the influence on video lineup outcomes of creating a feature-based facial composite was compared with creating a holistic system composite). There are a number of available holistic composite systems, and although all work on holistic principles, the interfaces differ 13-15. The paradigm would also allow for comparisons with different types of line-up that may be standard in the UK or other legal jurisdictions. For instance, in the USA, most identification procedures are six-person sequential or simultaneous photographic line-ups 33-36, whereas Poland mainly employs four-person simultaneous line-ups 50. Although nowadays very rare in the UK, some jurisdictions elsewhere may still regularly employ live line-ups with all members present in person. For the purposes of the experimental paradigm described here an additional foil was chosen by random from the nine selected by the police officer line-up administrator to replace the culprit in the culprit-absent line-up. However, some eyewitness identification paradigms, may, in advance, specifically select an ‘innocent suspect’ for this purpose. A culprit-absent line-up will therefore be created that might contain a number of different foils from those included in the culprit-present line-up e.g., 51, as a real police line-up will be individualized to the specific suspect. To further examine the influence of composite creation on line-up outcomes, combining the results of culprit-present and culprit-absent line-ups would also inform as to whether memory sensitivity or response bias to make a selection is influenced (see for example 52 which discusses the use of signal detection measures for this type of analysis).
In addition, there are many variables that are known to affect face recognition and eyewitness identification accuracy, and the design could accommodate the testing of these to examine whether they provide an additional influence when facial composite production is followed by a line-up (e.g., culprit-actor gender, age, or ethnicity; participant confidence, delay between procedures; foil selection methods, participant culprit view quality; witness description quality; for a review of ‘estimator’ variables of this type see 53). Nevertheless, one issue with this type of design is that large numbers of participants are required for sufficient statistical power, as line-up responses are primarily dichotomous in nature (e.g., correct vs. incorrect), and as with a real police investigation most participants will create one composite and view one line-up only. Furthermore, the design is time-consuming. To replicate police investigations, there should ideally be a delay between the three phases of the study, and composite construction alone can often take over an hour. Despite these issues, it is important that new technology that may be employed by the police during an investigation (e.g., a facial composite system), is empirically tested in the laboratory to investigate the effect this technology has on subsequent investigatory procedures (e.g., line-ups). The fate of a real police suspect, regardless of guilt, may partly depend on whether best practice is followed or not.
The authors have nothing to disclose.
The research was partly funded by an internal University of Greenwich grant to the first and third authors. Thanks go to Dominic Goodchild, Chris Hughes, Adrian Ibanescu, Corrado Ranelli, and Charlie Shaw for acting as culprit-actors in the crime scene videos, and to Henry DC Williams for filming and directing the crime scene videos. Thanks also to Detective Chief Inspector Mick Neville, Inspector Barry Burnell and Sergeant Nick Milbourn from the Metropolitan Police Service for creating and supplying the PROMAT video line-ups for the project and advising on the design and video line-up procedure. Thanks also to Detective Constable Tony Barnes of the Metropolitan Police Service for providing advice on the use of the specific holistic facial composite system in police investigations, and allowing the second author of this paper, who is the composite operator depicted in the videos, to shadow him while working with eyewitnesses to create facial composites during investigations into real crimes.
Video Line-ups | Promat Envision International | PROMAT | Website: http://www.promatenvision.co.uk Contact: info@promatenvision.co.uk |
Statistics Software | IBM | SPSS Statistics 20 | Website: http://www-01.ibm.com/software/uk/analytics/spss/ Contact: +44 (0) 870 542 6426 |
Online Survey Tool | Qualtrics | Qualtrics | Website: http://www.qualtrics.com Contact: 1-800-340-9194 |
Facial Composite System | VisionMetric Ltd | EFIT-V 5.006 | Website: http://www.visionmetric.com/ Contact: efit@visionmetric.com. |