This paper describes a method for conducting multi-user experiments on decision-making and navigation using a networked computer laboratory.
Investigating the interactions among multiple participants is a challenge for researchers from various disciplines, including the decision sciences and spatial cognition. With a local area network and dedicated software platform, experimenters can efficiently monitor the behavior of the participants that are simultaneously immersed in a desktop virtual environment and digitalize the collected data. These capabilities allow for experimental designs in spatial cognition and navigation research that would be difficult (if not impossible) to conduct in the real world. Possible experimental variations include stress during an evacuation, cooperative and competitive search tasks, and other contextual factors that may influence emergent crowd behavior. However, such a laboratory requires maintenance and strict protocols for data collection in a controlled setting. While the external validity of laboratory studies with human participants is sometimes questioned, a number of recent papers suggest that the correspondence between real and virtual environments may be sufficient for studying social behavior in terms of trajectories, hesitations, and spatial decisions. In this article, we describe a method for conducting experiments on decision-making and navigation with up to 36 participants in a networked desktop virtual reality setup (i.e., the Decision Science Laboratory or DeSciL). This experiment protocol can be adapted and applied by other researchers in order to set up a networked desktop virtual reality laboratory.
Research on spatial cognition and navigation typically studies the spatial decision-making (e.g., turning left or right at an intersection) and mental representation of individuals in real and virtual environments1,2. The advantages of virtual reality (VR) include the prevention of ethical and safety issues (e.g., during a dangerous evacuation3), the automatic measurement and analysis of spatial data4, and a balanced combination of internal and external validity5,6,7. For example, Weisberg and colleagues extended previous research on individual differences in spatial knowledge acquisition by demonstrating that spatial tasks in VR can provide an objective behavioral measure of spatial ability8. This study also suggested that the navigation behavior in VR approximates real-world navigation because the virtual environment was modeled after the university campus used by Schinazi and colleagues9 (see also the study of Ruddle and colleagues10). VR has also been applied to psychotherapy11, clinical assessment12, consumer behavior13, and surgery14,15. However, most VR systems lack proprioceptive and audio feedback that may improve presence and immersion16,17,18,19, require training with the control interface20,21,22, and lack social cues. Indeed, people in the real world often move in groups23, avoid or follow other people3,24, and make decisions based on social context25,26.
At the same time, research on crowd behavior often focuses on emergent characteristics of crowds (e.g., lane formation, congestion at bottlenecks) that are simulated on a computer or observed in the real world. For example, Helbing and colleagues used a combination of real-world observations and computer simulations in order to suggest improvements to traffic flow in an intersection by separating inflow and outflow with physical barriers and placing an obstacle in the center27. Moussaïd and colleagues used a heuristics-based model to study high-density situations during a crowd disaster28. This approach suggested improvements to an environmental setting for mass events in order to avoid crowd disasters. With the aid of an existing open source framework, the implementation of such simulations could be relatively easy. SteerSuite is an open source framework that allows users to simulate steering algorithms and crowd behavior easily by providing tools for facilitating, benchmarking, and testing29. This framework can provide the core of an agent's navigation rationale, which is critical for successful crowd simulation. In addition, Singh and colleagues demonstrated a single platform that combines a variety of steering techniques30. While researchers can propose design interventions using such simulations, they are rarely validated with human participants in a controlled setting. Controlled experiments are rare in crowd research because they can be difficult to organize and dangerous to the participants.
VR has been employed to investigate social behavior using simple and complex virtual environments with one or more computer-simulated agents. In the study of Bode and colleagues31,32, the participants were asked to evacuate a simple virtual environment from a top-down perspective among several agents and found that exit choice was affected by static signage and motivation. Presenting participants with a more complex environment from a first-person perspective, Kinateder and colleagues found that the participants were more likely to follow a single computer-simulated agent during the escape from a virtual tunnel fire25. In a complex virtual environment with multiple agents, Drury and colleagues found that the participants tended to assist a fallen agent during an evacuation when they identified with the crowd26. Collectively, these findings suggest that VR can be an effective way of eliciting social behaviors, even with computer-simulated agents. However, some crowd behaviors may only be observed when there is a realistic social signal (i.e., when the participants are aware that the other avatars are controlled by people3). In order to address this shortcoming, the present protocol describes a method for conducting controlled experiments with multiple users in a networked VR setup. This approach has been employed in a recent study by Moussaid and colleagues in order to investigate the evacuation behavior of 36 networked participants3.
Research on networked VR has focused on topics unrelated to navigation strategies33,34 and/or relied on existing online gaming platforms such as Second Life. For example, Molka-Danielsen and Chabada investigated evacuation behavior in terms of exit choice and spatial knowledge of the building using participants recruited among existing users of Second Life35. While the authors provide some descriptive results (e.g., visualizations of trajectories), this study had difficulties with participant recruitment, experimental control, and generalization beyond this specific case. More recently, Normoyle and colleagues found that existing users of Second Life and participants in a laboratory were comparable in terms of evacuation performance and exit choice and different in terms of self-reported presence and frustration with the control interface36. The findings from these two studies highlight some of the challenges and opportunities afforded by online and laboratory experiments. Online studies are capable of drawing from a much larger and motivated population of potential participants. However, laboratory studies allow for more experimental control of the physical environment and potential distractions. In addition, online studies may pose some ethical concerns regarding data anonymity and confidentiality.
As a networked desktop VR laboratory, the Decision Science Laboratory (DeSciL) at ETH Zürich is primarily used to study economic decision-making and strategic interactions in a controlled environment. The technical infrastructure at the DeSciL consists of hardware, software for laboratory automation, and software that supports the multi-user desktop VR setup. The hardware includes high-performance desktop computers with Microsoft Windows 10 Enterprise operating system, control interfaces (e.g., mouse and keyboard, joysticks), headphones, and eye trackers (Table of Materials). All client computers are connected with Ethernet of one gigabit per second to the university network and the same network file share. There is no visible delay or lag when there are 36 clients connected. The number of frames per second is consistently above 100. The experiments are also managed and controlled with laboratory automation software based on Microsoft PowerShell (i.e., PowerShell Desired State Configuration and PowerShell Remoting). All relevant steps of the protocol are preprogrammed with PowerShell scripts called Cmdlets (e.g., Start-Computer, Stop-Computer). During the experiment, these scripts can be executed simultaneously and remotely on all client computers. This type of laboratory automation ensures an identical state of the client computers, reduces potential errors and complexity during scientific testing, and prevents researchers from having to perform repetitive manual tasks. For the navigation experiments, we use the Unity game engine (<https://unity3d.com/>) in order to support the development of 2D and 3D environments for multi-user, interactive desktop VR. The 36 client computers are connected to a server via an authoritative server architecture. At the start of every experiment, each client sends an instantiation request to the server, and the server responds by instantiating an avatar for that user on all of the connected machines. Each user's avatar has a camera with a 50 degrees field of view. Throughout the experiment, the clients send user' input to the server, and the server updates the movement of all of the clients.
In the physical laboratory, each computer is contained in a separate cubicle within three semi-independent rooms (Figure 1). The overall size of the laboratory is 170 m2 (150 m2 for experiment room and 20 m2 for control room). Each of these rooms is equipped with audio and video recording devices. Experiments are controlled from a separate adjacent room (i.e., by providing instructions and initiating the experimental program). From this control room, the experimenters can also observe the participants in both physical and virtual environments. Together with the Department of Economics at the University of Zürich, the DeSciL also maintains the University Registration Center for Study Participants, which was implemented based on h-root37.
Although similar systems have been described in the literature38, the DeSciL is the first functional laboratory that is suitable for multi-user desktop VR experiments on navigation and crowd behavior to our knowledge. Here, we describe the protocol for conducting an experiment in the DeSciL, present representative results from one study on social navigation behavior and discuss the potential and limitations of this system.
All methods described here have been approved by Research Ethics Committee of ETH Zürich as part of the proposal EK 2015-N-37.
1. Recruit Participants for the Planned Experimental Session.
2. Prepare the Experimental Session.
3. Conduct the Experiment.
4. Finalize the Experiment.
For each client on each trial, the experiment data from the DeSciL typically include trajectories, time stamps, and measures of performance (e.g., whether the participant turned in the "correct" direction at a particular intersection). A representative study investigated the effects of signage complexity on the route choice for a crowd of human participants (with virtual avatars) in a simple Y-shaped virtual environment. In this experiment, 28 participants (12 women and 16 men; mean age = 22.5) were given the same goal location (i.e., gate number) and were asked to choose the corresponding route option at the intersection using a map (see Figure 2).
The map complexity varied over 16 trials, and the hypothesis was that the decision time and accuracy would be higher for maps that are more complex. While we expect the decision accuracy to be relatively high overall, the participants' trajectories can be used in future experiments to define the walking paths of agents that convey a realistic social signal (i.e., believable movements). The total experiment time was approximately 1 h, including welcoming the participants, conducting the training session (for the control interface), and testing in the Y-shaped corridor. The obtained data are summarized in Table 1.
Figure 3 indicates the minimum and maximum completion times for each trial. These descriptive statistics provide an indirect measure of congestion during the trial. The obtained data also allows for the visualization of trajectories generated by the virtual crowd (see Figure 4). Spatial statistics can then be used to analyze the changes in trajectories over trials. For example, the researchers may be interested in how closely the participants followed each other or how smoothly the participants maneuver with particular control interfaces.
Figure 1: Photographs of the DeSciL laboratory. (a) The control room contains the server that receives traffic from the 36 client computers and monitors the participants in their cubicles. This room can be isolated from the testing rooms in terms of sound and vision. Communication to participants is provided via microphone and speaker system. (b) The three testing rooms contain 36 cubicles. (c) Each cubicle contains a desktop computer, a monitor, a mouse and a keyboard interface, headphones, and an eye tracker. Please click here to view a larger version of this figure.
Figure 2: Views of the Y-shaped virtual environment. (a) From the server, the researchers can observe the participants moving towards the intersection. (b) From the clients, the participants can view the virtual environment and other avatars from a first-person perspective during movement. Please click here to view a larger version of this figure.
Figure 3: Representative results from 16 experimental trials. The maximum and minimum times are the times required by the fastest and slowest participants to reach the destination on each trial. Please click here to view a larger version of this figure.
Figure 4: Participants trajectories from (a) trial 1 and (b) trial 16. The x- and y-axes represent the locations of the avatars in the crowd. The color bar represents time elapsed during the trial. Please click here to view a larger version of this figure.
Trial Number | MapType | Accuracy(%) | Average time/s |
1 | Simple | 100 | 42.01 |
2 | Complex | 96.4 | 40.51 |
3 | Simple | 100 | 39.15 |
4 | Complex | 100 | 38.66 |
5 | Complex | 100 | 38.52 |
6 | Complex | 100 | 38.87 |
7 | Simple | 100 | 38.43 |
8 | Complex | 100 | 38.26 |
9 | Simple | 100 | 37.43 |
10 | Simple | 100 | 38.44 |
11 | Complex | 100 | 37.08 |
12 | Complex | 100 | 36.8 |
13 | Simple | 100 | 37.67 |
14 | Complex | 100 | 36.52 |
15 | Simple | 100 | 36.83 |
16 | Simple | 100 | 37.88 |
Table 1: Representative results from 16 experimental trials. Decision accuracy represents the percentage of correct choices (i.e., turning towards the correct gate) over all participants. Mean decision time is the mean time required to reach the destination (whether correct or not) over all trials.
In this article, we described a multi-user desktop virtual reality laboratory in which up to 36 participants can interact and simultaneously navigate through various virtual environments. The experimental protocol details the steps necessary for this type of research and unique to multi-user scenarios. Considerations specific to these scenarios include the number of participants in attendance, the cost of seemingly small experimenter errors, rendering and networking capacities (both server- and client-side), training with the control interface, and data security. Overbooking participants is necessary in order to ensure a precise number of participants in an experimental session. If too few participants attend, then the cost of a failed experimental session is relatively high. Similarly, experimental errors can lead to a failed session when either the participants' data were contaminated before the error was detected, or the experiment cannot be conducted because of software or hardware failures. For example, if too much information is distributed through the network, then a relaunch of the entire system may be necessary. This is especially problematic if the experiment has already begun. In addition, the participants in virtual navigation experiments require experience and/or training with the control interface because the controls are less intuitive than real walking21 and the interaction with the controls can interfere with spatial memory tasks20. Responsible data management also becomes especially important given the large amount of data obtained per session.
While there are many opportunities afforded by the DeSciL, at least three limitations remain. First, the current system is setup for up to 36 simultaneous participants. Experiments on larger virtual crowds may require computer-controlled agents, traces of human participants from several previous sessions, or the capability of including online participants. Second, future hardware upgrades (e.g., for better graphics cards and better processors) will be much more expensive than for the traditional, single-user system. Third, multi-user desktop virtual reality research cannot yet be conducted with control interfaces that are more similar to real walking. Thus, research on locomotion and the physical interactions among participants is limited.
Despite these limitations, the DeSciL offers several advantages over real-world studies, single-user laboratory studies, and multi-user online studies. The software automation gives the researchers the abilities to adapt the experimental protocol with respect to their needs. Compared to both real-world and online studies, the DeSciL allows for more experimental control. For example, experiments in the DeSciL may employ systematic variations of the environment and provide direct observation of the participants in both virtual and physical worlds. Compared to single-user desktop virtual reality studies with computer-controlled agents, the participants can interact with each other in real-time, and the emergent behavior of the virtual crowd is less reliant on the experimenter's preconceptions. Computer-controlled agents in VR often rely on scripted actions and do not adapt to the users' movements in real time. In contrast, networked desktop VR provides a more ecological context in which human-controlled avatars affect (and are affected by) each other's movements. In addition, this approach can inform the movement parameters (e.g., walking speed and hesitations) of future agent-based models in crowd research (e.g., for evacuation scenarios39). In general, multi-user desktop virtual reality studies allow for more precise measurement of spatial behavior and the detection of patterns that may have previously been overlooked.
Recently, the DeSciL has been successfully employed in a series of decision-making40,41 and navigation studies3,21. For example, Moussaid and colleagues used the multi-user desktop VR setup in order to study the effect of stress on crowd behavior during an evacuation3. In this study, the "correct" exit varied from trial to trial, and only a proportion of the participants were informed of the correct exit. The results indicated that participants under stress led to a more efficient evacuation, but this finding may be attributable to the way in which the collisions were implemented. In addition, participants tended to follow other avatars under stress, suggesting that a social signal was conveyed among the participants despite the lack of direct physical interaction. These results emphasize the advantages of multi-user VR compared to single-user VR with computer-controlled agents. Future studies will include the comparison of multi-user data acquired either online or in the laboratory, more complex environmental variations, and the addition of peripheral devices such as eye trackers or physiological devices. These advancements will allow for the collection of different types of complex behavioral data42. For example, low-cost eye trackers can be incorporated in order to monitor the participants' attention or detect coarsely areas of interest on the screen.
The authors have nothing to disclose.
The representative study was funded by the Swiss National Science Foundation as part of the grant "Wayfinding in Social Environments" (No. 100014_162428). We want to thank M. Moussaid for insightful discussions. We also want to thank C. Wilhelm, F. Thaler, H. Abdelrahman, S. Madjiheurem, A. Ingold, and A. Grossrieder for their work during the software development.
PC | Lenovo | IdeaCentre AIO 700 | 24’’ screen, 16 GB RAM, and SSDs. CPU: Intel core i7. GPU:NVidia GeForce GTX 950A |
Keyboard | Lenovo | LXH-EKB-10YA | |
Mouse | Lenovo | SM-8825 | |
Eye tracker | Tobii Technology | Tobii EyeX | Data rate: 60 Hz. Tracking screen size: Up to 27″ |
Communication audio system | Biamp Systems | Networked paging station – 1 | Ethernet:100BaseTX |