High-density electroencephalography (dEEG) is being used increasingly to study brain development and plasticity in the early years of life. Here we present an application of sophisticated analysis techniques that builds on traditional EEG recording to understand the oscillatory dynamics of rapid auditory processing in the infant brain.
Rapid auditory processing and acoustic change detection abilities play a critical role in allowing human infants to efficiently process the fine spectral and temporal changes that are characteristic of human language. These abilities lay the foundation for effective language acquisition, allowing infants to home in on the sounds of their native language. Invasive procedures in animals and scalp-recorded potentials from human adults suggest that simultaneous, rhythmic activity (oscillations) between and within brain regions is fundamental to sensory development, determining the resolution with which incoming stimuli are parsed. At this time, little is known about oscillatory dynamics in human infant development. However, animal neurophysiology and adult EEG data provide the basis for a strong hypothesis that rapid auditory processing in infants is mediated by oscillatory synchrony in discrete frequency bands. To investigate this, 128-channel, high-density EEG responses of 4-month-old infants to frequency change in tone pairs, presented in two rate conditions (Rapid: 70 msec ISI and Control: 300 msec ISI), were examined. To determine the frequency band and magnitude of activity, auditory evoked response averages were first co-registered with age-appropriate brain templates. Next, the principal components of the response were identified and localized using a two-dipole model of brain activity. Single-trial analysis of oscillatory power showed a robust index of frequency change processing in bursts of Theta band (3-8 Hz) activity in both right and left auditory cortices, with left activation more prominent in the Rapid condition. These methods have produced data that are not only some of the first reported evoked oscillations analyses in infants, but are also, importantly, the product of a well-established method of recording and analyzing clean, meticulously collected infant EEG and ERPs.
In this article, we describe our method for infant EEG net application, recording, dynamic brain response analysis, and representative results.
Across a wide spectrum of developmental disorders, it is becoming increasingly clear that the key to early identification and ultimately remediation lies in understanding the early mechanisms that come into play as the developing brain assembles functional networks. Thus, there is increased interest in understanding the temporal dynamics of neural patterns that impact cognition. In particular, specific cognitive functions have been found to be differentially correlated with oscillatory activity (e.g., cyclic fluctuations in single-cell or population membrane potentials) in specific frequency bands 1. Previous studies have established that oscillatory dynamics play a crucial role in the activity-dependent self-organization of developing networks2-4, control neuronal excitability5,6 and integrate sensory inputs7,8. Oscillatory brain activity is thought to be metabolically beneficial9,10, increasing the efficiency of a variety of sensory processing functions and the coordination of higher-level functions such as cognition and language. However, systematic investigation of the role of neural synchrony across age, and of its links with behavioral outcomes in human infants, has yet to be accomplished. An important step toward this objective is to achieve a deeper understanding of the emergence and maturation of the temporal dynamics and oscillatory mechanisms that support developing cognitive processes, including early language.
A crucial component of language development is the ability to accurately process and categorize acoustic signals that change rapidly, often on the order of as little as tens of milliseconds. For example, the words “dad” and “bad” differ acoustically only over the first 40 msec of the syllable, yet the two have very different meanings and associations. Previous studies show a maturational trajectory of receptive ability for acoustic and linguistic differences. As early as 2 months of age, infants show the ability to discriminate rapid frequency changes (e.g., < 100 msec), suggesting that the “hardware” for detecting the difference between two acoustically similar syllables is in place. Over the next few months, babies can discriminate increasingly smaller differences, develop categorical perception, and exhibit cortical specialization for the sounds of native language syllables11-14. Because complex sound perception relies on the function of basic processing mechanisms, it is thought that deficits in the ability to perceive rapidly changing acoustic differences, even for simple sounds such as tones, may be early indicators15 of later language impairment.
Previous work from Choudhury and Benasich in this laboratory strongly supports this hypothesis, showing that an infant’s ability to process very rapid changes in simple sounds (e.g., tones) can predict 3- and 4-year language and cognitive abilities16,17. These data verify that the brain responses of pre-lingual infants can provide a quantifiable indicator of auditory processing and developmental progress. The study and methods presented here probe key aspects of the underlying mechanism of this relationship. Several lines of research now indicate that the peak latency and amplitude of ERP waves arise from the summation of spectrotemporal dynamics in EEG oscillations of multiple generators18-23. Spectrotemporal analysis also allows the separation of phase and power information. Phase-locked activity reflects the part of the neuronal response that is evoked by the stimulus. This type of information is similar to what can be extracted from the ERP, since responses are averaged relative to a time-locked event. However, the timing of some neuronal activity may vary from trial to trial. In ERP analysis, this activity is “averaged out”; however, in analysis of power changes from trial to trial, this information can be recovered and analyzed. Therefore, spectrotemporal analysis of phase and power may give additional information about the neuronal response, relative to the conventional ERP. Regarding infant development, there is considerable evidence that oscillations contribute to the development of neural circuits in animal models2,3, but these mechanisms are only beginning to be investigated in the human population. Work from this laboratory has shown theta and gamma oscillatory correlates of native language specialization at 6 months of age24. This highlights the functionality of oscillatory hierarchies in infancy.
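The distinction between phase-locked activity and single-trial power can be illustrated with a small simulation. The sketch below is not the analysis pipeline used in this study; it assumes simulated trials containing a 6 Hz sinusoid whose phase either is fixed or varies randomly across trials, and the function names, sampling rate, and trial counts are illustrative choices. It compares power computed from the trial average (the evoked part, comparable to the ERP) with power computed on each trial and then averaged.

```python
import cmath
import math
import random


def power_at(signal, freq_hz, srate_hz):
    """Squared magnitude of the Fourier coefficient of `signal` at freq_hz."""
    n = len(signal)
    coeff = sum(
        x * cmath.exp(-2j * math.pi * freq_hz * k / srate_hz)
        for k, x in enumerate(signal)
    ) / n
    return abs(coeff) ** 2


def simulate_trials(n_trials, jitter, freq_hz=6.0, srate_hz=250.0, dur_s=1.0, seed=0):
    """Simulated trials (illustrative only): a unit-amplitude sinusoid whose
    phase is drawn uniformly from [-jitter, +jitter] radians on each trial."""
    rng = random.Random(seed)
    n = int(srate_hz * dur_s)
    trials = []
    for _ in range(n_trials):
        phase = rng.uniform(-jitter, jitter)
        trials.append(
            [math.sin(2 * math.pi * freq_hz * k / srate_hz + phase) for k in range(n)]
        )
    return trials


def evoked_and_total_power(trials, freq_hz=6.0, srate_hz=250.0):
    """Return (power of the trial average, average of single-trial power)."""
    n = len(trials[0])
    average = [sum(t[k] for t in trials) / len(trials) for k in range(n)]
    evoked = power_at(average, freq_hz, srate_hz)
    total = sum(power_at(t, freq_hz, srate_hz) for t in trials) / len(trials)
    return evoked, total


if __name__ == "__main__":
    for label, jitter in (("phase-locked", 0.0), ("phase-jittered", math.pi)):
        evoked, total = evoked_and_total_power(simulate_trials(200, jitter))
        print(f"{label}: evoked={evoked:.4f} total={total:.4f}")
```

With phase-locked trials the two measures agree; with random phase the averaged waveform nearly cancels while single-trial power is unchanged, which is the information an ERP average alone would miss.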
The global hypothesis, based on the evidence presented above, is that synchrony of evoked oscillations in auditory cortices supports infant brain development. As a first step in testing this hypothesis, a “baseline” of processing in early infancy was obtained at 4 months of age, which is currently thought to precede “perceptual narrowing” for native language specialization25,26. Accordingly, we performed single-trial frequency analysis on infant EEG data recorded during passive listening to pitch-variant and pitch-invariant tone pairs presented in an “oddball paradigm” consisting of two rate conditions (Control condition: 300 msec inter-stimulus interval; Rapid condition: 70 msec inter-stimulus interval).
Here we illustrate this method using stimuli from studies focusing on rapid auditory processing. In these studies, an “oddball paradigm” was used to assess neuronal activity to unpredictable but recognizable events. In this paradigm, the brain responses to unpredictable or “odd” stimuli are often called “Deviant” responses, whereas the response to the predictable stimulus, presented most of the time, is usually called the “Standard” brain response. Responses to stimuli presented in an oddball paradigm can be automatically elicited without focused attention, making this paradigm easy to use with very young infants. All of the auditory stimuli are presented via free-field speakers at intervals that vary depending on the study. As mentioned previously, in the current study sounds that index rapid auditory processing (RAP) abilities were used: that is, sounds containing tens of milliseconds of acoustic change16,17,27,28. Many other stimulus types are also useful for testing neurophysiological discrimination, including consonant-vowel (CV) sounds as well as deviants reflecting changes in Frequency or Duration, with an interposed Gap, and/or ascending or descending frequency Sweeps. Finally, we also recommend recording spontaneous EEG during “quiet play” in which no auditory stimulus is presented. These data may then be used to measure oscillatory coupling and coherence in the absence of repeated stimulation.
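As an illustration of how an oddball sequence of tone pairs might be laid out, the following sketch generates a pseudo-random ordering of Standard and Deviant trials. The tone frequencies (800-800 Hz and 800-1,200 Hz) and the two within-pair ISIs are taken from this article; the deviant probability, the minimum number of Standards between Deviants, and all function names are assumptions made for the example and are not the parameters of the original study.

```python
import random

STANDARD = (800, 800)    # Hz, tone pair with invariant frequency (from the text)
DEVIANT = (800, 1200)    # Hz, tone pair with a frequency change (from the text)
WITHIN_PAIR_ISI_MS = {"control": 300, "rapid": 70}  # from the text


def oddball_sequence(n_trials, deviant_prob=0.15, min_standards_between=3, seed=0):
    """Pseudo-random STD/DEV labels.

    deviant_prob and min_standards_between are assumed values. Because a
    Deviant is only allowed after enough Standards, the realized Deviant
    rate is lower than deviant_prob.
    """
    rng = random.Random(seed)
    sequence = []
    standards_since_deviant = 0
    for _ in range(n_trials):
        can_deviate = standards_since_deviant >= min_standards_between
        if can_deviate and rng.random() < deviant_prob:
            sequence.append("DEV")
            standards_since_deviant = 0
        else:
            sequence.append("STD")
            standards_since_deviant += 1
    return sequence


def build_trials(condition, n_trials, **kwargs):
    """Attach tone frequencies and the within-pair ISI to each trial label."""
    isi_ms = WITHIN_PAIR_ISI_MS[condition]
    return [
        {
            "type": label,
            "tones_hz": DEVIANT if label == "DEV" else STANDARD,
            "isi_ms": isi_ms,
        }
        for label in oddball_sequence(n_trials, **kwargs)
    ]


if __name__ == "__main__":
    trials = build_trials("rapid", 20)
    print(" ".join(t["type"] for t in trials))
```

Fixing the seed makes the sequence reproducible across sessions; a presentation package would consume a list like this rather than generate the order at run time.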
Recording EEG activity from an infant population poses a set of unique challenges. For example, obtaining cooperation with placement of the electrodes and leaving them in place for the duration of the experiment, minimizing movement to prevent EEG artifacts, and keeping the baby engaged and distracted with silent toys all represent challenges. Additionally, infant data do not easily lend themselves to straightforward applications of protocols developed with adult or older-child data. In many instances, the relationship between components observed in infant EEG and event-related potentials (ERPs) is not as clear-cut, nor does it always map onto what is accepted in the adult. While developmental research holds a powerful potential for understanding the genesis of typical and disordered brain function, recording reliable and interpretable brain responses from human infants requires a high level of proficiency in both technical and interpersonal realms. These challenges, however, can be overcome, and reliable EEG and ERP data can be recorded from infants of different ages using a variety of paradigms. Here we describe a general method of analysis utilizing commercially available ERP recording and analysis software in combination with a free, open-source ERP analysis package that works in the MATLAB environment29.
The application of oscillatory analysis methods to infant brain response recordings allows exploration of more mechanistic questions of neuronal synchrony development in relation to language acquisition and putative underlying mechanisms when that synchrony is compromised. Related efforts using other stimuli, such as speech syllables24, and analysis of spontaneous or “resting” oscillations1 in longitudinal analyses or in combination with early training paradigms, offer windows into the temporal, spatial, and spectral dynamics of typical and disordered developmental trajectories. It is hoped that these efforts will increase our understanding of the bases of auditory development and plasticity, and aid in identification and remediation strategies for developmental language disorders.
All work with human subjects requires Institutional Review Board approval and oversight. Methods reported here, when used in research, have been reviewed and approved by the Human Subjects Protection Program through the Rutgers Arts and Sciences Institutional Review Board (IRB).
1. Preparation
2. Net Application
3. Stimuli Presentation and EEG Recording
4. Data Processing – ERPs
Visually inspect the raw EEG data and reject segments with high-amplitude artifact.
NOTE: Reject channels with high amplitude and interpolate. Maximum percent of rejected channels should be set at 30%. Alternative methods (e.g., ICA, PCA, see also reference 30) may be employed to reduce or reject artifacts present in the data.
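The 30% ceiling on rejected channels can be expressed as a simple screening rule. The sketch below is a minimal illustration rather than the procedure built into the recording software: the 200 µV amplitude limit and the function names are assumptions, and interpolation of the flagged channels is not implemented here.

```python
def bad_channels(channel_data, amplitude_limit_uv=200.0):
    """Indices of channels whose peak absolute amplitude exceeds the limit.

    amplitude_limit_uv is an assumed threshold, not a value from the protocol.
    """
    return [
        index
        for index, samples in enumerate(channel_data)
        if max(abs(s) for s in samples) > amplitude_limit_uv
    ]


def screen_recording(channel_data, amplitude_limit_uv=200.0, max_bad_fraction=0.30):
    """Flag bad channels and report whether the recording stays within the
    30% ceiling on rejected channels given in the protocol note."""
    bad = bad_channels(channel_data, amplitude_limit_uv)
    fraction = len(bad) / len(channel_data)
    return {"bad": bad, "fraction": fraction, "usable": fraction <= max_bad_fraction}


if __name__ == "__main__":
    clean = [[10.0, -20.0, 15.0] for _ in range(8)]
    noisy = [[10.0, 450.0, -5.0] for _ in range(2)]
    print(screen_recording(clean + noisy))
```

Channels listed under "bad" would then be interpolated from their neighbors, provided the recording as a whole is still marked usable.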
5. Data Processing – Source Localization
For infant data, co-register each individual and grand average ERP file with either an age-appropriate MR template or an individual MR scan (refer to previous publications 31,32).
NOTE: In the co-registration process, the electrode positions and reconstructed head are registered into a single coordinate system. Grand averages may be used to define the dipole model.
Estimate the number and location of underlying sources to be fitted to the data. For an auditory paradigm, use two dipoles with free location and rotation.
NOTE: Source estimation is then automatically guided through a minimization of a cost function that is a weighted combination of 4 residual fit criteria to obtain the “best fit” location to the time window of interest.
6. Data Processing – Time-Frequency Analysis in Source Space
Infant Event-related Potentials
Infant ERPs are generally larger than adult ERPs, and may have fewer or more peaks of activation, relative to mature responses, depending on the age 44. Here, we show representative Grand Average responses from twenty-three 4-month-old infants 43 (Figure 2). The oddball paradigm allows us to determine whether the infant’s brain can recognize the difference between two events. In the representative results, the tone-variant, deviant response (DEV, 800-1,200 Hz, red line) elicits an additional peak of activation, relative to the invariant tone pairs (STD, 800-800 Hz, black line). This finding is apparent in both Control rate (300 msec ISI, left) and Rapid Rate (70 msec ISI, right) conditions. Example responses from electrodes Fz (Frontal midline), C3 (Central, left) and C4 (Central, right) are shown. The computed difference wave (Deviant minus Standard) is also shown in gray lines. The additional peak of activation suggests that the infant brain at this age can discriminate the difference between the tones at both presentation rates.
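The difference wave is a point-by-point subtraction of the Standard average from the Deviant average. A minimal sketch is given below, assuming both averages are sampled on the same time axis; the function names, the toy waveforms, and the search window are illustrative and do not reproduce the values in Figure 2.

```python
def difference_wave(deviant, standard):
    """Deviant minus Standard, sample by sample."""
    if len(deviant) != len(standard):
        raise ValueError("averages must have the same number of samples")
    return [d - s for d, s in zip(deviant, standard)]


def peak_latency_ms(wave, srate_hz, epoch_start_ms, window_ms):
    """Latency (ms) of the largest absolute deflection inside window_ms.

    window_ms is a (start, end) tuple relative to stimulus onset; the choice
    of window is left to the analyst.
    """
    step_ms = 1000.0 / srate_hz
    start, end = window_ms
    candidates = [
        (abs(value), epoch_start_ms + k * step_ms)
        for k, value in enumerate(wave)
        if start <= epoch_start_ms + k * step_ms <= end
    ]
    if not candidates:
        raise ValueError("window does not overlap the epoch")
    return max(candidates)[1]


if __name__ == "__main__":
    standard = [0.0] * 10                      # toy Standard average
    deviant = [0, 0, 0, 1, 3, 1, 0, 0, 0, 0]   # toy Deviant average
    diff = difference_wave(deviant, standard)
    print(peak_latency_ms(diff, srate_hz=250, epoch_start_ms=-8, window_ms=(0, 20)))
```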
Infant source waveforms
Source activity with little residual variance should follow the ERP peaks, signifying a “good fit” between the original data and the source localized transformed data. In the representative data, we show the location of the two-dipole best fit source model of the infant grand average ERP to the STD (tone-invariant) condition over the CLARA distributed model (Figure 3). The computation clearly shows left and right auditory activation in Control and Rapid Rate conditions.
Peaks of activity from the two-dipole model (Figure 4) corresponded to the ERP response very well. The peak timing and morphology of the ERP waveforms, shown in panel (i), match the timing and morphology of the source waveforms shown in panel (ii) (for more details, see original article, 43). Source waveforms from this experiment explained 97.9% of the variance in activity over the scalp electrodes. Statistical analysis of the source peak latencies showed that right hemisphere activity was faster than the left in both conditions, and responses in the rapid rate were later in both hemispheres than in the control condition. Hemispheric differences were not observed using the ERP data, suggesting that the source localization techniques enabled the retrieval of additional information from the responses.
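Explained variance of this kind is commonly reported as the complement of residual variance, so 97.9% explained corresponds to 2.1% residual. The sketch below shows one conventional way to compute it, assuming the measured scalp data and the data predicted by the source model are available as channels-by-samples lists; the exact definition used by a given commercial package may differ, and the function names are illustrative.

```python
def residual_variance_percent(measured, modeled):
    """Percent of the measured signal energy not reproduced by the model.

    measured, modeled: lists of channels, each a list of samples, with
    identical shapes.
    """
    residual = 0.0
    total = 0.0
    for measured_channel, modeled_channel in zip(measured, modeled):
        for m, p in zip(measured_channel, modeled_channel):
            residual += (m - p) ** 2
            total += m ** 2
    return 100.0 * residual / total


def explained_variance_percent(measured, modeled):
    """Complement of the residual variance."""
    return 100.0 - residual_variance_percent(measured, modeled)


if __name__ == "__main__":
    measured = [[1.0, 2.0], [3.0, 4.0]]   # toy data: 2 channels x 2 samples
    modeled = [[1.0, 2.0], [3.0, 3.0]]
    print(round(explained_variance_percent(measured, modeled), 2))
```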
Infant Event-related Oscillations
In general, time-frequency analyses of adult and animal data show that stimuli evoke a 1/f pattern of neuronal synchrony (i.e., decreasing power with increasing frequency). In the representative data, evoked by auditory tone pairs, we show that infants also express this pattern (Figure 5). Here, stimulus onset elicits synchronous bursts of theta (5-6 Hz), beta (20-25 Hz) and gamma (35-45 Hz) power in both right and left auditory regions of the brain.
Animal models and adult experiments suggest that oscillatory synchrony, and in particular low- to mid-frequency oscillations (e.g., 1-8 Hz), are major contributors to evoked potentials 45. Analysis of instantaneous power shifts (Temporal Spectral Evolution, TSE) in infant oscillations from our previous publication 43 showed greater induced power to the variant tone in the theta band (6-8 Hz), relative to the invariant tone. This effect was observed in both rate conditions, particularly over the right auditory region in the Control rate condition (Figure 6). Rapid rate presentation yielded more bilaterally symmetrical activity, suggesting enhanced left cortical involvement during auditory processing of rapidly occurring stimuli and in particular during acoustic change processing.
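TSE values are expressed as percent change in power relative to a baseline interval. The sketch below illustrates that normalization on single trials, assuming a simple complex demodulation with a moving-average low-pass filter to estimate the power envelope at one frequency. The filter width, baseline length, and function names are assumptions for the example; this is not the exact algorithm of the commercial analysis software.

```python
import cmath
import math


def demodulated_power(trial, freq_hz, srate_hz, half_width=20):
    """Power envelope at freq_hz: shift the band of interest to 0 Hz, then
    smooth with a moving average of up to 2 * half_width + 1 samples."""
    shifted = [
        x * cmath.exp(-2j * math.pi * freq_hz * k / srate_hz)
        for k, x in enumerate(trial)
    ]
    n = len(shifted)
    power = []
    for k in range(n):
        lo = max(0, k - half_width)
        hi = min(n, k + half_width + 1)
        mean = sum(shifted[lo:hi]) / (hi - lo)
        power.append(abs(mean) ** 2)
    return power


def tse_percent(trials, freq_hz, srate_hz, baseline_samples, half_width=20):
    """Average single-trial power, then express it as percent change from the
    mean power of the first baseline_samples samples."""
    n = len(trials[0])
    mean_power = [0.0] * n
    for trial in trials:
        power = demodulated_power(trial, freq_hz, srate_hz, half_width)
        for k in range(n):
            mean_power[k] += power[k] / len(trials)
    baseline = sum(mean_power[:baseline_samples]) / baseline_samples
    return [100.0 * (p - baseline) / baseline for p in mean_power]


def toy_trial(phase, srate_hz=250.0, freq_hz=6.0, n=500):
    """Toy data: the 6 Hz amplitude doubles halfway through the epoch."""
    return [
        (1.0 if k < n // 2 else 2.0)
        * math.sin(2 * math.pi * freq_hz * k / srate_hz + phase)
        for k in range(n)
    ]


if __name__ == "__main__":
    tse = tse_percent([toy_trial(0.0), toy_trial(1.0)], 6.0, 250.0, baseline_samples=200)
    print(round(tse[100]), round(tse[400]))
```

Because power is computed on each trial before averaging, the measure is insensitive to trial-to-trial phase differences, which is why it captures induced as well as evoked activity. In the toy data, doubling the amplitude gives roughly a fourfold power increase over baseline.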
Figure 1. Steps of time-frequency analysis. Time-frequency analysis method is illustrated using grand average (n = 12) data from 4-month-old infants during the 70 msec ISI tone condition. Stimulus onsets are shown in red arrows beneath the time axis. Steps of analysis: (1) Averaged ERPs, shown in Cz electrode, are created for each channel. (2) Source location of ERP generators, shown in a sketch head, is obtained by using a 2-dipole model in data mapped onto an infant MRI template. (3) Individual and grand average source waveforms are obtained from the fit of the Left and Right dipoles. Infant head models show the voltage maps corresponding to the selected peak (in gray). (4) The source montage is applied to the 128 channel scalp data, and amplitudes are computed and saved for the two source channels. (5) Event related oscillations are calculated from single-trials and averaged over the response period.
Figure 2. Event-related potential morphology. Grand Averages (n = 23) to Rapid (70 msec ISI) and Control (300 msec ISI) rate responses to standard (STD, black lines) and deviant (DEV, red lines) tone pairs are shown in frontal midline and central left and right electrodes. Negativity is plotted up. Stimulus onsets are shown in red arrows beneath the time axis at Fz. P1 is shown in the Fz panel with a black arrow. The difference wave (response to DEV minus response to STD) is shown in gray lines (Adapted from 43).
Figure 3. Source localization results. Two-dipole “best fit” source model is shown overlaid on distributed activity from the source model. Clear left and right activity can be seen over left and right temporal lobe regions. (Adapted from 43).
Figure 4. Event-related Potential and Source Waveform Comparison. (i) Example ERPs from frontal left and right electrodes (F3 and F4) show peaks of activation to tone pairs with invariant and variant fundamental frequencies (STD and DEV, respectively). A change in frequency elicits larger peaks at ~400 msec (DEV, red line), relative to when frequencies are unchanged (STD, black). (ii) The latency of peaks of activation is similar for the source-localized dipole activity, suggesting a good match between ERP and source waveform analysis. The large peak at 400 msec is particularly noticeable in the right hemisphere with the source-localized data. For simplicity, only the responses to the Rapid Rate condition are shown; however, a similar match was also observed between ERP and source waveforms for the responses in the Control Rate condition.
Figure 5. Pooled TSE maps are expressed in terms of percent spectral change over an epoch of -1 to 1 sec of time for left and right generators. (i) Tones in the 300 msec ISI condition elicit event-related oscillations in coherent frequency bands around stimulus onset (e.g., -1,140 msec and 0 msec). A long stimulus epoch is used in order to visualize more of the data and to provide a long enough sample for frequency decomposition. Right panel shows the average spectrum over the initial processing peak (150-300 msec). The average spectrum shows an overall 1/f spectrum with discrete peaks of synchrony at specific frequency bands. (ii) A similar pattern is observed for the 70 msec ISI condition.
Figure 6. Time-frequency analysis of event-related oscillations in 4-month-old infants. Change in oscillatory power is shown in Temporal Spectral Evolution (TSE) grand average plots for 4-month-old infants in the Control (A) and Rapid Rate (B) conditions. Black bars on the x-axis illustrate tone onset and durations. Left and Right source activity is indicated in the top left corner of each graph. First row: (i) Responses to tone pairs with invariant frequency (STD) show power changes in the delta-theta range. Middle row: (ii) Responses to tone pairs with a frequency change in the second tone (DEV) show enhanced delta-theta power at the second tone, relative to STD responses, particularly in the Right auditory region in the Control condition. Third row: (iii) Difference plots between STD and DEV responses show a right lateralized increase in power in the Control Rate (A.iii) and bilateral power difference in the Rapid Rate (B.iii). Significant differences between STD and DEV response in the time-frequency domain are shown in black outline. (Adapted from 43).
The research method described here facilitates a deeper understanding of the spectrotemporal dynamics and anatomical location of high-density auditory-evoked EEG and ERP brain responses in infants. There are four critical steps within this protocol that facilitate analysis. First, proper net application and positioning with minimal caregiver and infant distress is the foundation for recording clean EEG in non-sedated paradigms. Proper head measurement and net size selection, as well as the use of a net assistant and entertainer during the application process, are key to accomplishing this step. Second, it is important to establish a calm, quiet and playful atmosphere for the family during the testing session, a condition facilitated by the primary tester, net assistant and the entertainer, who engages the infant in quiet play. Third, for data analysis, it is critical that age-appropriate MRI head models be used for source localization. The head size, bone, skin and cerebrospinal space must be accurate for the age tested in order to obtain the most precise localization results. Finally, for cortical responses in general, it is also critical that a high-density net be used (e.g., at least 64 channels of data) in order to optimize the chances of obtaining low-artifact recordings.
One limitation of this technique is that source localization of EEG data is not the gold standard for identifying the site of activity. One must keep in mind that the forward model of localization, even with the best head models and measurements, still yields only estimates of activity location. Therefore, it is essential to design the experiment in such a way that information regarding source activity may be compared across experimental conditions or groups. In addition, infant testing in general, and longitudinal study in particular, may be fraught with incomplete or missing data sets. Solutions to this problem are to a) maintain relationships with participating families; b) optimize a quiet, calm recording atmosphere for the infant and caregiver; and c) overestimate the subject pool. In our hands, with an experienced pediatric team, we have attained low dropout and minimal data loss rates. In a longitudinal sample of 211 infant recording sessions with 57 participants, we show 98.6% data retention (i.e., 208 sessions that resulted in usable data) and a 10% dropout rate (i.e., 6 participants were unable to continue after beginning the experiment). An advantage of EEG over other techniques, such as MEG and NIRS, is that subcortically biased activity is accessible with different filter bands. In addition, it is easier to control for movement, as the electrodes travel with the head.
Once this protocol is mastered, the experimental applications of infant EEG and oscillatory dynamics are abundant. It is clear that we must first understand typically developing cortical networks in order to identify those that are atypically organized. This suggests the need for the creation of a model in which the integrity of early auditory processing mechanisms (including oscillations) plays a role in the generation and plasticity of sound representation as auditory experiences are incorporated and, ideally, learned. According to this model, nonlinguistic processing deficits may be associated with symptoms years, or in some cases decades, before formal diagnosis occurs.
Future investigations are needed to understand further details, including the function of frequency-band-specific oscillatory dynamics, cross-frequency phase coupling and regional inhibitory/excitatory patterns across early development. In addition, subcortical activity and testing in different states, such as sleep, are needed to give a more complete picture of typical development. We believe research with this technique will provide important insight into the process by which 'neurotypical' and atypical oscillatory dynamics organize and interact with emerging cognitive and language abilities.
The authors have nothing to disclose.
The authors gratefully acknowledge support for this research by the Elizabeth H. Solomon Center for Neurodevelopmental Research and NSF grant #SMA-1041755 to the Temporal Dynamics of Learning Center, an NSF Science of Learning Center. Special thanks are also due to the families who participated, and to the members of the Infancy Studies Laboratory for their practical and intellectual contributions. Special thanks to Jarmo Hämäläinen for development of the source localization protocol and to Naseem Choudhury for her intellectual input.
Name | Company | Catalog Number | Comments
EEG Amplifiers | EGI | 1301281 |
Sensor Nets | EGI | C-GSN-128-1011-110 | Sizes of nets vary with age, by month
EEG Recording Software | Net Station | 4604200 |
Presentation Computer | Dell | 4608161 |
Presentation Software | Eprime | 13102456-50 |
Baby bottle warmer | Avent | Target or any baby store |
Electrolyte solution (Potassium Chloride dry) | EGI | A-A-CC-KLL-1000-000 |
Coban self-adherent wrap tape | Coban | 595573 |
Measuring tape | Target or any baby store | |
Washable Markers | Target or any baby store | |
Pipettes | Comes with EGI amplifier setup | |
Analysis Computer | Dell | |
Analysis Software I | BESA | 3955054 | v5.3
Analysis Software II | Brain Voyager | 3955054 |
Analysis Software III | EEGLAB/ERPLAB/Mass Univariate Toolbox | Freeware MATLAB v2007b |
Analysis Software IV | BESA Statistics | 3956341 |